Projects

Spam Filter.

Implemented a Naive Bayes classifier to detect email spam, modeling features with Bernoulli random variables, Gaussian random variables, and the histogram method. Trained on a labeled data set to estimate per-feature probabilities for spam and non-spam email, then used them to predict spam on the test data.
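
As an illustration of the Bernoulli variant only, here is a minimal sketch and not the project's actual code. It assumes binary word-presence features and Laplace smoothing; the function names, the `alpha` parameter, and the toy data are all hypothetical.

```python
import math

def train_bernoulli_nb(X, y, alpha=1.0):
    """Fit a Bernoulli Naive Bayes model.

    X is a list of binary feature vectors, y a list of labels
    (1 = spam, 0 = not spam). Both classes must appear in y.
    """
    n_features = len(X[0])
    model = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        log_prior = math.log(len(rows) / len(X))
        # Laplace-smoothed estimate of P(feature present | class).
        probs = [
            (sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
            for j in range(n_features)
        ]
        model[label] = (log_prior, probs)
    return model

def predict(model, x):
    """Return the label with the highest log-posterior for feature vector x."""
    scores = {}
    for label, (log_prior, probs) in model.items():
        score = log_prior
        for xj, p in zip(x, probs):
            score += math.log(p if xj else 1.0 - p)
        scores[label] = score
    return max(scores, key=scores.get)

# Toy data (invented): columns could stand for "free", "offer", "meeting".
X_train = [[1, 1, 0], [1, 0, 0], [0, 0, 1], [0, 1, 1]]
y_train = [1, 1, 0, 0]
model = train_bernoulli_nb(X_train, y_train)
```

Working in log space avoids underflow when many small probabilities are multiplied. The Gaussian and histogram variants would replace only the per-feature likelihood term.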

Implemented the PageRank Algorithm on a Web Crawl of 2 million Web Pages.

In this project I implemented the PageRank algorithm in Python, computed the PageRank of 183811 web pages, and briefly analyzed the results.
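
The iteration can be sketched roughly as follows. This is an illustrative version, not the project's code: it assumes the graph is given as an in-link map, uses a damping factor of 0.85, and spreads the rank of pages with no out-links evenly over all pages. The function name and parameters are hypothetical.

```python
def pagerank(inlinks, damping=0.85, tol=1e-10, max_iter=200):
    """Compute PageRank by power iteration.

    inlinks maps each page to the list of pages that link to it.
    Returns a dict mapping page -> rank, with ranks summing to 1.
    """
    pages = set(inlinks)
    for sources in inlinks.values():
        pages.update(sources)
    n = len(pages)

    # Out-degree of each page, derived from the in-link lists.
    out_degree = {p: 0 for p in pages}
    for sources in inlinks.values():
        for s in sources:
            out_degree[s] += 1
    sinks = [p for p in pages if out_degree[p] == 0]

    rank = {p: 1.0 / n for p in pages}
    for _ in range(max_iter):
        # Rank held by sink pages is redistributed uniformly.
        sink_mass = sum(rank[p] for p in sinks)
        new_rank = {}
        for p in pages:
            incoming = sum(rank[q] / out_degree[q] for q in inlinks.get(p, ()))
            new_rank[p] = (1 - damping) / n + damping * (sink_mass / n + incoming)
        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank

# Tiny invented graph: B and C both link to A, and A links to B and C.
ranks = pagerank({"A": ["B", "C"], "B": ["A"], "C": ["A"]})
```

In this toy graph page A ends up with the highest rank because it receives links from both other pages.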

Search Engine running on The Lemur Project Database.

Developed a search engine using the vector space model, a language model, and BM25, and analyzed the results to determine which model performs better for different types of queries.
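
For reference, Okapi BM25 scoring can be sketched as below. This is a generic illustration rather than the project's implementation: it uses the common non-negative IDF variant, conventional defaults `k1=1.2` and `b=0.75`, and pre-tokenized in-memory documents instead of the Lemur index. All names are hypothetical.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document in docs against the tokenized query."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs

    # Document frequency: number of documents containing each term.
    doc_freq = Counter()
    for d in docs:
        doc_freq.update(set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            df = doc_freq[term]
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            # Term frequency saturates with k1 and is normalized by length via b.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

# Invented toy corpus.
docs = [["cheap", "pills", "cheap"], ["meeting", "notes"], ["cheap", "flights"]]
scores = bm25_scores(["cheap"], docs)
```

The first document scores highest here because it contains the query term twice, while the second scores zero because it does not contain the term at all.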

Created a VPN tunnel using SSH and TUN/TAP.

In this project I created a tunnel between machine 1 and machine 3 using a TUN interface. Machine 3 sent data destined for machine 2 through the tunnel; machine 1 intercepted it and forwarded it to machine 2 using raw sockets. Machine 1 then received the reply from machine 2 and sent it back through the tunnel to machine 3.
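
One small step in a relay like this is reading the source and destination addresses from each packet that comes off the tunnel, so the forwarding machine knows where to send it. The following is a hedged sketch of that step only, assuming packets arrive as bare IPv4 datagrams; the function name is hypothetical and this is not the project's code.

```python
import socket
import struct

def ipv4_addresses(packet):
    """Return (source, destination) as dotted-quad strings.

    packet is a bytes object holding a raw IPv4 datagram, starting
    at the IP header with no link-layer framing in front of it.
    """
    if len(packet) < 20 or packet[0] >> 4 != 4:
        raise ValueError("not an IPv4 packet")
    # Bytes 12..15 hold the source address, bytes 16..19 the destination.
    src, dst = struct.unpack("!4s4s", packet[12:20])
    return socket.inet_ntoa(src), socket.inet_ntoa(dst)
```

A forwarder would compare the returned destination against the address of the target machine before writing the packet to a raw socket.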

Speaker Identification Using MFCC and Vector Quantization.

Kekre's Median Codebook Generation (KMCG) algorithm had been used efficiently for image processing but had not been tried on audio signals. We designed a system that collected voice samples from speakers and used KMCG to generate a codebook of their voice characteristics. Once the codebook was generated, the system compared a user's voice signal against it, performing feature matching with Euclidean distance. We used MFCC to extract features from the voice samples and Vector Quantization to build the codebook and match vectors.
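
The matching step described above can be sketched as follows. This is a simplified illustration, not the system's code: it assumes the MFCC feature vectors and per-speaker codebooks already exist, and it covers neither MFCC extraction nor KMCG itself. The function names and sample values are invented.

```python
import math

def distortion(features, codebook):
    """Average Euclidean distance from each feature vector to its nearest codeword."""
    total = 0.0
    for f in features:
        total += min(math.dist(f, c) for c in codebook)
    return total / len(features)

def identify_speaker(features, codebooks):
    """Return the speaker whose codebook gives the lowest average distortion.

    codebooks maps a speaker name to a list of codeword vectors.
    """
    return min(codebooks, key=lambda name: distortion(features, codebooks[name]))

# Invented two-dimensional codebooks; real MFCC vectors have more dimensions.
codebooks = {
    "alice": [(0.0, 0.0), (1.0, 1.0)],
    "bob": [(5.0, 5.0), (6.0, 6.0)],
}
```

With these values, `identify_speaker([(0.9, 1.1), (0.1, -0.1)], codebooks)` selects "alice", since both vectors lie close to her codewords.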

Activities

Software Developer Co-op at Empirix.

  1. Developed the "Anonymization" feature for the E-XMS product in Java. Participated in the analysis and design of the feature and wrote the code.
  2. Fixed bugs for the E-XMS 4.5 release.
  3. Undertook a skunkworks project to port the application from Tomcat 5.5 to the TomEE Plus web server.

Summer Intern at Analog Devices Inc.

  1. Developed and tested a WCF Web Service to scan files before uploading them to the server.
  2. Enhanced a tool used in the Content Management System, adding new features and fixing existing bugs using ASP.NET, JavaScript and Telerik.
  3. Created a website for generating reports for users using ASP.NET and Microsoft SQL Server.

Submitted research paper at ICCICT, 2012 on Speaker Recognition using Vector Quantization by MFCC and KMCG clustering algorithm.

Submitted research paper at ICWET, 2011 on Cloud Computing and Windows Azure.