Innovative and motivated data and software engineer with two years of experience in object-oriented design, machine learning, and Big Data technologies, and deep knowledge of data structures and algorithms. Currently looking for full-time opportunities as a
Data Engineer/Data Scientist/Software Engineer.
Education
New York University, TANDON SCHOOL OF ENGINEERING, BROOKLYN, NY Aug 2016 – May 2018
- Master of Science in Computer Science (GPA 3.633)
SRM University, NCR CAMPUS, INDIA Aug 2011 - May 2015
- Bachelor of Technology in Computer Science (GPA 3.4)
Technical Skills
- Languages : Python, C, C++, Java, SQL, HTML5, CSS3, Prolog, R, Shell, Apache Spark, Pig
- Frameworks : Django
- Database : MongoDB, PostgreSQL, Cassandra, SQLite, MySQL
- DevOps : Jenkins, Docker
- Tools & Methods : Linux, Agile, Scrum, RESTful APIs, Elasticsearch, Microsoft Suite, Oracle 9i, IntelliJ, Scikit-learn, Jupyter, Visio, Git, TensorFlow, MySQL Workbench, AWS, Kafka, RabbitMQ, ZeroMQ
Professional Experience
MTA New York City Transit, New York, USA
SYSTEM & DATA SCIENCE RESEARCH INTERN SEP 2017 – Present
- Built an API in Python that calculates performance metrics and ridership estimates from aggregated data
- Writing and maintaining ETL scripts
- Implementing Kafka to build real-time data pipelines and streaming applications that collect data from multiple ZeroMQ sources
- Built classifiers to predict future performance metrics using machine learning classification algorithms
- Performed statistical analysis to gain insight into the performance metrics
HCL TECHNOLOGIES, NOIDA, INDIA
SOFTWARE ENGINEER INTERN OCT 2015 - APRIL 2016
- Performed bug tracking in C++
- Designed and developed software for operating systems
- Clarified ill-defined software requirements
- Delivered various requirements using the Scrum methodology
Projects
iNEWS SEP 2017 - DEC 2017
- Mobile application that uses a News API to fetch the latest news from sources including CNN and the New York Times
and deliver it to the user, whether or not the user is connected to the Internet
- Sends a push notification to the user every day at 9 AM EST to wake the app so that the latest news can be downloaded
- Developed the application backend using Amazon Web Services (Lambda, SQS, SNS, and DynamoDB) to
retrieve unique news items from the sources; also developed APIs to retrieve and search news in AWS Elasticsearch
- Provided two-factor authentication for security using AWS Cognito
ANALYZING NYPD COMPLAINT DATA (HISTORIC) July 2017 – Aug 2017
- Analyzed NYPD Complaint data to uncover hidden patterns, unknown correlations, crime trends and other anomalies
- Generated hypotheses from the findings by correlating them with weather, census, and employment datasets
- Used Spark scripts, MapReduce, and SQL queries to clean the data
- Used Python libraries such as seaborn, matplotlib, bokeh, and folium for data visualization
YELP DATASET ANALYSIS June 2017 – July 2017
- Computed basic statistics, such as summaries of reviews by city and category and ratings of businesses around the University of
Wisconsin-Madison based on number of reviews, by executing scripts written in Pig Latin and Apache Spark
- Performed port mapping to run the Hue web interface in a local browser by setting up the Cloudera QuickStart container in Docker
- Created visualizations in Tableau based on the output of the scripts
CLASSIFIERS TO IDENTIFY TWITTER ACCOUNTS AS BOTS OR NOT BOTS Jan 2017 – April 2017
- Built four classifiers (Multinomial Naive Bayes, Decision Trees, Logistic Regression, and Random Forest) using Python
libraries, including pandas, to predict whether Twitter accounts are bots
- Compared the accuracy of the classifiers we built
ANALYZING TRAITS SHARED BETWEEN TWO TWITTER USERS April 2017 – May 2017
- Built a fully functional, Watson-powered application using Python to interact with the Twitter API and IBM's Personality Insights API in order to analyze traits shared between two Twitter users
- Displayed the top 5 personality traits shared between two Twitter users