Neeraj Chaudhari

MS in Data Science, Rutgers University

New Brunswick, NJ

I'm an MS in Data Science student at Rutgers University focused on turning data into actionable insights. My work spans data analysis, machine learning, and data engineering: I build end-to-end solutions, from ETL pipelines and predictive models to interactive dashboards. I'm driven by the challenge of turning complex data into business value through analytics and scalable infrastructure.

Technical Skills


Programming

Python SQL R

Machine Learning & AI

Scikit-Learn TensorFlow PyTorch Statistical Analysis EDA

Data Engineering & Big Data

AWS PySpark ETL Data Pipelines Cloud Computing

Data Visualization & BI

Pandas NumPy Matplotlib Seaborn Tableau Power BI Excel Streamlit

Web & App Development

Flask React.js

Tools & Developer Environment

Git Docker Jupyter Notebook VS Code

Projects


Clarity: AI-Powered Data Quality Platform

Python Streamlit Great Expectations Docker

Developed a full-stack data quality platform to automate data cleaning, profiling, and validation for enterprise datasets. Led the design and implementation of an AI-powered fuzzy duplicate detection system and integrated Great Expectations for robust validation. The platform reduced data cleaning time by 60% and improved data quality scores by 25% for client datasets, enabling teams to make faster, more reliable business decisions.

View on GitHub

Fast Tweet Search Application

Python Couchbase PostgreSQL Streamlit

Engineered a scalable system for processing and storing over 1 million tweets and 500,000 user records, using Couchbase and PostgreSQL for storage and Streamlit for an interactive search interface. Implemented data retrieval optimizations that reduced query latency by 70% and enabled real-time analytics on large-scale datasets.

View on GitHub

Causal Impact of Growth-Mindset

R Python Statistical Analysis Causal Inference

Analyzed data for over 10,000 students to evaluate a growth-mindset intervention. Applied OLS regression and propensity score analysis to estimate causal effects on academic achievement, demonstrating a 0.41 standard deviation improvement. The project’s results informed policy recommendations and were presented at an academic conference, showcasing the impact of data-driven evaluation in education.

View on GitHub

Experience


Data Analyst Intern

The Head Story | Jun. 2021 – Nov. 2021

SQL Python Power BI Data Analysis
  • Developed and automated Power BI dashboards and weekly reporting pipelines, reducing manual effort by 40% and enabling real-time tracking of marketing and product KPIs.
  • Analyzed user acquisition, conversion funnels, and retention using SQL and Python, producing insights that directly influenced business strategy and campaign optimization.
  • Collaborated with cross-functional teams to define requirements, streamline data workflows, and deliver training that improved dashboard adoption by 30% and ensured smooth onboarding for future analysts.

Education


Master of Science in Data Science

Rutgers University – New Brunswick

Sep. 2023 – May 2025

Relevant Coursework: Probability and Statistical Inference, Financial Data Mining, Database Management, Natural Language Processing, Data Structures and Algorithms, Statistical Modeling, Regression and Time Series Analysis, Data Analysis and Visualization.

BTech in Electronics and Telecommunication

KJ Somaiya College of Engineering

Aug. 2019 – May 2023

Get In Touch


I'm currently seeking new opportunities and am open to messages. Whether you have a question or just want to say hi, I'll try my best to get back to you!