I am a skilled data analyst with expertise in Python, Numpy, Pandas, Matplotlib, Seaborn, Scipy.Stats, Scikit-Learn, PyCharm, Bayes' Theorem, Central Limit Theorem, Hypotheses Testing, Feature Engineering, MS Excel, SQL, Tableau, Power BI, and GitHub. I hold a B.Tech degree from Delhi Technical Campus and am currently pursuing Scaler Academy's Data Science and Machine Learning Program. Additionally, I have earned certifications in Data Analysis with Python, SQL for Data Science, and Fundamentals of Visualization with Tableau.
Projects:
Throughout my journey, I have worked on several impactful projects, showcasing my analytical prowess and ability to derive valuable insights from data:
1. Aerofit - Building Customer Profile:
I identified characteristics of the target audience for each treadmill type offered by the company. Using Matplotlib and Seaborn, I performed visual analysis to build customer profiles based on factors like marital status and age, analyzing their impact on product purchases. Moreover, I calculated marginal and conditional probabilities using two-way contingency tables and explored correlations among different factors using heat maps.
2. Walmart - Analyzing Customer Purchase Behavior:
In this project, I analyzed customer purchase behavior based on gender and other factors. I tracked the average amount spent by customers and used the Central Limit Theorem to find confidence intervals for population averages. I also explored the effects of changing sample sizes and interval widths on expense distribution and checked for overlapping confidence intervals.
3. Yulu - Factors Affecting Demand for Electric Cycles:
For Yulu, I investigated the factors influencing demand for shared electric cycles in the Indian market. I checked assumptions for hypotheses tests, such as Normality and Equal Variance, and conducted 2-sample T tests, ANOVA, and Chi-square tests to examine the impact of working days, weather, and seasons on the number of cycles rented.
4. Delhivery - Data Analysis and Visualization:
In this project, I processed data and gained insights to facilitate building forecasting models. Pre-processing involved condensing data based on a single trip and extracting features like city and state. I compared and visualized time and distance fields, conducting hypotheses tests and visual analysis between actual and engine-generated values. Additionally, I performed one-hot encoding for categorical variables and normalized/standardized numerical features using MinMaxScaler or StandardScaler.
5. Target - eCommerce Data Analysis through SQL:
I curated business insights from eCommerce data using SQL, focusing on sales trends, order frequency, and customer distribution from different states and regions. My analysis covered months from January to August, and I made use of tools like BigQuery extensively.
6. Tesla and GameStop - Extracting and Visualizing Stock Data:
Extracting essential data from datasets, I displayed stock data using yfinance and webscraping. By plotting stock graphs, I enabled informed decision-making based on the data.
7. Analysis of HR Dataset through SQL:
For HR dataset analysis, I utilized SQL's group by and aggregate functions to gather information from employees of different departments and locations. Additionally, I employed joins, window functions, and date-time functions for in-depth analysis.
8. Analysis of Farmer's Market Dataset through SQL:
In this project, I analyzed a Farmer's Market dataset using SQL with filtering, sub-queries, joins, group by, and aggregate functions. I conducted in-depth exploration using joins, window functions, and date-time functions.
Languages:
I am fluent in English, Hindi, and Urdu, which enables effective communication in diverse environments.
Overall, I am a passionate technology enthusiast who thrives on efficiently leveraging data to unlock potential and create a significant impact. Seeking challenges, I aspire to contribute to innovative projects and grow both personally and professionally. My dedication to data-driven decision-making and expertise in data analysis and visualization make me a valuable asset to any team.