Hire the Best Data Scientists in Pune, IN

More than 3,000 reviews on G2
Rating is 4.5 out of 5.
4.5/5
of Upwork by G2 peer reviewers
Amol W.

Pune, India

$50/hr
5.0
107 jobs

Expert-Vetted (Top 1% of Upwork talent)🏆🏆🏆 🎓 NLP, ML, LLM and AI expert 💬 custom Chatbots using OpenAI/ AWS bedrock, langchain, vector databases. LLMs like chatgpt, GPT4, Claude3.5, Llama and Falcon 🚀 AI Agents development using frameworks like LangGraph, Autogen or CrewAI 📊 Sentiment Analysis, Text Classification, text generation, text summarization, Topic modelling, and Data Clustering 🚀 Certified AWS Architect skilled in designing and developing AI pipelines using AWS Bedrock and SageMaker, lambda, RDS specializing in NLP, ML, LLM, and AI technologies. 💬 Finetuning open-source LLMs on custom data 🤖 Low code/No code AI automations using tools like Make.com and Zapier 🖼️ Custom image generation using stable diffusion models 🎥 Object Detection, Motion Tracking, Scene Recognition, and Anomaly Detection. 🎯 Recommendation Engines Expert: Specialized in designing and implementing recommendation systems using AWS Personalize, Google Cloud Recommendations AI, and custom solutions built from scratch. Unlike many pseudo-AI experts who simply know how to call OpenAI or Anthropic APIs, I bring 𝟗 𝐲𝐞𝐚𝐫𝐬 of deep, hands-on experience in the AI field, mastering everything from traditional ML to advanced Generative AI. I understand the ins and outs of building real AI solutions that go far beyond basic API integrations. 🤖 𝐄𝐱𝐩𝐞𝐫𝐭𝐢𝐬𝐞 𝐢𝐧 𝐋𝐚𝐫𝐠𝐞 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬 (𝐋𝐋𝐌) 𝐚𝐧𝐝 𝐀𝐈: ➜ Fine Tuning: Specialized in finetuning LLMs like llama, openai models, qwen2.5, mistral for domain adaptation, style adaptation, persona writing, QnA, medical, legal, and more. ➜ LLM Synthetic Dataset Generation for finetuning ➜ LLM Evaluation Framework ➜ LLM Deployment: On Cloud platforms like AWS DLC, Lambda, and more. ➜ AI Agents / Voice Bots: Proficient with CrewAI/AutoGen, Amazon Polly, Deepgram, OpenAI swarm. ➜ AI Automation using make and zapier ➜ custom LLM Deployment: On AWS/GCP/RunPod using SkyPilot, vLLM/TGI frameworks 🛠️ 𝐓𝐨𝐨𝐥𝐬/𝐅𝐫𝐚𝐦𝐞𝐰𝐨𝐫𝐤𝐬: ➜ Langchain, LangServe, LangSmith, llamandex, Heystack, HuggingFace, Transformers ➜ VectorDB: Chromadb, FAISS, PineCone, Qdrant, opensearch, Azure Cosmosdb, Milvus ➜ Flowise AI, LangFlow, StackAI ➜ GCP Vertex AI, Google Colab, AWS Sagemaker, Azure ML studio, Runpod, Vast AI 🐍 Python Frameworks: ➜ Low-Code UI Tools: Streamlit, Gradio, Panel API Frameworks: FastAPI, Flask, Django, Pydantic Machine Learning / Deep Learning Frameworks:PyTorch, TensorFlow, Keras, HuggingFace Transformers ➜Data Wrangling / Processing:Pandas, NumPy, Dask, Polars, Scikit-learn ➜Model Serving and Deployment: Triton Inference Server, TorchServe, ONNX Runtime, MLflow, BentoML 🗄️ 𝐃𝐚𝐭𝐚𝐛𝐚𝐬𝐞𝐬: ➜ SQL: MySQL, PostgreSQL, SQLite, Azure SQL ➜ NoSQL: MongoDB, DynamoDB, Firebase, Redis ➜ GraphDB: Neo4j, Amazon Neptune 💻 𝐅𝐫𝐨𝐧𝐭𝐞𝐧𝐝 𝐓𝐞𝐜𝐡: ➜ React, Angular, Next.js, Vue.js, Tailwind CSS, Bootstrap I am lead AI/ML engineer with more than 9 years of experience traditional ML, deep learning, advanced NLP, generative AI LLMs like chatgpt, GPT4, Llama, Falcon and Mistral, Mixtral, Qwen. Strong experience in executing custom AI and NLP solutions and integrating them in business workflows, along with advanced skills in object detection, motion tracking, scene recognition, and anomaly detection. If you're working with any sort of data for your project, I'm here to help! Whether you have raw and unprocessed data that needs cleaning, or you need help scraping and annotating new data, I've got you covered. As an AI professional with a specialization in AI, NLP, LLMs, I've worked with various models, including GPT3, Chatgpt/GPT4, llama3, Qwen, and GPT-J, and have experience in applying state-of-the-art NLP techniques to projects. If you need help training a deep learning model, I can help you experiment with cutting-edge models such as T5, Bert, M2M, FLAN-T5 and RoBerta to achieve the best possible performance. I can train/Fine tune open source LLMs like Llama, mpt7b, Falcon using efficient techniques like QLora. I'm well-versed in working with transformer-based models and can help you fine-tune and transfer learning to get the most out of your data. If you have text data I can help with text classification, natural language understanding, and natural language generation. If you're looking for a chatbot or conversational AI solution, I can help you develop a solution using Chatgpt, langchain and vector databases like pinecone. In addition to NLP, I'm experienced in working with sequential data, time series forecasting, and PyTorch code debugging. I have successfully completed over 60 jobs on Upwork, logging more than 4000 hours of work while consistently achieving client satisfaction. If you're looking for an AI professional who can help with anything remotely related to LLMs or AI Agents, any other NLP/ML task don't hesitate to reach out to me. I'll be more than happy to assist you in achieving success with your project.

  • Machine Learning
  • Natural Language Processing
  • Python
  • Artificial Intelligence
  • Deep Learning
  • AI Agent Development
  • AI App Development
  • Large Language Model
  • Generative AI
  • TensorFlow
  • LLM Prompt Engineering
  • AI Development
  • AI Chatbot
  • Chatbot Development
  • LangChain
Siddhant M.

Pune, India

$15/hr
4.8
43 jobs

Data Engineer & AI Developer | 3+ Years Financial Industry Experience I build data pipelines, AI-powered applications, and automation systems that run reliably at scale. My background spans web scraping, LLM integration, computer vision, betting automation, and full-stack data dashboards — delivered to clients across the US, UK, Europe, and Japan. 💼 Background — 3+ years at a leading Indian bank building risk models, credit scorecards, and AutoML pipelines — PG Diploma in Big Data Analysis ⚡ What I Deliver — Web scrapers handling 1.2M+ URLs and 120K daily pipelines — LLM/AI apps using GPT-4, Gemini, LangChain, RAG, Text-to-SQL — Full Betting automation for horse racing, golf, and football signals — Computer vision pipelines with YOLOv8 and PaddleOCR — Streamlit dashboards, risk scorecards, and AutoML tools 🏆 Notable Work — PitchBook scraper — 1.2M URLs — Njuskalo — 120K daily real estate listings — Text-to-SQL architecture — BetFare — full Betfair automation — LLM Notebook — $1,420 solo delivery — Anti-bot bypass systems 🛠️ Stack Python · Playwright · Selenium · GPT-4 · Gemini · LangChain · Streamlit · PySpark · SQL · YOLOv8 · PaddleOCR · FastAPI · Betfair API · n8n Clean code. Clear communication. Delivered on time.

  • Data Science
  • Data Analysis
  • Python
  • SQL
  • PySpark
  • Java
  • Front-End Development
  • Streamlit
  • AI Chatbot
  • API
  • Web Scraping
  • Selenium
  • PyQt
  • YOLO
Akash K.

Pune, India

$15/hr
5.0
9 jobs

I am a Data Scientist with a Master's degree in Statistics and professional experience delivering data-driven solutions through statistical modeling, machine learning, and advanced analytics. I help businesses, researchers, and organizations transform complex data into actionable insights that support informed decision-making. My expertise includes statistical analysis, predictive modeling, machine learning, data visualization, and interactive application development. I have worked on projects across business analytics, healthcare, biostatistics, transportation, and academic research, developing solutions that are both technically robust and practically useful. I specialize in building end-to-end analytical workflows, from data collection and preprocessing to modeling, visualization, and reporting. I also develop interactive applications using R Shiny and other Posit products to help stakeholders explore data and automate decision-making processes. Areas of Expertise • Statistical Modeling and Data Analysis • Machine Learning and Predictive Analytics • Hypothesis Testing and Experimental Design • Regression Analysis and Multivariate Methods • Data Cleaning, Preprocessing, and Feature Engineering • Exploratory Data Analysis (EDA) • R Shiny Application Development • Interactive Dashboards and Data Visualization • Business Analytics and Decision Support • Research and Biostatistical Analysis Technical Skills • Python (Pandas, NumPy, Scikit-learn, XGBoost) • R (Shiny, tidyverse, ggplot2, caret) • SQL • Posit Products (Shiny, Quarto, R Markdown) • Excel and Power BI • Statistical and Machine Learning Techniques • Master's Degree in Statistics • Top Rated Freelancer with 100% Job Success • Experience delivering machine learning, analytics, and statistical solutions • Strong communication with both technical and non-technical stakeholders • Commitment to producing accurate, reproducible, and well-documented work Whether you need advanced statistical analysis, predictive models, interactive R Shiny applications, or comprehensive data solutions, I am committed to delivering high-quality results that create measurable value.

  • Data Analysis
  • Data Mining
  • Machine Learning
  • R
  • Machine Learning Model
  • Artificial Intelligence
  • Analytical Presentation
  • Mining
  • RStudio
  • Image Segmentation
  • Statistical Analysis
Manish B.

Pune, India

$10/hr
4.9
97 jobs

I am a Lead Data Scientist and Data Engineer with over 10 years of experience turning messy, complex datasets into scalable, high-accuracy predictive systems and automated data pipelines. I specialize in the Retail/FMCG and Healthcare/Pharmaceutical sectors, bridging the gap between advanced technical architecture and actionable business strategy. Backed by an Executive PG in Data Science & AI from IIT Roorkee and holding Microsoft Certifications in Python and Azure (AZ-900 & DP-100), I build the "intelligence layers" that drive enterprise decision-making. My Core Technical Expertise: Data Engineering & ETL: GCP (BigQuery), SQL, dbt, Parquet, Batch Processing, Multiprocessing Programming & AI/ML: Python (Pandas, Scikit-learn, Regex, Flask, Streamlit), R, SAS, Predictive Modeling, GenAI Business Intelligence & Viz: Power BI, Looker, Clinical Dashboards Proven Domain Impact: Retail & FMCG: Engineered automated POS data pipelines (36-month initiative), improving loading efficiency by 30%. Led a 24-month product harmonization project using predictive modeling to resolve discrepancies across millions of records. Designed GCP/BigQuery demand forecasting pipelines to optimize supply chain inventory. Healthcare & Clinical Analytics: Deep experience integrating open-source tools (Python, R, SAS) for clinical trial data visualization. Proficient in building interactive clinical dashboards utilizing Pharmaverse, SDTM, and ADaM datasets. Cloud Architecture & Metrics: Migrated massive datasets to modern cloud environments and developed robust DAU/MAU tracking structures inside BigQuery and Looker. Why Work With Me? I do not just execute code; I solve business problems. Alongside my technical career, I manage my own coworking space business, meaning I inherently understand the commercial urgency, stakeholder management, and extreme ownership required to deliver ROI. Whether you need to deploy an AI model, untangle enterprise ERP data, or build an interactive dashboard from scratch, I write clean, maintainable, and production-ready code. Let’s connect to discuss how we can scale your data infrastructure and build models that actually impact your bottom line.

  • Data Analysis
  • Data Science Consultation
  • Machine Learning
  • Python
  • R
  • Statistics
  • SAS
  • Microsoft Excel
  • RStudio
  • Hypothesis Testing
  • Report
  • Jupyter Notebook
  • Azure Machine Learning
  • Predictive Analytics
Gandhali Narhari J.

Pune, India

$12/hr
5.0
32 jobs

I am a Data Scientist and AI Automation Engineer with 4+ years of freelancing experience building end-to-end data and AI solutions. I specialize in web data extraction, ETL pipelines, AI-powered automation, and interactive analytics dashboards that help businesses transform raw data into actionable insights. I have developed scalable scraping systems using Python (BeautifulSoup, Selenium, Scrapy), built ETL pipelines integrating MySQL and AWS data infrastructure, and created interactive dashboards using Dash and Tableau for real-time analytics. My recent work also includes building LLM-powered applications such as Retrieval-Augmented Generation (RAG) systems, AI chatbots, and document analysis pipelines using LangChain and modern LLM APIs. Previously, I worked for 9.5 years as an Automation Engineer with global companies such as Baker Hughes and Honeywell, where I developed strong skills in engineering systems, problem-solving, and delivering reliable solutions for industrial clients.

  • Data Analysis
  • Data Mining
  • Machine Learning
  • Natural Language Processing
  • Python
  • Deep Learning
  • Tableau
  • Data Scraping
  • Microsoft Excel
  • Data Visualization
  • Large Language Model
  • Retrieval Augmented Generation
  • LLM Prompt Engineering
  • SQL
  • Amazon Web Services
Tejas L.

Pune, India

$15/hr
5.0
34 jobs

As a freelance professional with a Master's in Science in Statistics, I bring a unique blend of academic knowledge and practical expertise to every project I undertake. With a strong foundation in statistical theory, advanced data analysis techniques and the ability to effectively communicate complex statistical concepts, I am equipped to provide valuable insights and solutions to clients across various industries. My academic journey in pursuing a Master's of Science in Statistics has provided me with a solid understanding of statistical principles and their applications. Through coursework in probability theory, statistical inference, regression analysis, experimental design, and multivariate analysis, I have gained a comprehensive skill set that enables me to tackle diverse data analysis challenges. One of the key aspects of my academic training has been the emphasis on statistical software and programming languages. I have developed proficiency in utilizing tools such as R and Excel to manipulate, analyze, and visualize large datasets. This technical expertise allows me to efficiently process and extract meaningful information from complex data, facilitating evidence-based decision-making for my clients.

  • Data Analytics & Visualization Software
  • Data Analysis
  • Data Mining
  • Machine Learning
  • R
  • Statistics
  • Microsoft Excel
  • RStudio
  • Hypothesis Testing
  • Regression Analysis
  • Probability Theory
  • Multivariate Statistics
  • Data Modeling
  • Biostatistics
  • R Shiny

How it works

Post a job for free Post a job

Tell us what you need. Create your own job post or generate one with AI then filter talent matches.

Hire top talent fast

Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.

Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

Payment simplified

Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.

Don't just take our word for it

Resources to help you hire

Cost to hire a Data Scientist

Cost to hire a Data Scientist

Explore typical Data Scientist rates and what businesses pay to hire top talent.

Data Scientist job description template

Data Scientist job description template

Get tips to write a job post that attracts qualified Data Scientists.

Data Scientist interview questions

Data Scientist interview questions

Top interview questions to help you hire the right Data Scientists, faster.

How do I hire a Data Scientist near Pune, on Upwork?

You can hire a Data Scientist near Pune, on Upwork in four simple steps:

  • Create a job post tailored to your Data Scientist project scope. We’ll walk you through the process step by step.
  • Browse top Data Scientist talent on Upwork and invite them to your project.
  • Once the proposals start flowing in, create a shortlist of top Data Scientist profiles and interview.
  • Hire the right Data Scientist for your project from Upwork, the world’s largest work marketplace.

At Upwork, we believe talent staffing should be easy.

How much does it cost to hire a Data Scientist?

Rates charged by Data Scientists on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.

Why hire a Data Scientist near Pune, on Upwork?

As the world’s work marketplace, we connect highly-skilled freelance Data Scientists and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Data Scientist team you need to succeed.

Can I hire a Data Scientist near Pune, within 24 hours on Upwork?

Depending on availability and the quality of your job post, it’s entirely possible to sign up for Upwork and receive Data Scientist proposals within 24 hours of posting a job description.