Hire the Best Pyspark Developers in New York
New York, New York
Are you in search of a highly skilled professional to elevate your business operations and enhance customer interactions through intelligent chatbot solutions? Look no further! I am a dedicated LLM Engineer and AI enthusiast specializing in Python API integration, GPT (OpenAI Assistant, ChatGPT, GPT 4, Whisper), and various OpenAI solutions and LLMs in general! With a robust background in Artificial Intelligence and NLP.I have cultivated expertise in constructing chatbots using Langchain, OpenAI API, OpenAI Assistants, and open-source models on HuggingFace. My services cover a broad spectrum of chatbot development needs: 🌟 Are you encountering challenges in developing chatbots using Langchain, OpenAI API, OpenAI Assistants, or open-source models on HuggingFace? 🌟 Do you require expert assistance in seamlessly integrating chatbot functionalities into your website or application? 🌟 Are you aiming to create a personalized chatbot that effortlessly comprehends and responds to user queries? 🌟 Do you wish to optimize the performance and accuracy of your existing chatbot solution? 🌟 Have you come across any errors or issues in your chatbot's functionality that demand immediate troubleshooting? 🌟 Are you interested in creating an agent with various features (web, database search, vector store retrieval, ...)? Rest assured, I am here to provide comprehensive solutions and steadfast support for all your chatbot development needs. Why choose me? As a Creative Python API Integration and Python solution expert, I bring a wealth of experience in working with OpenAI LLMs (GPT 4, OpenAI Assistants, ChatGPT), Langchain, Pinecone, Weaviate, FAISS, and other LLM / VectorStores. Here are the technologies I specialize in: ✔️ LLM Agents (Access to Web, APIs, Databases, ...) ✔️ GPT Apps development using Python, Langchain, LLM (OpenAI Assistants / ChatGPT / GPT 4 / HuggingFace models), VectorStores (Pinecone, Weaviate, FAISS, ...) ✔️ Automation and integration using ChatGPT API. ✔️ Python solutions using ChatGPT API, Whisper API, GPT 4 model, and other OpenAI API services. ✔️ Integrating OpenAI's ChatGPT and GPT 4 API to handle user prompts. ✔️ Prompt Engineer with extensive experience in designing various problems for a variety of AI use cases. My commitment lies in crafting seamless and intuitive conversational experiences that consistently exceed the expectations of my esteemed clients. Together, we can create an intelligent chatbot solution tailored to your unique requirements, driving unprecedented success. If you are ready to unlock the transformative potential of chatbots and revolutionize your business, please feel free to reach out to me. Let's collaborate and turn your vision into a reality. Best regards, Chintan Soni
- PySpark
- GPT-4
- ChatGPT
- Python
- MySQL
- Django
- Artificial Intelligence
- Large Language Model
- AI Chatbot
- Vector Database
- Whisper AI
- Generative AI
- CI/CD
- Databricks Platform
- Data Analysis
Brooklyn, New York
Is your business struggling with scattered data, slow dashboards, expensive Azure bills, or legacy systems that can’t scale? I help companies design modern, secure, and cost-efficient end-to-end Azure data solutions - from ingestion and transformation to analytics and reporting, with a strong focus on performance, scalability, and long-term sustainability. 💎What's my benefit? - 8+ years of hands-on experience designing and implementing scalable Azure data architectures, cloud-native ETL pipelines, and enterprise-grade data platforms using Azure Synapse, Fabric, Data Factory, Databricks, and Azure SQL. - Agile project development lifecycle - I won't leave you halfway and disappear. 🔥 Here are the technologies I specialize in: - Cloud & Data Platforms: Microsoft Azure, Azure Synapse Analytics, Azure Data Factory (ADF), Azure Databricks, Azure Data Lake (ADLS Gen2), Azure IoT Hub, Logic Apps, Azure Stream Analytics, Microsoft Fabric - Data Engineering & Processing: Python, PySpark, SQL, ETL/ELT Pipelines, Medallion Architecture (Bronze/Silver/Gold), Azure Functions - Data Warehousing: Azure SQL Database, Synapse Dedicated/Serverless Pools, SQL Server, Azure Data Explorer - Databases: PostgreSQL, MySQL, Microsoft SQL Server, ADX - Analytics & BI: Power BI, DAX, Data Modeling (Star & Snowflake Schema) - DevOps & Infrastructure: Azure DevOps, Git, CI/CD Pipelines, Docker, ARM/Bicep Templates If you're looking for a reliable expert who can take full ownership and deliver results, feel free to reach out
- PySpark
- Docker
- Databricks Platform
- Azure IoT HuB
- Microsoft Azure SQL Database
- Microsoft Power BI
- Data Lake
- Data Analytics & Visualization Software
- Azure DevOps
- Fabric
- Python
- Real Time Stream Processing
- Terraform
- ETL
Ithaca, New York
I am an AI Agent Developer & LLM Engineer specializing in AI agents, chatbots, Claude AI, RAG systems, and automation workflows. I build AI agents, AI chatbots, and Claude-powered systems that automate business operations, customer support, and internal workflows. I help startups and SaaS businesses design, deploy, and scale AI automation systems using AI agents, LLMs, and modern cloud infrastructure. My expertise includes AI agents, chatbot development, Claude AI, RAG pipelines, prompt engineering, semantic search, and LLMOps. 💬 Message me to build AI agents, chatbots, or Claude-powered automation systems. 🚀 What I Build → AI chatbots for customer support, websites, SaaS, and APIs → AI agents for workflow automation, task execution, and operations → Claude-powered assistants for structured reasoning and workflows → RAG systems for knowledge bases, documents, and enterprise data → AI automation systems for business processes and operations → Prompt engineering and LLM optimization for accuracy and cost → Multi-agent systems with tool-calling and orchestration → Semantic search and vector database AI systems → AI copilots and conversational assistants → AI dashboards and automation interfaces 📈 Results & Impact (SaaS / Startups) → “We reduced our support tickets by over 65% after deploying the AI chatbot trained on our docs and FAQs.” → “The Claude-powered onboarding agent improved our user activation rate by 30% within weeks.” → “Customer support is now instant, the AI chatbot handles responses 24/7 without delays.” → “The RAG support assistant resolves about 80% of user issues without needing human support.” → “Our trial-to-paid conversion increased by 25% thanks to the AI lead qualification chatbot.” → “The AI agent now handles refunds, account issues, and support queries, saving us 15+ hours per week.” → “We no longer rely heavily on support, the semantic search system lets users find answers instantly.” → “We cut our AI API costs by 40% after optimizing LLM workflows and prompt engineering.” ⚙️ Tech Stack 🧠 LLMs & GenAI: GPT-4o | Claude | Gemini | Llama 3 | Open-Source LLMs 🧾 AI & NLP: AI Agents | Chatbots | Prompt Engineering | NLP | Embeddings | Evaluation 🔗 LLM Frameworks: LangChain | LangGraph | LlamaIndex | DSPy | GraphRAG 📚 RAG Systems: Retrieval-Augmented Generation | Hybrid Search | Reranking | Evaluation 🧭 Vector Databases: Pinecone | Weaviate | FAISS | pgVector | Milvus ⚙️ Backend & APIs: Python | FastAPI | Node.js | REST APIs | Webhooks ☁️ Cloud AI: AWS | GCP | Azure | Serverless | Data Infrastructure 🔁 LLMOps / MLOps: MLflow | Docker | Kubernetes | CI/CD | Monitoring 📈 Advanced: AI Agents | Automation Systems | Fine-Tuning | Multi-Agent Systems 🧠 Why Clients Hire Me ✔ AI Agents & Chatbot specialist focused on automation ✔ Build production-ready AI systems, not prototypes ✔ Strong expertise in Claude AI, RAG, and LLM workflows ✔ Focus on AI automation, efficiency, and business impact ✔ Scalable AI architecture for SaaS and business systems ✔ Fast execution and clear communication If you need AI agents, chatbots, or Claude-powered automation systems, I design and deploy scalable solutions. 💬 Message me to build your AI automation system. Keywords AI automation, AI agents, AI chatbot, Chatbot development, Generative AI, Large Language Models, LLM, OpenAI API, Claude AI, GPT-4, AI developer, Machine learning, NLP, Natural language processing, RAG, Retrieval augmented generation, LangChain, LangGraph, AI engineer, AI SaaS, Automation workflows, Business automation, Conversational AI, AI assistant, AI integration, API integration, Python, FastAPI, Node.js, TypeScript, Prompt engineering, AI solutions, AI development, Chatbot integration, Customer support chatbot, AI support agent, AI automation tools, Workflow automation, AI consulting, AI architecture, AI system design, AI deployment, AI app development, SaaS automation, AI product development, AI startup, Vector database, Pinecone, Weaviate, FAISS, Semantic search, Embeddings, Knowledge base chatbot, Document chatbot, AI copilots, Multi agent systems, Autonomous agents, Tool calling agents, AI orchestration, LLM fine tuning, AI optimization, AI cost optimization, AI pipelines, Data pipelines, MLOps, LLMOps, Docker, Kubernetes, AWS, Google Cloud, Azure, Serverless AI, Backend development, REST API, Webhooks, Data engineering, Predictive analytics, Text classification, Named entity recognition, AI dashboards, Real time systems, AI monitoring, AI evaluation, Hallucination reduction, AI performance tuning, AI scaling, Enterprise AI, AI integration services, AI chatbot for website, AI chatbot for SaaS, Lead generation chatbot, Sales chatbot, AI workflow automation
- LLM Prompt
- Generative AI
- Retrieval Augmented Generation
- Chatbot
- Amazon Web Services
- LangChain
- GPT-4
- Vector Database
- Python
- Docker
- Large Language Model
- ChatGPT
- AI Agent Development
- Natural Language Processing
- Claude 3.5 Sonnet
Manhattan, New York
Data Scientist | ML Engineer | NLP & Predictive Modeling | Python | SQL I help teams make sense of messy data and turn it into working machine learning systems. My work spans the full lifecycle: exploratory analysis, feature engineering, modeling, evaluation, and deployment. I combine strong data science fundamentals with deep experience in NLP and applied ML. What I Do: - Exploratory Data Analysis (EDA) to understand patterns, risks, and modeling paths - Build classification, ranking, and regression models using Scikit-learn & PyTorch - NLP projects: text cleaning, vectorization, topic modeling, transformer-based classifiers - End-to-end ML pipelines: data prep → modeling → validation → reporting - Data workflows at scale with BigQuery, PySpark, and cloud-based ML tooling - Business-facing analytics, dashboards, and insights that support decision-making Toolkit: - Python (Pandas, NumPy, Scikit-learn, PyTorch, Hugging Face) - SQL (BigQuery), PySpark - Visualization: Power BI, Plotly, Matplotlib - ML Ops & Infra: Vertex AI, Docker - Git/GitHub, Notion, n8n Selected Results: - Improved an enterprise classifier from **0.74 → 0.86 F1** across **90M+ documents** - Built LLM-assisted labeling workflows expanding datasets from **200 → 4,000+ samples** - Reduced NGO triage workload by **40%** using ML-based prioritization - Conducted deep EDA for multiple enterprise projects to inform modeling direction & risk What You Get Working With Me: - Clear, honest communication and fast iteration - Strong reasoning behind modeling decisions - Clean, maintainable code with documentation - A partner who understands both analysis and production-ready ML If you need someone who can explore your data, understand it, and turn it into a reliable ML solution — let’s talk.
- Data Science
- pandas
- TensorFlow
- Tableau
- Python
- NumPy
- SQL
- Machine Learning
- Keras
- LLM Prompt Engineering
- Hugging Face
- Data Analysis
- Generative AI
- Data Extraction
- Microsoft Excel
Brooklyn, New York
I have several years of experience, using multiple programming languages and frameworks, in both academia and industry, in statistical modeling, probabilistic programming, classical machine learning and application of deep learning in several domains. The last several years I have been primarily ad cloud and data architect and data engineer and AI engineer. I have extensive experience in data modeling, pipeline development, data compliance, and worked with several database technologies (relational, NoSQL, graph, Hadoop and Spark), as well as CI/CD, API development, and architecting and development with multi-cloud computing platforms (Microsoft Azure, AWS, and GCP). I have deep experience in Databricks, and am a certified data engineer professional. I have worked in various fields such as, statistical genetics, behavior genetics, computational social science, app development, FinTech and healthcare data. The last 3 years I have run a consultancy while also building two profitable startups as data architect, data engineer while also being responsible for AI engineering and machine learning. Current Certifications: Databricks: Data Engineer Professional Azure: Developer, Data Scientist, Data Engineer, AI Engineer AWS: Data Engineer GCP: Data Engineer I have an extensive background in Python, R, JavaScript, SQL (T-SQL, PostgreSQL, SparkSQL, Databricks SQL, CQL, HiveQL, Cypher, KQL), with most data science libraries such as Pandas, NumPy, Scikit-learn, SciPy, SymPy, Rapids, PySpark, TensorFlow and TensorFlow Probability, Keras, PyTorch. I have extensive experience with standard data engineering tools: Snowflake, Databricks, Spark, PostgeSQL , dbt , Neo4j, Hadoop, Airflow, Beam, Iceberg, Hive, Kafka , AirFlow, Flink, SuperBase, Hadoop, Dagster, Prefect, dot. As well as deep experience with several AI frameworks and automation tools: DSPy, LlamaIndex, BAML. LangChain, Hugging Face, AutoGen, LangGraph, a2a, MCP the AI frameworks of all 3 major cloud providers, n8n, Make, PipeDream, Flowise, Zapier, 5 Tran, and AirByte. I have extensive experience with multiple cloud providers, especially with the following skills and cloud products: General Cloud Development: Databricks in any cloud Azure: Storage, Networking, Functions, VMs, App Service, Synapse, Data Factory, DevOps, Kubernetes Service, API Management, Service Bus, Event Hubs, Event Grid, PubSub, Notification HubsRelay, IoT Hub, Power Apps, Power Automate, Azure Machine Learning, Synapse Analytics, Data Lake Storage, SQL Server, Cosmos DB, Data Flow, Stream Analytics, HDInsight, Redis, Cognitive Services, Text Analytics Service, LUIS, Custom Vision, Bot Service, Cognitive Search, and the general Power Platform AWS: S3, EBS, EC2, Lambda, Networking, RDS, Redshift, Arora, Athena, DynamoDB, DocumentDB, Lake Formation, Glue, Redis, SageMaker, Bedrock, Kinesis, QuickSight Elasstic Beanstalk, Elastic Kubernetes Service, Databricks, Pipeline, EMR, CloudWatch GCP: Storage, Networking, Cloud Functions, VMs, App Engine, Kubernetes Engine, API Gateway, BigQuery, BigTable, DataProc, Composer, DataFlow, Data Fusion, Data Studio, Cloud SQL, CloudSpanner, Cloud Memorystore, Pub/Sub
- Machine Learning
- R
- Data Science
- Python
- Deep Learning
- Data Analysis
- Cloud Architecture
- Data Engineering
- API Development
- Node.js
- Google Cloud Platform
- Trading Automation
- Amazon Web Services
- Financial Trading
- Microsoft Azure
Queens County, New York
I am a versatile and driven tech professional with deep expertise in data engineering and a strong background in web and app development. With extensive experience across industries, I excel in transforming raw data into actionable insights and building innovative solutions that drive business success. Whether it’s leveraging PySpark for complex data transformations or designing responsive web applications, I am committed to delivering high-quality results and solving complex challenges. Key Skills: Data Engineering: ✅ PySpark & Hadoop ✅ Hive & SQL ✅ AWS & Cloud Technologies ✅ ETL Process Development ✅ Data Pipeline Optimization Web Development: ✅ Responsive Web Applications ✅ User-Friendly Interface Design ✅ Full-Stack Development (HTML, CSS, JS, React, Express, Node.js) App Development: ✅ Flutter & Firebase ✅ Cross-Platform Solutions ✅ API Integration Let’s connect to explore how I can help elevate your next data-driven or development project!
- Data Engineering
- Data Analysis
- Analytics
- Machine Learning
- App Development
- Android App Development
- iOS Development
- Mobile App Development
- Web Design
- Web & Mobile Design Consultation
- iOS
- Flutter
- NodeJS Framework
- ExpressJS
- React
How it works
Post a job for free Post a job
Tell us what you need. Create your own job post or generate one with AI then filter talent matches.
Hire top talent fast
Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.
Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
Payment simplified
Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.
Don't just take our word for it
“Upwork provides an umbrella-level of security. I can see a talent’s work history and ratings. I can hold payments in escrow. I can communicate through Upwork Messages instead of working through my email address.”
Kim Darling
Emerald Tiger
“Upwork is the best platform to hire skilled professionals when we're not looking for a full-time employee. All the companies in our portfolio use Upwork to find talent across a wide range of fields.”
David Merry
Kinetic Investments
“Our very specific requirements can be a challenge—With Upwork, we’re able to access a bigger community to ensure the success of our projects.”
Katja Krohn
Summa Linguae
How do I hire a Pyspark Developer in New York on Upwork?
You can hire a Pyspark Developer in New York on Upwork in four simple steps:
- Create a job post tailored to your Pyspark Developer project scope. We'll walk you through the process step by step.
- Browse top Pyspark Developer talent on Upwork and invite them to your project.
- Once the proposals start flowing in, create a shortlist of top Pyspark Developer profiles and interview.
- Hire the right Pyspark Developer for your project from Upwork, the world's largest work marketplace.
At Upwork, we believe talent staffing should be easy.
How much does it cost to hire a Pyspark Developer?
Rates charged by Pyspark Developers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.
Why hire a Pyspark Developer in New York on Upwork?
As the world's work marketplace, we connect highly-skilled freelance Pyspark Developers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Pyspark Developer team you need to succeed.
Can I hire a Pyspark Developer in New York within 24 hours on Upwork?
Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Pyspark Developer proposals within 24 hours of posting a job description.
Find more freelancers
Nearby cities for Pyspark Developers
- Database Freelancers in New York, NY
- Data Analysts in New York, NY
- Redis Developers in New York, NY
- Brand Licensing Specialists in New York, NY
- Nutritionists in New York, NY
- Film Directors in New York, NY
- SQL Consultants in Brooklyn, NY
- Jewelry Designers in New York, NY
- Grammar Specialists in New York, NY
- Actors in New York, NY
- Interviewers in New York, NY
- Fundraising Consultants in New York, NY
- CAD Designers in New York, NY
- Architects in New York, NY
- Consultants in New York, NY
- PR Consultants in New York, NY
Explore Related Skills in New York
- Pandas Developers in New York
- Data Engineers in New York
- Data Cleaning Professionals in New York
- Order Entry Specialists in New York
- Data Visualization Specialists in New York
- Scrapy Developers in New York
- Network Engineers in New York
- Data Entry Specialists in New York
- Machine Learning Engineers in New York
- Healthcare Information Technology Specialists in New York
- Scikit-Learn Specialists in New York
- Typists in New York