Hire the Best Data Engineers
in the United States

Clients rate our Data Engineers
Rating is 4.9 out of 5.
4.9/5
Based on 145 client reviews
Stanley B.

Yorba Linda, California

$95/hr
4.9
34 jobs

Cloud Solution Architect with engineering experience in Cloud SQL Big Data technologies including Data Architecture to support various Business Intelligence needs. Solution Architect in Technical Teams on Cloud Data Solutions into various Cubes, Data Marts, and ERP Systems. Developed data structures for business using various Analysis Services Cubes, BI Dashboard, and Scorecards. Recent experiences include developing multiplayer AI Platform with Generative AI using AWS Bedrock for back-office CareManagement application including HealthCare. Implementations included Anthropic Claude model & Twilio Integration for Client Onboarding, Authentication, and Call Flow Management. Cloud Services Database include Snowflake Data Cloud, Azure Cloud, Go/Language (JSON and yaml) with Anaconda python programming. Migrated Data Lakes from on-prem up to SnowFlake using SnowPipe and SnowSQL using command line scripts using AI methods. Implemented standards and data designs for HIPAA, SOX, regulatory, compliance, financial, reporting, and auditing. Next generation AI development with Anthropic Claude Code, ChatGPT, Gemini, etc.

  • Data Warehousing & ETL Software
  • Big Data
  • SQL
  • IBM Cloud
  • Microsoft SQL SSAS
  • Netezza
  • Snowflake
  • Databricks Platform
  • Azure Blockchain Service
  • Amazon Redshift
  • AWS CloudFormation
  • Google Cloud Platform
  • Apache Kafka
Shaista R.

Lake Grove, New York

$60/hr
5.0
6 jobs

Most data pipelines don’t fail because of code. They fail because they weren't built for scale. With 8+ years of experience engineering data systems at companies like Microsoft and Coreweave, I help businesses move away from "brittle prototypes" to production-grade, scalable infrastructure. I don’t just move data; I build the "Source of Truth" that leadership and AI systems actually trust. 💬What I Solve for You: Productionizing AI Pipelines: Hardening Python prototypes into scalable RAG and LLM infrastructures (AWS/Azure). ➔Infrastructure-as-Code: Building automated, modular ETL/ELT pipelines that don't require daily manual fixes. ➔The "One-Source" Dashboard: Integrating messy data from APIs, SaaS (Shopify, HubSpot), and DBs into clean Snowflake/BigQuery layers. ➔Performance Recovery: Optimizing slow SQL queries and high-cost cloud warehouses to save you thousands in monthly spend. 🛠 Tech Stack: Languages: Python (FastAPI, Pandas, PySpark), SQL Cloud & Warehousing: AWS (Glue, Lambda, S3), Snowflake, BigQuery, Azure Orchestration: Airflow, dbt, GitHub Actions Data Ops: API Integrations, Vector DBs, Data Validation ✅ Why Me? 8+ Years Experience: I’ve seen what breaks at the enterprise level and how to prevent it in your startup. Speed over Perfection: I focus on shipping high-impact systems that drive revenue, not just technical documentation. Transparent Communication: You get regular updates and a partner who challenges requirements to find better solutions. Ready to clean up your data debt? 📩 Message me for a FREE 15-minute technical consultation. Let’s discuss your architecture and see if I’m the right fit for your system.

  • Data Engineering
  • Apache Spark
  • BigQuery
  • Data Integration
  • Data Warehousing
  • ETL Pipeline
  • Python
  • SQL
  • Apache Airflow
  • Snowflake
  • Amazon Web Services
  • Data Modeling
  • Apache Kafka
  • PostgreSQL
  • Tableau
  • Docker
Clayton G.

Miami, Florida

$60/hr
5.0
7 jobs

I build the unglamorous data layer most founders don't want to touch: web scrapers, data pipelines, data cleanup and enrichment, and small AI automations. And I ship fast, with a working sample instead of promises. A few things I've built: • A scraper pulling product pricing across 14 states into a live, filterable dashboard • A tool that turns messy Confluence spaces into clean, agent-ready docs (markdown, stable anchors, llms.txt) • A Hyperliquid wallet PnL analyzer, and a Meteora DLMM impermanent-loss backtester • codexplays.games, autonomous AI agents I run in public What I'm best at: getting data out of stubborn sites (Playwright, Selenium, requests), cleaning and structuring it, wiring it into Postgres, Sheets, or APIs, and automating the boring parts. Stack: Python, Playwright, Pandas, Supabase/Postgres, plus LLM and agent tooling when it earns its place. How I work: I'd rather show you a working sample than talk. Send me the site or the data problem and I'll usually hand back a small proof before you commit to anything. Fast turnarounds, clean handoffs, and I'm honest about what's actually doable.

  • Python
  • Web Scraping
  • Data Scraping
  • Data Extraction
  • Web Crawling
  • Scrapy
  • Selenium
  • Automation
  • Data Mining
  • API Integration
  • Data Cleaning
  • PostgreSQL
  • Artificial Intelligence
  • Generative AI
  • Web Development
Guibin Z.

Plano, Texas

$85/hr
5.0
2 jobs

🚀 Ex-Meta / Yahoo | AI Full-Stack Engineer | GenAI, LLM, SaaS, APIs I build end-to-end AI products — from LLM-powered features to scalable full-stack systems. With 10+ years at Meta, Yahoo, and startups, I’ve shipped production systems used by millions and designed platforms that scale reliably. I focus on delivering real, usable AI applications — not just demos. ✅ What I Can Help You Build: • GenAI Applications: GPT / Claude integrations, AI copilots, workflows, prompt engineering • RAG Systems: chat over documents, knowledge bases, retrieval pipelines • Full-Stack AI Apps: frontend + backend + APIs + AI integration • AI Agents: tool-using agents, automation workflows • Backend Systems: scalable APIs, microservices, cloud architecture • AI SaaS: MVP → production-ready systems 💼 Experience Highlights: • Meta — built large-scale production systems • Yahoo — personalization systems serving billions of requests/day • Indeed / Workday — scalable backend and cloud platforms • Tech Lead — owned architecture and built systems end-to-end • Real-time systems processing TBs/day (Kafka, distributed systems) 📊 What You Get: • End-to-end builder (AI + full-stack + product) • Production-ready systems, not fragile prototypes • Scalable architecture from day one • Fast execution and clear communication 🔧 Tech Stack: AI: GPT-4/5, Claude, RAG, AI agents, prompt engineering Frontend: React, Next.js, TypeScript Backend: Python (FastAPI, Flask), Node.js, Java, Scala, APIs, microservices Cloud: AWS (S3, Lambda), serverless architecture Infra: Docker, Kubernetes, CI/CD Data: Spark, Kafka Databases: PostgreSQL, MySQL, MongoDB, Redis 🎯 Typical Projects: • AI chatbot / copilot • RAG over internal/company data • AI SaaS MVP • Full-stack GenAI applications • Workflow automation with LLMs Keywords: Generative AI, GenAI, LLM, GPT-4, GPT-5, Claude, OpenAI API, Anthropic Claude, RAG, Retrieval Augmented Generation, AI Agents, Prompt Engineering, AI Automation, Chatbots, Copilots, Python, FastAPI, Flask, Full-Stack Development, Frontend Development, Backend Development, React, Next.js, TypeScript, Node.js, API Design, REST APIs, GraphQL, Microservices Architecture, SaaS Development, Multi-Tenant Systems, Authentication, OAuth, JWT, Cloud Architecture, AWS, Serverless, Docker, Kubernetes, CI/CD, Distributed Systems, System Design, PostgreSQL, MySQL, MongoDB, Redis, Machine Learning, MLOps, Data Engineering, Apache Spark, Kafka Industries: AI Platforms, SaaS, Enterprise Software, AdTech, MarTech, E-commerce, Marketplaces, FinTech, Payments, Healthcare, HealthTech, Developer Tools, Productivity Tools, Analytics Platforms, Data Platforms, Media, Social Platforms

  • Python
  • Artificial Intelligence
  • Machine Learning
  • Large Language Model
  • Retrieval Augmented Generation
  • Claude
  • API
  • React
  • TypeScript
  • Next.js
  • Node.js
  • JavaScript
  • Docker
  • Amazon Web Services
  • GraphQL
  • PostgreSQL
  • MLOps
  • REST API
  • MongoDB
  • Kubernetes
Holden G.

Woodleaf, North Carolina

$94/hr
5.0
1 jobs

I’m an AI Full-Stack Data Engineer specializing in designing, building, and deploying end-to-end data systems that power analytics, machine learning, and AI-driven applications. I help companies turn raw, messy data into scalable, production-ready pipelines and intelligent systems. My focus is on building reliable data infrastructure, integrating LLMs into real-world products, and enabling data-driven decision-making through clean architecture and automation. I work across the full stack of data engineering from ingestion and transformation to model deployment and API integration, ensuring performance, scalability, and maintainability at every layer. Core expertise includes: Building scalable ETL/ELT pipelines (batch & real-time) Designing data architectures (data lakes, warehouses, lakehouses) LLM integration (OpenAI, LangChain, RAG systems, vector databases) API development (FastAPI, Flask, Node.js) Cloud platforms (AWS, GCP, Azure) MLOps & deployment (Docker, Kubernetes, CI/CD) Data processing frameworks (Spark, Pandas, Airflow, dbt) Database systems (PostgreSQL, MySQL, MongoDB, Snowflake, BigQuery) I prioritize clean code, system reliability, and business impact. Whether it’s building a full data platform from scratch or optimizing existing pipelines, I deliver production-grade solutions that scale.

  • Apache Spark
  • BigQuery
  • Python
  • SQL
  • Apache Airflow
  • dbt
  • Apache Kafka
  • FastAPI
  • REST API
  • Amazon Web Services
  • Google Cloud Platform
  • Azure DevOps
  • Docker
  • Kubernetes
  • Terraform
  • Snowflake
  • PostgreSQL
  • pandas
  • Machine Learning
  • LLM Prompt Engineering
Faz M.

Chicago, Illinois

$40/hr
5.0
9 jobs

𝐈 𝐰𝐢𝐥𝐥 𝐟𝐢𝐧𝐝 𝐭𝐡𝐞 𝐡𝐨𝐥𝐞 𝐢𝐧 𝐲𝐨𝐮𝐫 𝐭𝐫𝐚𝐜𝐤𝐢𝐧𝐠 𝐛𝐞𝐟𝐨𝐫𝐞 𝐲𝐨𝐮 𝐟𝐢𝐧𝐢𝐬𝐡 𝐫𝐞𝐚𝐝𝐢𝐧𝐠 𝐭𝐡𝐢𝐬. 🚀 Google Tag Manager (GTM) | GA4 | Google Ads | Meta Pixel & CAPI Tracking Expert | Looker Studio & Marketing Automation Specialist I help businesses fix broken tracking, improve attribution, and scale profitable PPC campaigns using accurate, end-to-end data systems. Most ad accounts struggle not because of bad campaigns but because of poor tracking, missing conversion data, and disconnected platforms. I solve this by building clean, scalable analytics and automation systems that connect your ads, website, and CRM into one reliable source of truth. 💡 What I Specialize In: ✔️ Google Tag Manager (Web & Server-Side) setup, debugging & scaling ✔️ GA4 advanced tracking (events, funnels, conversions, ecommerce) ✔️ Google Ads & Facebook Ads Manager conversion tracking ✔️ Meta Pixel + Conversion API (CAPI) with deduplication setup ✔️ PPC Campaign Setup & Management tracking systems ✔️ Looker Studio dashboards for unified marketing reporting ✔️ Marketing Analytics & performance tracking systems ✔️ Server-side tracking & first-party data implementation ✔️ GoHighLevel (GHL) CRM setup, funnels, pipelines & automation ✔️ Marketing automation using Zapier & Make (Integromat) ✔️ Shopify tracking setup (events, purchases, attribution) ✔️ WordPress tracking integration & custom event setup ✔️ JavaScript-based tracking customization & dataLayer implementation 📈 Business Outcomes You Get: ✔️ Accurate conversion tracking across all platforms ✔️ Clear attribution (no more “unknown traffic” issues) ✔️ Better ROAS through data-driven optimization ✔️ Reduced wasted ad spend from incorrect tracking ✔️ Full visibility from click → lead → sale → revenue ✔️ Automated reporting dashboards for faster decisions 🧠 Tools & Tech Stack: Google Tag Manager, GA4, Google Ads, Meta Ads, Looker Studio, BigQuery, GoHighLevel (GHL), Shopify, WordPress, JavaScript, Zapier, Make, Facebook Ads Manager, TikTok Pixel, LinkedIn Insight Tag. 🚀 What You Can Expect Working With Me: ✔ Clean, scalable tracking architecture ✔ Clear communication without technical confusion ✔ Systems built for long-term marketing growth ✔ Focus on revenue impact not just data setup If you need help fixing tracking issues, improving ad performance, building automation systems, or creating a full marketing analytics infrastructure, feel free to reach out. Let’s turn your data into a system you can actually scale with confidence. server-side tracking | server-side GTM | sGTM setup | Google Tag Manager | GA4 tracking | GA4 event tracking | GA4 ecommerce tracking | Meta Pixel setup | Meta Conversion API | Facebook CAPI | conversion tracking | tracking implementation | attribution tracking | marketing attribution | ROAS tracking | performance marketing tracking | ecommerce tracking setup | Shopify tracking | WooCommerce tracking | purchase event tracking | add to cart tracking | checkout tracking | funnel tracking | dataLayer implementation | JavaScript tracking | custom event tracking | GTM debugging | Google Ads conversion tracking | enhanced conversions Google Ads | offline conversion tracking | Looker Studio dashboards | marketing dashboards | data visualization reporting | BigQuery analytics | CRM tracking integration | GoHighLevel tracking | GHL automation setup | Zapier integration | Make automation | marketing automation systems | Hyros setup | Triple Whale tracking | cross domain tracking | first party data tracking | consent mode v2 | iOS tracking fix | conversion attribution | tracking audit | analytics implementation | event deduplication | tracking infrastructure | revenue tracking system | funnel analytics

  • Google Analytics 4
  • Data Analysis
  • Data Visualization
  • Looker Studio
  • Meta Pixel
  • Tracking Pixel
  • Web Analytics
  • Analytics
  • Analytics & Tracking Setup
  • WordPress
  • Google Tag Manager
  • Google Analytics
  • Google Ads
  • Marketing Analytics
  • Google Ads Audit
  • JavaScript
  • Marketing Automation
  • Shopify
  • PPC Campaign Setup & Management
  • Facebook Pixel Setup & Optimization

How it works

Post a job for free Post a job

Tell us what you need. Create your own job post or generate one with AI then filter talent matches.

Hire top talent fast

Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.

Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

Payment simplified

Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.

Don't just take our word for it

How do I hire a Data Engineer in the United States on Upwork?

You can hire a Data Engineer in the United States on Upwork in four simple steps:

  • Create a job post tailored to your Data Engineer project scope. We'll walk you through the process step by step.
  • Browse top Data Engineer talent on Upwork and invite them to your project.
  • Once the proposals start flowing in, create a shortlist of top Data Engineer profiles and interview.
  • Hire the right Data Engineer for your project from Upwork, the world's largest work marketplace.

At Upwork, we believe talent staffing should be easy.

How much does it cost to hire a Data Engineer?

Rates charged by Data Engineers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.

Why hire a Data Engineer in the United States on Upwork?

As the world's work marketplace, we connect highly-skilled freelance Data Engineers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Data Engineer team you need to succeed.

Can I hire a Data Engineer in the United States within 24 hours on Upwork?

Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Data Engineer proposals within 24 hours of posting a job description.