Hire the Best Pyspark Developers
in China
Beijing, China
๐ ๐๐จ๐ฉ-๐๐๐ญ๐๐ ๐ ๐ซ๐๐๐ฅ๐๐ง๐๐๐ซ ๐จ๐ง ๐๐ฉ๐ฐ๐จ๐ซ๐ค ๐ ๐ฆ๐๐ฟ๐๐ด๐ด๐น๐ถ๐ป๐ด ๐๐ถ๐๐ต ๐๐น๐ผ๐ ๐ฑ๐ฎ๐๐ฎ ๐ฝ๐ฟ๐ผ๐ฐ๐ฒ๐๐๐ถ๐ป๐ด ๐ผ๐ฟ ๐๐ป๐ฟ๐ฒ๐น๐ถ๐ฎ๐ฏ๐น๐ฒ ๐๐ง๐ ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐? ๐ ๐ฏ๐๐ถ๐น๐ฑ ๐ฎ๐ป๐ฑ ๐ผ๐ฝ๐๐ถ๐บ๐ถ๐๐ฒ ๐ต๐ถ๐ด๐ต-๐ฝ๐ฒ๐ฟ๐ณ๐ผ๐ฟ๐บ๐ฎ๐ป๐ฐ๐ฒ ๐ฑ๐ฎ๐๐ฎ ๐๐๐๐๐ฒ๐บ๐ ๐๐ต๐ฎ๐ ๐๐๐ฟ๐ป ๐บ๐ฎ๐๐๐ถ๐๐ฒ ๐ฑ๐ฎ๐๐ฎ๐๐ฒ๐๐ ๐ถ๐ป๐๐ผ ๐ฎ๐ฐ๐๐ถ๐ผ๐ป๐ฎ๐ฏ๐น๐ฒ ๐ถ๐ป๐๐ถ๐ด๐ต๐๐. With 4+ years of professional experience, I have worked as a Data Engineer at world-class tech giants Xiaomi (Fortune Global 500) and Shopee ($80B Market Cap). My expertise lies in transforming complex data challenges into seamless, efficient, and scalable data workflows. โญ How I Can Elevate Your Business: โ End-to-End ETL/ELT Pipeline Development: I architect and build fully automated data pipelines using Airflow and Apache Spark (Scala/Python/Java), ensuring timely and accurate data for your analytics and machine learning models. โ Spark Performance Tuning & Optimization: Is your Spark job running slow or failing? I specialize in deep-diving into Spark applications to diagnose bottlenecks, optimize resource utilization (memory/CPU), and significantly cut down processing time and cost. โ Big Data Architecture & Solutions: Leveraging modern data stack tools like Kafka, Flink, Hadoop (HDFS, Hive), and Druid, I design and implement scalable Data Lakes and Data Warehouses tailored to your specific business needs. โ Data Quality & Integrity: I implement rigorous data cleaning and validation processes, transforming raw, messy data into a pristine, reliable asset for your decision-making. โญ Core Technical Skills: โ Big Data Ecosystem: Apache Spark, Airflow, Kafka, Flink, Hadoop, Hive, HDFS, Druid โ Programming: Scala, Python, Java, SQL โ Databases: HBase, Redis (NoSQL), Relational SQL Databases โ Platforms: Data Lake, Data Warehouse, AWS, GCP โญ What Sets Me Apart: โ Problem-Solver, Not Just a Coder: I focus on understanding your business goals first, then architect a solution that delivers real value. My work at Xiaomi was praised for not just meeting, but exceeding expectations by foreseeing future needs. โ Proactive & Clear Communication: You will always be in the loop. I believe in transparent, frequent updates to ensure the project aligns perfectly with your vision. โ Partnership & Kindness: I see my clients as partners. I am committed to working collaboratively and kindly to make your project a success and your life easier. Ready to build a data infrastructure that drives your business forward? ๐๐จ๐ฆ๐ง ๐๐๐๐๐ ๐ข๐ก ๐๐ก๐ฉ๐๐ง๐ ๐๐จ๐ง๐ง๐ข๐ก and let's have a quick chat about your project goals.
- Apache Spark
- Python
- Scala
- Big Data
- Apache Flink
- Hive
- Java
- Apache Hadoop
- Apache Kafka
- ETL Pipeline
- SQL
- Data Engineering
- Data Extraction
- Data Cleaning
Beijing, China
AWS Certified Data Engineer and Data Analyst with 5+ years of experience designing scalable data pipelines, building robust ETL workflows, and transforming raw data into actionable insights. Proven expertise in AWS cloud services, big data processing, and advanced analytics to drive data-driven decision-making. Dedicated to delivering high-quality data solutions that optimize performance, reduce costs, and unlock business value. Core Skills & Technologies ๐ท Data Engineering: AWS Stack: Redshift, Glue, EMR, Athena, Kinesis, Lambda, S3, CloudFormation ETL/ELT: Apache Spark, PySpark, AWS Glue, Airflow, Kafka Data Warehousing: Snowflake, Redshift optimization Big Data: Hadoop, Hive, Databricks, real-time streaming ๐ท Data Analysis & Visualization: SQL, Python (Pandas, NumPy), R BI Tools: Tableau, Power BI, AWS QuickSight Advanced Analytics: Predictive modeling, regression, clustering ๐ท Infrastructure: IaC (Terraform), Docker, CI/CD pipelines, AWS Security (IAM, KMS) Services Offered ๐ทEnd-to-End Data Pipeline Development โ Build automated, fault-tolerant data pipelines (batch/streaming) using AWS services โ Migrate on-premise data systems to AWS cloud (e.g., S3 โ Redshift) ๐ทData Warehousing & Modeling โ Design star/snowflake schemas, optimize Redshift clusters, manage data lakes ๐ทETL Optimization โ Refactor legacy ETL jobs to Spark/Glue for 50%+ faster processing ๐ทAnalytics & Reporting โ Create interactive Tableau/Power BI dashboards for KPIs and forecasting ๐ทAd-Hoc Analysis โ Clean, analyze, and visualize datasets to uncover growth opportunities Why Hire Me? โ AWS Certified Data Engineer- Associate โ Performance-Driven: 40% lower query costs | 60% pipeline efficiency gains โ Full-Cycle Delivery: Architecture โ Deployment โ Monitoring โ Documentation โ Client Focus: Responsive communication | Agile workflow | Transparent timelines AWS Certified Data Engineer, Data Analyst, Big Data Engineer, ETL Developer, Data Pipeline, AWS Glue, Redshift, PySpark, SQL Data Analyst, Tableau Specialist, Data Warehousing, Kinesis, Lambda, S3, Data Lake, Power BI, Data Modeling, Business Intelligence, Machine Learning, Data Visualization, Cloud Migration
- PySpark
- SQL
- Python
- Data Science
- ETL
- Microsoft Power BI
- Data Analysis
- Business Analysis
- Business Intelligence
- AWS Glue
- Data Engineering
- Data Lake
- Data Warehousing
- Amazon Redshift
- Data Ingestion
Shanghai, China
Work as tech lead & chief architect for a web company serving 4m+ users. Collect and store user behavior data & transaction data, perform daily and ad hoc ETL on the data, find useful pattern and provide BI visualization; design & develop server backend and providing apis to end users. Able to help a middle sized tech firm to setup OLAP(Online Analytical Processing) & OLTP(Online Transactional Processing) platform from zero. Has rich working experience in Spark, MongoDB, Fastapi, Clickhouse, Pandas, Numpy, Terraform, Agent development, AWS environment(Lambda, Athena, DynamoDB, Glue, EMR, Bedrock, Cognito, QuickSight, Sagemaker, etc). Rich coding experience in Python/Scala/Java. Rich experience in Machine Learning especially NLP. Can work as part-time job, 10~20 hours per week.
- Apache Spark
- Scala
- pandas
- AWS Lambda
- AWS Glue
- MongoDB
- Terraform
- Apache Airflow
- ClickHouse
- FastAPI
- Amazon DynamoDB
- Metabase
- AI Agent Development
- SQL
- Amazon Bedrock
Wuhan, China
Iโm a developer with experience in NLPใLLM ใ CV and recommendation system and bigdata. 1. Iโm experienced in RAGใlangchainใdeepspeedใllmใ object dectionใvideo generation 2.Iโm experienced in hadoop/hdfs/yarn/hbase/redis/kafka/hive, 2. Iโm experienced in tensorflow/kubeflow/tf serving/trtion serving 3.Iโm experienced in k8s/docker/html/jqurey/spring boot/mybatis 4.Iโm experienced in aws componet, such as emr/ec2/s3/code deploy
- Apache Spark
- PySpark
- TensorFlow
- Java
- Apache Flink
- Kubernetes
- Spring Boot
- Apache Hadoop
- Artificial Intelligence
- Big Data
- AWS Application
- LLM Prompt Engineering
- LangChain
- jQuery
How it works
Post a job for free Post a job
Tell us what you need. Create your own job post or generate one with AI then filter talent matches.
Hire top talent fast
Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.
Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
Payment simplified
Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.
Don't just take our word for it
โUpwork provides an umbrella-level of security. I can see a talentโs work history and ratings. I can hold payments in escrow. I can communicate through Upwork Messages instead of working through my email address.โ
Kim Darling
Emerald Tiger
โUpwork is the best platform to hire skilled professionals when we're not looking for a full-time employee. All the companies in our portfolio use Upwork to find talent across a wide range of fields.โ
David Merry
Kinetic Investments
โOur very specific requirements can be a challengeโWith Upwork, weโre able to access a bigger community to ensure the success of our projects.โ
Katja Krohn
Summa Linguae
How do I hire a Pyspark Developer in China on Upwork?
You can hire a Pyspark Developer in China on Upwork in four simple steps:
- Create a job post tailored to your Pyspark Developer project scope. We'll walk you through the process step by step.
- Browse top Pyspark Developer talent on Upwork and invite them to your project.
- Once the proposals start flowing in, create a shortlist of top Pyspark Developer profiles and interview.
- Hire the right Pyspark Developer for your project from Upwork, the world's largest work marketplace.
At Upwork, we believe talent staffing should be easy.
How much does it cost to hire a Pyspark Developer?
Rates charged by Pyspark Developers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.
Why hire a Pyspark Developer in China on Upwork?
As the world's work marketplace, we connect highly-skilled freelance Pyspark Developers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Pyspark Developer team you need to succeed.
Can I hire a Pyspark Developer in China within 24 hours on Upwork?
Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Pyspark Developer proposals within 24 hours of posting a job description.
Find more freelancers
Top cities for Pyspark Developers in China
- Logistics & Shipping Specialists in Tianjin, CN
- Interpretation Specialists in Shanghai, CN
- Quality Control Freelancers in Guangzhou, CN
- Quality Control Freelancers in Shenzhen, CN
- Administrative Assistants in Guangzhou, CN
- Sourcing Specialists in Shanghai, CN
- Sourcing Specialists in Taiyuan, CN
- Sourcing Specialists in Shenzhen, CN
- Sourcing Specialists in Beijing, CN
- Sourcing Specialists in Dongguan, CN
- Sourcing Specialists in Foshan, CN
- Sourcing Specialists in Guangzhou, CN
- Amazon FBA Assistants in Shenzhen, CN
- Translators in Shenzhen, CN
- Translators in Zhengzhou, CN
- Translators in Qingdao, CN
More top skills in China
- Big Data Engineers in China
- Pandas Developers in China
- Machine Learning Engineers in China
- Data Scientists in China
- Algorithm Developers in China
- Data Analysts in China
- AI Freelancers in China
- MATLAB Developers in China
- Data Entry Specialists in China
- Scrapy Developers in China
- Algorithms Engineers in China
- Express Js Developers in China
- Web Crawling Freelancers in China
- Python Numpy Developers in China
- Web Crawler Developers in China
- Stata Specialists in China