Hire the Best Pyspark Developers
in Poland

More than 3,000 reviews on G2
Rating is 4.5 out of 5.
4.5/5
of Upwork by G2 peer reviewers
Sylwester N.

Warsaw, Poland

$110/hr
5.0
25 jobs

๐Ÿš€ Build scalable data infrastructureโ€ƒ๐Ÿ›  Automate data workflowsโ€ƒ๐Ÿ“Š Deliver actionable analytics Iโ€™m a certified data consultant specializing in Microsoft Fabric, Azure Databricks, SQL, Python, and Power BI. I help companies, from SaaS startups to global enterprises, turn complex, fragmented data into reliable, analytics-ready datasets that drive faster decisions and product growth. My work spans the full data lifecycle: designing architecture, building ETL pipelines, managing data lakes, and delivering secure reporting layers. Iโ€™ve delivered 20+ successful remote projects across logistics, maritime, energy, and real estate - from setting up 5+ Microsoft Fabric environments from scratch to managing infrastructure for Verizon AIโ€™s 80M+ company dataset. Core Skills ๐Ÿ“ถ Data Engineering & Analytics: Microsoft Fabric, Azure Databricks, SQL Server, T-SQL, Python, PySpark, DAX, M โ˜๏ธ Cloud & Orchestration: Azure Data Factory, ADLS Gen2, Medallion Architecture, CI/CD, Git ๐Ÿ”Œ Integrations: REST APIs, CRMs (Salesforce, HubSpot), ERPs (SAP, QuickBooks), IoT data ๐Ÿ“Š Data Visualization: Power BI (Embedded), Apache Superset Example Use Cases End-to-end SaaS data infrastructure for lead-generation platforms ETL pipelines consolidating CRM, ERP, and IoT data into analytics-ready datasets Embedded analytics portals for client KPI tracking Role-based reporting for operations, finance, and product teams Microsoft Certified PL-300 โ€“ Power BI Data Analyst Associate DP-700 โ€“ Fabric Data Engineer Associate DP-500 โ€“ Azure Enterprise Data Analyst Associate Clients choose me because I: ๐Ÿ”ธ Deliver on time and to spec ๐Ÿ”ธ Take ownership of the whole solution, from ingestion to analytics ๐Ÿ”ธ Communicate clearly with both technical and non-technical teams ๐Ÿ”ธ Bring both enterprise-scale engineering and startup agility to projects If youโ€™re looking for a technical data consultant who can architect, build, and optimize your data infrastructure - letโ€™s connect.

  • Microsoft Power BI
  • Fabric
  • Microsoft Excel
  • SQL
  • Data Visualization
  • Dashboard
  • Business Intelligence
  • Power Query
  • Microsoft Power Automate
  • Microsoft Power BI Data Visualization
  • Microsoft Power BI Development
  • Data Engineering
  • Data Analysis
  • ETL
  • Database
Mariusz S.

Brzozowka, Poland

$100/hr
5.0
50 jobs

I have over 9 years of experience in Data Engineering (especially using Spark and pySpark to gain value from massive amounts of data). I worked with analysts and data scientists by conducting workshops on working in Hadoop/Spark and resolving their issues with big data ecosystem. I also have experience on Hadoop maintenance and building ETL, especially between Hadoop and Kafka. You can find my profile on stackoverflow (link in Portfolio section) - I help mostly in spark and pyspark tagged questions.

  • Apache Spark
  • PySpark
  • Apache Hadoop
  • Apache Kafka
  • Apache Airflow
  • Data Migration
  • Python
  • Data Visualization
  • ETL
  • Data Scraping
  • Data Warehousing
  • MongoDB
Mykola M.

Marki, Poland

$80/hr
5.0
39 jobs

Full-Stack AI Lead with 15+ years of experience in AI/ML/Data Engineering. Specialist in productionizing Document AI, Agentic LLM Workflows, and Scalable Big Data systems. Masterโ€™s in Applied Mathematics. ๐€๐ ๐ž๐ง๐ญ๐ข๐œ ๐€๐ˆ & ๐‹๐‹๐Œ - Architecting production-grade agents and multi-intent workflows using LangGraph and LangChain. - Implementing NeMo Guardrails, DeepEval, and Langfuse to ensure security and near-zero hallucination rates. - Developing high-performance FastAPI backends for real-time operational data integration. ๐ƒ๐จ๐œ๐ฎ๐ฆ๐ž๐ง๐ญ ๐ˆ๐ง๐ญ๐ž๐ฅ๐ฅ๐ข๐ ๐ž๐ง๐œ๐ž & ๐Ž๐‚๐‘ - Specialist in extracting structure from "dirty" or non-digital sources using Visual NLP and OCR. - Lead Developer of Spark OCR. - Author and main contributor of open-source projects: Spark PDF and ScaleDP. - Expertise in document layout analysis, data extraction, and semantic search. ๐๐ซ๐จ๐๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐Œ๐‹๐Ž๐ฉ๐ฌ & ๐๐ข๐  ๐ƒ๐š๐ญ๐š - Architecting scalable infrastructures using Vector DBs (Milvus, Pinecone, Qdrant). - Optimizing Apache Spark (PySpark, Streaming) and Databricks pipelines; reduced processing time from days to hours. - Implementing robust CI/CD via Databricks Asset Bundles, Jenkins, and GitHub Actions. ๐๐ซ๐ข๐ฏ๐š๐œ๐ฒ & ๐‚๐จ๐ฆ๐ฉ๐ฅ๐ข๐š๐ง๐œ๐ž - Automated de-identification and redaction for GDPR/HIPAA. - Specialist in masking sensitive information in DICOM, PDF, and image formats. ๐“๐ž๐œ๐ก๐ง๐จ๐ฅ๐จ๐ ๐ข๐ž๐ฌ AI & LLM: LangGraph, LangChain, RAG, Llama 3, Gemini, GPT-4, Vertex AI, Hugging Face, GliNER. Backend: Python, Scala, FastAPI, Node.js, PostgreSQL, MongoDB. Vector DBs: Pinecone, Milvus, Qdrant. Infrastructure: AWS, GCP, Azure, Docker, Kubernetes, Jenkins, AirFlow. Big Data: Apache Spark (PySpark, Streaming, MLlib), Kafka, Kinesis.

  • Apache Spark
  • PySpark
  • Natural Language Processing
  • Computer Vision
  • PyTorch
  • Scala
  • Python
  • Tesseract OCR
  • Machine Learning
  • Large Language Model
  • Hugging Face
  • Software Architecture & Design
  • Databricks Platform
  • LangChain
  • AI Development
  • AI Agent Development
  • Generative AI
  • Vector Database
  • Recommendation System
  • Chatbot
Artsiom S.

Warsaw, Poland

$20/hr
5.0
4 jobs

- 10+ years of professional experience in JVM-based software development. - Expertise in Scalaโ€™s modern FP stack (Typelevel, ZIO, Akka), data engineering (Hadoop, AWS, Spark), and Java (Spring) for high-performance solutions. - A product-oriented mindset, focused on understanding the business domain deeply to deliver solutions that align with and exceed business expectations.

  • Apache Spark
  • PySpark
  • Scala
  • Python
  • Apache Kafka
  • Apache Flink
  • Java
  • Microsoft Azure
  • Amazon Web Services
  • Google Cloud Platform
  • ClickHouse
  • PostgreSQL
  • Microsoft SQL Server
  • Distributed Computing
  • Distributed Database
Oleh S.

Warsaw, Poland

$40/hr
5.0
5 jobs

๐Ÿ”ฅ Looking for a Data Engineer who knows how to scale? I help enterprises transform raw data into real-time insights using Databricks, Azure, and streaming architectures that deliver results. ๐Ÿ‘‰ Press โ€œ๐ˆ๐๐•๐ˆ๐“๐„ ๐“๐Ž ๐“๐‡๐„ ๐‰๐Ž๐โ€ or โ€œ๐’๐„๐๐ƒ ๐€ ๐Œ๐„๐’๐’๐€๐†๐„โ€ and letโ€™s talk about how I can help you build a high-performance data platform. ๐Ÿš€ Case Study: I partnered with Databricks Professional Services (via Azure) to deliver a production-grade, structured streaming pipeline for a client that ingests over 4 billion events per day. The result? A scalable, monitored system that powers real-time analytics and intelligent automationโ€”proving that AI Chatbots and Agents can thrive even in high-volume, enterprise-grade environments. ๐Ÿ“œ Certifications: ๐Ÿง  Generative AI โ€“ Databricks ๐Ÿ—๏ธ Azure Databricks Platform Architect โ€“ Databricks ๐Ÿงฑ Databricks Lakehouse โ€“ Databricks ๐Ÿ”ง Databricks Platform โ€“ Databricks ๐Ÿ›ก๏ธ Platform Administrator โ€“ Databricks Iโ€™m a certified Data Engineer with deep expertise in Databricks, Azure, and distributed data systems. Iโ€™ve worked with enterprise clients (1000+ employees) to deploy scalable, multi-region data platforms that power real-time analytics and business growth. โœ… As a Data Engineer, I deploy enterprise-grade data infrastructure by rolling out Databricks across 13 regions to support global operations with high availability and performance. โœ… As a Data Engineer, I build streaming pipelines that process billions of records daily with near real-time SLAs (<1 minute), enabling fast, actionable insights. โœ… As a Data Engineer, I drive impact-driven analytics that enabled a client to onboard 500+ new customers by unlocking insights from their data lake through Databricks-powered analysis. โœ… As a Data Engineer, I collaborate with Microsoft Professional Services to optimize Azure and Databricks performance, ensuring cost efficiency and speed at scale. โœ… As a Data Engineer, I design secure and reliable architecturesโ€”implementing Medallion Architecture, Delta Lake optimizations, and CI/CD workflows for robust data operations. ๐Ÿค” Whatโ€™s stopping us from building something powerful together? Send me a message, and letโ€™s kick off your next data engineering project!

  • Apache Spark
  • SQL
  • ETL Pipeline
  • Big Data
  • Python
  • Data Analysis
  • Machine Learning
  • Data Engineering
  • Amazon Web Services
  • Data Science
  • Data Warehousing
  • ETL
  • BigQuery
  • Data Scraping
  • Data Extraction
David A.

Wroclaw, Poland

$15/hr
4.8
21 jobs

๐Ÿ’ชTechnical Expertise: โœ… Python -> FastAPI, PostMan,BeautifulSoup, Selenium, Requests โœ… SQL -> PostgreSQL, SQLite, MS SQL, MySQL โœ… No-SQL Databases (MongoDB) โœ… Orchestration -> Airflow, Talend, Pentaho, Databricks, Azure Data Factory) โœ… Containerization and CI/CD (Docker, Kubernetes) โœ… Cloud Provider Experience (Azure, AWS) โœ… Data Modelling ๐Œ๐ฒ ๐’๐ž๐ซ๐ฏ๐ข๐œ๐ž๐ฌ ๐Ÿ’ป: ๐Ÿ› ๏ธ ๐—˜๐—ง๐—Ÿ ๐—ฃ๐—ถ๐—ฝ๐—ฒ๐—น๐—ถ๐—ป๐—ฒ: Need that pipeline to move your data from source to destination? JSON, CSV, EXCEL, SQL sources to any database of your choice. On-prem or cloud platform? You want to use Databricks or another service? In real-time or batches? You need consultations on choosing the best approach/service/design for your business usecase? I have got you covered - just a booking away to providing you the optimal solution. ๐Ÿ“Š ๐–๐ž๐› ๐’๐œ๐ซ๐š๐ฉ๐ข๐ง๐ : Data collation can be a tedious task asmost of the data are not readily available in a needed format and/or location, JavaScript complexities, dynamic pages, login - I have got you covered. Want your data collated in an organized and formatted manner - JSON, CSV, XLSX, or loaded to a database via an API - just name it. ๐ŸŒ ๐€๐๐ˆ ๐ƒ๐ž๐ฏ๐ž๐ฅ๐จ๐ฉ๐ฆ๐ž๐ง๐ญ: Looking for a reliable REST API for your business? Our solution delivers robust performance, secure authentication, and clear Pydantic validations. With intuitive documentation, we make integration effortless. Let's streamline your business processes! ๐Ÿ“ˆ ๐ƒ๐š๐ญ๐š ๐€๐ง๐š๐ฅ๐ฒ๐ฌ๐ข๐ฌ: Looking to uncover insights from your business data to drive revenue growth, optimize processes, or reduce costs? Our data analysis offers detailed insights and clear visualizations to help you make informed decisions. Let's transform your data into actionable strategies! โš™๏ธ ๐€๐ฎ๐ญ๐จ๐ฆ๐š๐ญ๐ข๐จ๐ง๐ฌ: Need a UI to automate your tasks - tkinter, Gradio, Streamlit? I have got you covered. Just with a button click, all your tasks will be automated and save you the stress of doing them manually. ๐Ÿ›  ๐‚๐ฎ๐ฌ๐ญ๐จ๐ฆ ๐’๐œ๐ซ๐ข๐ฉ๐ญ๐ข๐ง๐ : Developing custom scripts and applications for specific use case which includes data cleaning, automated form filling and task scheduling. Just one click away. ๐Ÿ” ๐–๐ก๐ฒ ๐ฅ๐ž๐ญ ๐ฆ๐ž ๐ก๐š๐ง๐๐ฅ๐ž ๐ฒ๐จ๐ฎ๐ซ ๐ญ๐š๐ฌ๐ค๐ฌ? โœ”๏ธ Diverse Experience โœ”๏ธ Proven track record ๐Ÿ“‹ of succesful deliveries. โœ”๏ธ Attention to detail - Watchful eyes ๐Ÿ‘€ to ensure nothing is missed . โœ”๏ธ On-time delivery - You'll get your deliverable as at when due โฐ, even earlier ๐Ÿ˜ โœ”๏ธ Excellent communication skills - All details of the project at your finger tips ๐Ÿ“ž

  • Microsoft Excel
  • Python
  • Data Scraping
  • SQL
  • Microsoft Power BI
  • Data Analytics
  • Report Writing
  • Microsoft SQL Server
  • Scripting
  • Web Scraping
  • PostgreSQL
  • API
  • Data Extraction
  • Microsoft Azure

How it works

Post a job for free Post a job

Tell us what you need. Create your own job post or generate one with AI then filter talent matches.

Hire top talent fast

Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.

Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

Payment simplified

Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.

Don't just take our word for it

How do I hire a Pyspark Developer in Poland on Upwork?

You can hire a Pyspark Developer in Poland on Upwork in four simple steps:

  • Create a job post tailored to your Pyspark Developer project scope. We'll walk you through the process step by step.
  • Browse top Pyspark Developer talent on Upwork and invite them to your project.
  • Once the proposals start flowing in, create a shortlist of top Pyspark Developer profiles and interview.
  • Hire the right Pyspark Developer for your project from Upwork, the world's largest work marketplace.

At Upwork, we believe talent staffing should be easy.

How much does it cost to hire a Pyspark Developer?

Rates charged by Pyspark Developers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.

Why hire a Pyspark Developer in Poland on Upwork?

As the world's work marketplace, we connect highly-skilled freelance Pyspark Developers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Pyspark Developer team you need to succeed.

Can I hire a Pyspark Developer in Poland within 24 hours on Upwork?

Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Pyspark Developer proposals within 24 hours of posting a job description.