Hire the Best Pyspark Developers
in Poland
Warsaw, Poland
๐ Build scalable data infrastructureโ๐ Automate data workflowsโ๐ Deliver actionable analytics Iโm a certified data consultant specializing in Microsoft Fabric, Azure Databricks, SQL, Python, and Power BI. I help companies, from SaaS startups to global enterprises, turn complex, fragmented data into reliable, analytics-ready datasets that drive faster decisions and product growth. My work spans the full data lifecycle: designing architecture, building ETL pipelines, managing data lakes, and delivering secure reporting layers. Iโve delivered 20+ successful remote projects across logistics, maritime, energy, and real estate - from setting up 5+ Microsoft Fabric environments from scratch to managing infrastructure for Verizon AIโs 80M+ company dataset. Core Skills ๐ถ Data Engineering & Analytics: Microsoft Fabric, Azure Databricks, SQL Server, T-SQL, Python, PySpark, DAX, M โ๏ธ Cloud & Orchestration: Azure Data Factory, ADLS Gen2, Medallion Architecture, CI/CD, Git ๐ Integrations: REST APIs, CRMs (Salesforce, HubSpot), ERPs (SAP, QuickBooks), IoT data ๐ Data Visualization: Power BI (Embedded), Apache Superset Example Use Cases End-to-end SaaS data infrastructure for lead-generation platforms ETL pipelines consolidating CRM, ERP, and IoT data into analytics-ready datasets Embedded analytics portals for client KPI tracking Role-based reporting for operations, finance, and product teams Microsoft Certified PL-300 โ Power BI Data Analyst Associate DP-700 โ Fabric Data Engineer Associate DP-500 โ Azure Enterprise Data Analyst Associate Clients choose me because I: ๐ธ Deliver on time and to spec ๐ธ Take ownership of the whole solution, from ingestion to analytics ๐ธ Communicate clearly with both technical and non-technical teams ๐ธ Bring both enterprise-scale engineering and startup agility to projects If youโre looking for a technical data consultant who can architect, build, and optimize your data infrastructure - letโs connect.
- Microsoft Power BI
- Fabric
- Microsoft Excel
- SQL
- Data Visualization
- Dashboard
- Business Intelligence
- Power Query
- Microsoft Power Automate
- Microsoft Power BI Data Visualization
- Microsoft Power BI Development
- Data Engineering
- Data Analysis
- ETL
- Database
Brzozowka, Poland
I have over 9 years of experience in Data Engineering (especially using Spark and pySpark to gain value from massive amounts of data). I worked with analysts and data scientists by conducting workshops on working in Hadoop/Spark and resolving their issues with big data ecosystem. I also have experience on Hadoop maintenance and building ETL, especially between Hadoop and Kafka. You can find my profile on stackoverflow (link in Portfolio section) - I help mostly in spark and pyspark tagged questions.
- Apache Spark
- PySpark
- Apache Hadoop
- Apache Kafka
- Apache Airflow
- Data Migration
- Python
- Data Visualization
- ETL
- Data Scraping
- Data Warehousing
- MongoDB
Marki, Poland
Full-Stack AI Lead with 15+ years of experience in AI/ML/Data Engineering. Specialist in productionizing Document AI, Agentic LLM Workflows, and Scalable Big Data systems. Masterโs in Applied Mathematics. ๐๐ ๐๐ง๐ญ๐ข๐ ๐๐ & ๐๐๐ - Architecting production-grade agents and multi-intent workflows using LangGraph and LangChain. - Implementing NeMo Guardrails, DeepEval, and Langfuse to ensure security and near-zero hallucination rates. - Developing high-performance FastAPI backends for real-time operational data integration. ๐๐จ๐๐ฎ๐ฆ๐๐ง๐ญ ๐๐ง๐ญ๐๐ฅ๐ฅ๐ข๐ ๐๐ง๐๐ & ๐๐๐ - Specialist in extracting structure from "dirty" or non-digital sources using Visual NLP and OCR. - Lead Developer of Spark OCR. - Author and main contributor of open-source projects: Spark PDF and ScaleDP. - Expertise in document layout analysis, data extraction, and semantic search. ๐๐ซ๐จ๐๐ฎ๐๐ญ๐ข๐จ๐ง ๐๐๐๐ฉ๐ฌ & ๐๐ข๐ ๐๐๐ญ๐ - Architecting scalable infrastructures using Vector DBs (Milvus, Pinecone, Qdrant). - Optimizing Apache Spark (PySpark, Streaming) and Databricks pipelines; reduced processing time from days to hours. - Implementing robust CI/CD via Databricks Asset Bundles, Jenkins, and GitHub Actions. ๐๐ซ๐ข๐ฏ๐๐๐ฒ & ๐๐จ๐ฆ๐ฉ๐ฅ๐ข๐๐ง๐๐ - Automated de-identification and redaction for GDPR/HIPAA. - Specialist in masking sensitive information in DICOM, PDF, and image formats. ๐๐๐๐ก๐ง๐จ๐ฅ๐จ๐ ๐ข๐๐ฌ AI & LLM: LangGraph, LangChain, RAG, Llama 3, Gemini, GPT-4, Vertex AI, Hugging Face, GliNER. Backend: Python, Scala, FastAPI, Node.js, PostgreSQL, MongoDB. Vector DBs: Pinecone, Milvus, Qdrant. Infrastructure: AWS, GCP, Azure, Docker, Kubernetes, Jenkins, AirFlow. Big Data: Apache Spark (PySpark, Streaming, MLlib), Kafka, Kinesis.
- Apache Spark
- PySpark
- Natural Language Processing
- Computer Vision
- PyTorch
- Scala
- Python
- Tesseract OCR
- Machine Learning
- Large Language Model
- Hugging Face
- Software Architecture & Design
- Databricks Platform
- LangChain
- AI Development
- AI Agent Development
- Generative AI
- Vector Database
- Recommendation System
- Chatbot
Warsaw, Poland
- 10+ years of professional experience in JVM-based software development. - Expertise in Scalaโs modern FP stack (Typelevel, ZIO, Akka), data engineering (Hadoop, AWS, Spark), and Java (Spring) for high-performance solutions. - A product-oriented mindset, focused on understanding the business domain deeply to deliver solutions that align with and exceed business expectations.
- Apache Spark
- PySpark
- Scala
- Python
- Apache Kafka
- Apache Flink
- Java
- Microsoft Azure
- Amazon Web Services
- Google Cloud Platform
- ClickHouse
- PostgreSQL
- Microsoft SQL Server
- Distributed Computing
- Distributed Database
Warsaw, Poland
๐ฅ Looking for a Data Engineer who knows how to scale? I help enterprises transform raw data into real-time insights using Databricks, Azure, and streaming architectures that deliver results. ๐ Press โ๐๐๐๐๐๐ ๐๐ ๐๐๐ ๐๐๐โ or โ๐๐๐๐ ๐ ๐๐๐๐๐๐๐โ and letโs talk about how I can help you build a high-performance data platform. ๐ Case Study: I partnered with Databricks Professional Services (via Azure) to deliver a production-grade, structured streaming pipeline for a client that ingests over 4 billion events per day. The result? A scalable, monitored system that powers real-time analytics and intelligent automationโproving that AI Chatbots and Agents can thrive even in high-volume, enterprise-grade environments. ๐ Certifications: ๐ง Generative AI โ Databricks ๐๏ธ Azure Databricks Platform Architect โ Databricks ๐งฑ Databricks Lakehouse โ Databricks ๐ง Databricks Platform โ Databricks ๐ก๏ธ Platform Administrator โ Databricks Iโm a certified Data Engineer with deep expertise in Databricks, Azure, and distributed data systems. Iโve worked with enterprise clients (1000+ employees) to deploy scalable, multi-region data platforms that power real-time analytics and business growth. โ As a Data Engineer, I deploy enterprise-grade data infrastructure by rolling out Databricks across 13 regions to support global operations with high availability and performance. โ As a Data Engineer, I build streaming pipelines that process billions of records daily with near real-time SLAs (<1 minute), enabling fast, actionable insights. โ As a Data Engineer, I drive impact-driven analytics that enabled a client to onboard 500+ new customers by unlocking insights from their data lake through Databricks-powered analysis. โ As a Data Engineer, I collaborate with Microsoft Professional Services to optimize Azure and Databricks performance, ensuring cost efficiency and speed at scale. โ As a Data Engineer, I design secure and reliable architecturesโimplementing Medallion Architecture, Delta Lake optimizations, and CI/CD workflows for robust data operations. ๐ค Whatโs stopping us from building something powerful together? Send me a message, and letโs kick off your next data engineering project!
- Apache Spark
- SQL
- ETL Pipeline
- Big Data
- Python
- Data Analysis
- Machine Learning
- Data Engineering
- Amazon Web Services
- Data Science
- Data Warehousing
- ETL
- BigQuery
- Data Scraping
- Data Extraction
Wroclaw, Poland
๐ชTechnical Expertise: โ Python -> FastAPI, PostMan,BeautifulSoup, Selenium, Requests โ SQL -> PostgreSQL, SQLite, MS SQL, MySQL โ No-SQL Databases (MongoDB) โ Orchestration -> Airflow, Talend, Pentaho, Databricks, Azure Data Factory) โ Containerization and CI/CD (Docker, Kubernetes) โ Cloud Provider Experience (Azure, AWS) โ Data Modelling ๐๐ฒ ๐๐๐ซ๐ฏ๐ข๐๐๐ฌ ๐ป: ๐ ๏ธ ๐๐ง๐ ๐ฃ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ: Need that pipeline to move your data from source to destination? JSON, CSV, EXCEL, SQL sources to any database of your choice. On-prem or cloud platform? You want to use Databricks or another service? In real-time or batches? You need consultations on choosing the best approach/service/design for your business usecase? I have got you covered - just a booking away to providing you the optimal solution. ๐ ๐๐๐ ๐๐๐ซ๐๐ฉ๐ข๐ง๐ : Data collation can be a tedious task asmost of the data are not readily available in a needed format and/or location, JavaScript complexities, dynamic pages, login - I have got you covered. Want your data collated in an organized and formatted manner - JSON, CSV, XLSX, or loaded to a database via an API - just name it. ๐ ๐๐๐ ๐๐๐ฏ๐๐ฅ๐จ๐ฉ๐ฆ๐๐ง๐ญ: Looking for a reliable REST API for your business? Our solution delivers robust performance, secure authentication, and clear Pydantic validations. With intuitive documentation, we make integration effortless. Let's streamline your business processes! ๐ ๐๐๐ญ๐ ๐๐ง๐๐ฅ๐ฒ๐ฌ๐ข๐ฌ: Looking to uncover insights from your business data to drive revenue growth, optimize processes, or reduce costs? Our data analysis offers detailed insights and clear visualizations to help you make informed decisions. Let's transform your data into actionable strategies! โ๏ธ ๐๐ฎ๐ญ๐จ๐ฆ๐๐ญ๐ข๐จ๐ง๐ฌ: Need a UI to automate your tasks - tkinter, Gradio, Streamlit? I have got you covered. Just with a button click, all your tasks will be automated and save you the stress of doing them manually. ๐ ๐๐ฎ๐ฌ๐ญ๐จ๐ฆ ๐๐๐ซ๐ข๐ฉ๐ญ๐ข๐ง๐ : Developing custom scripts and applications for specific use case which includes data cleaning, automated form filling and task scheduling. Just one click away. ๐ ๐๐ก๐ฒ ๐ฅ๐๐ญ ๐ฆ๐ ๐ก๐๐ง๐๐ฅ๐ ๐ฒ๐จ๐ฎ๐ซ ๐ญ๐๐ฌ๐ค๐ฌ? โ๏ธ Diverse Experience โ๏ธ Proven track record ๐ of succesful deliveries. โ๏ธ Attention to detail - Watchful eyes ๐ to ensure nothing is missed . โ๏ธ On-time delivery - You'll get your deliverable as at when due โฐ, even earlier ๐ โ๏ธ Excellent communication skills - All details of the project at your finger tips ๐
- Microsoft Excel
- Python
- Data Scraping
- SQL
- Microsoft Power BI
- Data Analytics
- Report Writing
- Microsoft SQL Server
- Scripting
- Web Scraping
- PostgreSQL
- API
- Data Extraction
- Microsoft Azure
How it works
Post a job for free Post a job
Tell us what you need. Create your own job post or generate one with AI then filter talent matches.
Hire top talent fast
Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.
Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
Payment simplified
Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.
Don't just take our word for it
โUpwork provides an umbrella-level of security. I can see a talentโs work history and ratings. I can hold payments in escrow. I can communicate through Upwork Messages instead of working through my email address.โ
Kim Darling
Emerald Tiger
โUpwork is the best platform to hire skilled professionals when we're not looking for a full-time employee. All the companies in our portfolio use Upwork to find talent across a wide range of fields.โ
David Merry
Kinetic Investments
โOur very specific requirements can be a challengeโWith Upwork, weโre able to access a bigger community to ensure the success of our projects.โ
Katja Krohn
Summa Linguae
How do I hire a Pyspark Developer in Poland on Upwork?
You can hire a Pyspark Developer in Poland on Upwork in four simple steps:
- Create a job post tailored to your Pyspark Developer project scope. We'll walk you through the process step by step.
- Browse top Pyspark Developer talent on Upwork and invite them to your project.
- Once the proposals start flowing in, create a shortlist of top Pyspark Developer profiles and interview.
- Hire the right Pyspark Developer for your project from Upwork, the world's largest work marketplace.
At Upwork, we believe talent staffing should be easy.
How much does it cost to hire a Pyspark Developer?
Rates charged by Pyspark Developers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.
Why hire a Pyspark Developer in Poland on Upwork?
As the world's work marketplace, we connect highly-skilled freelance Pyspark Developers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Pyspark Developer team you need to succeed.
Can I hire a Pyspark Developer in Poland within 24 hours on Upwork?
Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Pyspark Developer proposals within 24 hours of posting a job description.
Find more freelancers
Top cities for Pyspark Developers in Poland
- Pandas Developers in Gdansk, PL
- JavaFX Developers in Warsaw, PL
- Node.js Developers in Warsaw, PL
- Customer Engagement Freelancers in Warsaw, PL
- Research Specialists in Krakow, PL
- Scientific Researchers in Warsaw, PL
- InVision Designers in Wroclaw, PL
- InVision Designers in Krakow, PL
- Sound Designers in Warsaw, PL
- Translators in Wroclaw, PL
- Digital Marketers in Warsaw, PL
- Legal Freelancers in Warsaw, PL
- Virtual Assistants in Warsaw, PL
- Lead Generation Specialists in Katowice, PL
- English to Polish Translators in Krakow, PL
- Polish-to-English Translators in Rzeszow, PL
More top skills in Poland
- Kubernetes Developers in Poland
- Predictive Analytics Specialists in Poland
- BigQuery Developers in Poland
- Data Processing Experts in Poland
- Machine Learning Engineers in Poland
- Angular Developers in Poland
- Data Scientists in Poland
- Data Analysts in Poland
- Apache Tomcat Developers in Poland
- Computer Vision Engineers in Poland
- Data Entry Specialists in Poland
- Data Scrapers in Poland
- QT Developers in Poland
- Joomla Developers in Poland
- Ruby Developers & Programmers in Poland
- Zend Framework Developers in Poland