Hire the Best Pyspark Developers
in Brazil
Curitiba, Brazil
You probably think clicking "deploy" on Databricks from the cloud marketplace is all it takes to build a modern data stack. Instead, you get unmanageable infrastructure, skyrocketing costs, and pipelines feeding reports nobody trusts. ๐ ๐ณ๐ถ๐ ๐๐ต๐ฎ๐. ๐ก๐ผ ๐ฎ๐ด๐ฒ๐ป๐ฐ๐ถ๐ฒ๐, ๐ป๐ผ ๐ฏ๐น๐ผ๐ฎ๐. Just a multi-certified, 5+ years of experience Cloud Solutions Architect building automated, high-integrity platforms that turn raw data into a competitive advantage. If you shoot me a invitation or message I'll send you a personalized Loom video back on how I may be able to help you; and of course, to prove that I'm the real deal, ๐ป๐ผ ๐๐ ๐ถ๐ป๐๐ผ๐น๐๐ฒ๐ฑ! Whether you are building a greenfield lakehouse from scratch or migrating legacy systems to the cloud, I architect efficient, cost-effective environments that scale without the overhead. I understand the business bottom line just as well as the underlying code. โช 100% Job Success Score | 5.0โ average โช Proven experience on multi-cloud architectures ๐ก ๐ช๐ต๐ฎ๐ ๐ ๐ฑ๐ผ: โข ๐๐ฎ๐๐ฎ ๐ฃ๐น๐ฎ๐๐ณ๐ผ๐ฟ๐บ ๐๐ป๐ด๐ถ๐ป๐ฒ๐ฒ๐ฟ๐ถ๐ป๐ด: I build production-ready environments using Terraform. No manual marketplace or standard deployments that break at scale. โข ๐ฅ๐ฒ๐น๐ถ๐ฎ๐ฏ๐น๐ฒ ๐๐ฎ๐๐ฎ ๐๐ป๐ด๐ถ๐ป๐ฒ๐ฒ๐ฟ๐ถ๐ป๐ด: Raw data becomes actionable. I build resilient Medallion architectures and automated ETL/ELT pipelines so your stakeholders actually trust the numbers. โข ๐ฃ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป ๐ ๐๐ข๐ฝ๐: I bridge the gap between data engineering and machine learning. Using MLflow and Databricks Model Serving, I operationalize models into scalable, real-time REST endpoints and automated streaming inference pipelines. โข ๐๐ผ๐๐ฒ๐ฟ๐ป๐ฎ๐ป๐ฐ๐ฒ & ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐: Proper data governance utilizing Unity Catalog (no legacy Hive metastores) to ensure your data is accessible, secure, and future-proof. โข ๐๐น๐ผ๐๐ฑ ๐๐ผ๐๐ ๐ข๐ฝ๐๐ถ๐บ๐ถ๐๐ฎ๐๐ถ๐ผ๐ป: Most companies overspend on cloud infrastructure. I architect systems that pay for themselves in weeks by eliminating overhead and inefficiencies with efficient auditing and monitoring features. โ ๐๐ฒ๐ฟ๐๐ถ๐ณ๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป๐ (๐๐ฒ๐ฟ๐ถ๐ณ๐ถ๐ฒ๐ฑ): โข Databricks Professional Data Engineer โข Databricks Associate Data Engineer โข Databricks Lakehouse Fundamentals โข GCP Professional Data Engineer โข GCP Associate Cloud Engineer โข GCP Cloud Digital Leader โข AWS Associate Solutions Architect โข AWS Cloud Practitioner ๐ง ๐๐ ๐ฝ๐ฒ๐ฟ๐ถ๐ฒ๐ป๐ฐ๐ฒ ๐๐ถ๐๐ต ๐๐น๐ผ๐๐ฑ ๐ฆ๐ฒ๐ฟ๐๐ถ๐ฐ๐ฒ๐: โข ๐๐ฎ๐๐ฎ๐ฏ๐ฟ๐ถ๐ฐ๐ธ๐: Workflows, LDP (Lakeflow Declarative Pipelines), Unity Catalog, Workflows, Databricks SQL, MLFlow. โข ๐๐บ๐ฎ๐๐ผ๐ป ๐ช๐ฒ๐ฏ ๐ฆ๐ฒ๐ฟ๐๐ถ๐ฐ๐ฒ (๐๐ช๐ฆ): EMR, Athena, Redshift, Glue, S3, RDS, Kinesis Data Firehose, Kinesis, and Data Streams. โข ๐๐ผ๐ผ๐ด๐น๐ฒ ๐๐น๐ผ๐๐ฑ ๐ฃ๐น๐ฎ๐๐ณ๐ผ๐ฟ๐บ (๐๐๐ฃ): Bigquery, Dataform, Composer, Dataflow, Dataproc, Cloud Storage, Pub/Sub, Cloud Functions, and Looker Studio. โข ๐ ๐ถ๐ฐ๐ฟ๐ผ๐๐ผ๐ณ๐ ๐๐๐๐ฟ๐ฒ: Data Factory, Synapse, and Storage Account. โข ๐ข๐๐ต๐ฒ๐ฟ๐: Terraform, dbt, Airflow, Airbyte, Hadoop, and Hive. โ๏ธ ๐๐ผ๐ฟ๐ฒ ๐ฒ๐ ๐ฝ๐ฒ๐ฟ๐๐ถ๐๐ฒ: โข ๐ฅ๐ผ๐น๐ฒ๐: Data Architect, Data Engineer, Solutions Architect, Platform Engineer โข ๐ฃ๐น๐ฎ๐๐ณ๐ผ๐ฟ๐บ๐: Databricks (Delta Lake, Unity Catalog, Lakeflow, Workflows), BigQuery โข ๐๐ป๐ณ๐ฟ๐ฎ๐๐๐ฟ๐๐ฐ๐๐๐ฟ๐ฒ: Infrastructure as Code (IaC), Terraform, Multi-Cloud (AWS, GCP, Azure) โข ๐๐ฟ๐ฐ๐ต๐ถ๐๐ฒ๐ฐ๐๐๐ฟ๐ฒ: Medallion Architecture, Data Lakehouse, Data Governance, Data Quality, Machine Learning โข ๐๐ป๐ด๐ถ๐ป๐ฒ๐ฒ๐ฟ๐ถ๐ป๐ด: PySpark, Python, SQL, dbt, Apache Airflow, ETL/ELT, CDC, Batch and Stream Processing
- PySpark
- Cloud Architecture
- Cloud Computing
- Databricks Platform
- Data Engineering
- Python
- SQL
- Apache Airflow
- Google Cloud Platform
- Amazon Web Services
- Microsoft Azure
- ETL
- Data Analysis
- Bash
- Data Modeling
- Data Warehousing
- Continuous Improvement
Sao Vicente, Brazil
Hi there, thanks for considering me for your project! ๐ My name is Paulo, a Computer Science graduate from Brazil with 2.5 years of experience in Data Engineering using Python. With a strong background in algorithms, mathematics, and logic, Iโll quickly adapt to new tools and technologies to find the best solutions for your needs. ๐ ๐ง๐ท What I can do for you: โข Web Scraping โ Extract and process data from websites ๐ โข Develop ETL & Data Pipelines โ Automate data collection and transformation ๐ โข API Integrations โ Connect and exchange data efficiently ๐ โข Train and Deploy Machine Learning Models โ Build intelligent, data-driven solutions ๐ค Iโll break down technical concepts in an intuitive and easy-to-understand way. ๐ก My work is driven by commitment, dedication, and professionalism. ๐จ๐พโ๐ป Available in your time zone for smooth collaboration. โ๏ธ
- ETL
- Data Extraction
- ETL Pipeline
- Machine Learning
- Machine Learning Model
- Database Modeling
- Database Architecture
- Data Engineering
- Data Cloud
- Data Integration
- Big Data
Porto Alegre, Brazil
I have almost 5 years of experience working with data, having started as a data analyst, then data scientist, and for the past 4 years, as a data engineer. I've worked in many projects related to API consumption, web scraping and automation. I work with Python, SQL, PySpark, and AWS services like Glue, S3, Redshift, and IAM. Also have got experience with Azure services like ADLS and Functions. I have hands-on experience building scalable, production-grade pipelines using the medallion architecture, with automation and orchestration through serverless services. Experience with infrastructure-as-code practices using Terraform. I'm passionate about clean engineering, automation, and creating end-to-end data solutions that drive business value. Related to web scraping these are the tools I use depending on the need: requests, selenium, bs4, playwright. To deal with anti-bot we can always use proxies (that we can gather for free and I do have a database for that already), user-agents and cookies to mimic human like behavior. We scrape the data as json, html, xml or plain text and turn them into structured data as an EXCEL file, csv, database etc.
- PySpark
- ETL
- Data Extraction
- Data Mining
- ETL Pipeline
- Web Scraping
- Python
- API
- Data Entry
- Data Analytics
- Machine Learning
Belo Horizonte, Brazil
I am a Software Engineer with 8 year of experience in developing applications. I already work with a lot of technologies including: VB.NET, C#, Asp. Net Web Forms, Asp.Net Web Api, Asp.Net MVC, JQuery, Angular JS, Angular, Entity Framework, NHibernate, Delphi. I have experience in both backend and front-end development. I have a lot experience in T-SQL (Sql Server), but I also have some knowledge in Oracle, MySql,Postgre and MongoDB. For control version I already work with Git, Bitbucket and TFS. Recently I've shifted my career to work with Big Data, I'm finishing my thesis on Data Science and Advanced Analytics and I'm currently working as a Big Data Engineer.
- PySpark
- Python
- Databricks Platform
- Machine Learning
- SQL
- Transact-SQL
- PostgreSQL Programming
- BigQuery
- Google Cloud Platform
- HubSpot
- Apache Airflow
- Google Search Console
Divinopolis, Brazil
I build AI systems that go beyond prototypes โ production-ready, well-architected, and designed to scale. With 7+ years of experience across industry and academic research (MSc in NLP, published at ACL & WSDM), I bring both deep technical skills and a practical, results-driven approach to every project. My clients tend to stay. My longest engagement ran 3 years, and most of my UpWork projects have been 1+ year collaborations. I don't just write code โ I ask the right questions, suggest better approaches, and make sure the final product actually works in the real world. ๐ What I build โ AI Agents & Chatbots โ text, voice, multi-language (OpenAI, Anthropic, Azure AI, LiveKit) โ RAG Pipelines โ semantic search, vector databases, MCP servers, knowledge retrieval โ NLP & LLM Applications โ classification, entity recognition, summarization, Q&A โ Data Extraction โ OCR, web scraping, document parsing, transaction processing โ Cloud-Native Backends โ microservices, event-driven architectures, serverless โ Real-Time Audio/Video Processing โ transcription, translation, speaker diarization โ MLOps & Monitoring โ experiment tracking, evaluation, observability ๐ ๏ธ Tech I work with daily โ Python: FastAPI, Flask, Streamlit, Pandas, scikit-learn, TensorFlow/Keras, spaCy, NLTK โ AI/LLM: OpenAI, Anthropic Claude, DeepSeek, Gemini, LangChain, LlamaIndex, CrewAI, Guardrails AI โ Cloud: AWS (Lambda, SQS, S3, Batch, EC2, RDS, SageMaker), Azure, GCP, Terraform, Docker โ Databases: PostgreSQL, Supabase, MongoDB, DynamoDB โ Vector DBs: Weaviate, Pinecone, Milvus, pgvector โ MLOps: MLFlow, Langfuse, DeepEval, Weights & Biases, OpenTelemetry โ Other: Tesseract OCR, Faster-Whisper, Pyannote, FastMCP, Selenium ๐ Recent projects โ 3-year AI platform for a finance company: chatbot (voice + text), OCR pipelines, merchant classification with GPT-4o, web scraping, MLOps setup โ Real-time transcription & translation system on AWS for multi-language conferences: GPU-accelerated Whisper, speaker diarization, LangChain summarization โ RAG backend with MCP server: semantic search across Slack, ClickUp, and Fireflies using Weaviate, LlamaIndex, and Supabase pgvector ๐ข What clients say about me โธ "Christian is a great developer, and asks relevant questions for the problems we give him, he's not just a 'pair of hands' but a helpful advisor for improving your initial suggestion on how to solve the problem. He's very fast to iterate and delivers code with great quality." โธ "Went above and beyond to help with code for a large project, and completed tasks quickly and efficiently. Understood exactly what was needed for the job and executed with precision. I will absolutely be working with him again in the future!" Let's build something great โ send me a message and let's talk about your project.
- Generative AI
- Natural Language Processing
- Python
- Machine Learning
- Amazon Web Services
- Claude
- LangChain
- Deep Learning
- Data Science
- Google Cloud Platform
- Artificial Intelligence
- Streamlit
- Azure Cognitive Services
- Retrieval Augmented Generation
- Microservice
- Audio Transcription
- MLflow
- Vector Database
- Text Summarization
- Classification
Betim, Brazil
Most data teams have infrastructure that looks good on paper but still can't answer basic business questions fast enough. I fix that end to end. With 7+ years in data engineering, I specialize in Snowflake, Databricks, and AWS building pipelines that actually run in production without breaking at 6am. Recent work includes: โข CI/CD pipeline for a US healthcare client using Schemachange + Snowflake with automated DQ checks and deployment gating โข Databricks-to-Snowflake migration of a 200-view mart package, rebuilt natively in Snowflake SQL โข ML-powered matching service on FastAPI with pgvector semantic search and a LightGBM reranker โข End-to-end RAG pipeline using Airflow, Pinecone, and OpenAI I work as an independent contractor no middleman, no overhead. I treat your problem as mine to solve, not a ticket to close. If your data is slow, unreliable, or not being used strategically, let's talk.
- Apache Spark
- Microsoft Power BI
- Python
- SQL Programming
- ETL
- Database
- Amazon Web Services
- Snowflake
- Apache Airflow
- Apache Kafka
- Cloud Architecture
- dbt
- AI Consulting
- Microsoft Azure
- Artificial Intelligence
How it works
Post a job for free Post a job
Tell us what you need. Create your own job post or generate one with AI then filter talent matches.
Hire top talent fast
Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.
Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
Payment simplified
Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.
Don't just take our word for it
โUpwork provides an umbrella-level of security. I can see a talentโs work history and ratings. I can hold payments in escrow. I can communicate through Upwork Messages instead of working through my email address.โ
Kim Darling
Emerald Tiger
โUpwork is the best platform to hire skilled professionals when we're not looking for a full-time employee. All the companies in our portfolio use Upwork to find talent across a wide range of fields.โ
David Merry
Kinetic Investments
โOur very specific requirements can be a challengeโWith Upwork, weโre able to access a bigger community to ensure the success of our projects.โ
Katja Krohn
Summa Linguae
How do I hire a Pyspark Developer in Brazil on Upwork?
You can hire a Pyspark Developer in Brazil on Upwork in four simple steps:
- Create a job post tailored to your Pyspark Developer project scope. We'll walk you through the process step by step.
- Browse top Pyspark Developer talent on Upwork and invite them to your project.
- Once the proposals start flowing in, create a shortlist of top Pyspark Developer profiles and interview.
- Hire the right Pyspark Developer for your project from Upwork, the world's largest work marketplace.
At Upwork, we believe talent staffing should be easy.
How much does it cost to hire a Pyspark Developer?
Rates charged by Pyspark Developers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.
Why hire a Pyspark Developer in Brazil on Upwork?
As the world's work marketplace, we connect highly-skilled freelance Pyspark Developers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Pyspark Developer team you need to succeed.
Can I hire a Pyspark Developer in Brazil within 24 hours on Upwork?
Depending on availability and the quality of your job post, it's entirely possible to sign up for Upwork and receive Pyspark Developer proposals within 24 hours of posting a job description.
Find more freelancers
Top cities for Pyspark Developers in Brazil
- Bank Reconciliation Specialists in Sao Paulo, BR
- Consultants in Curitiba, BR
- Consultants in Florianopolis, BR
- Research Specialists in Recife, BR
- Research Specialists in Joao Pessoa, BR
- Embedded Systems Engineers in Rio de Janeiro, BR
- Translators in Sao Paulo, BR
- Translators in Rio de Janeiro, BR
- Kinetic Typography Specialists in Sao Paulo, BR
- Lighting Experts in Sao Paulo, BR
- Photoshop Experts in Sinop, BR
- Troubleshooting Freelancers in Brasilia, BR
- Legal Freelancers in Rio de Janeiro, BR
- Brazilian Portuguese to English Translators in Rio de Janeiro, BR
- Illustrators in Balneario Camboriu, BR
- English to Portuguese Translators in Teresina, BR
More top skills in Brazil
- Numpy Freelancers in Brazil
- Qlik Sense Developers in Brazil
- IBM InfoSphere DataStage Specialists in Brazil
- Data Engineers in Brazil
- Data Modeling Specialists in Brazil
- R Developers & Programmers in Brazil
- Machine Learning Engineers in Brazil
- Apache Kafka Developers in Brazil
- Angular Developers in Brazil
- AWS Fargate Developers in Brazil
- Data Collection Specialists in Brazil
- Data Scientists in Brazil
- Data Analysts in Brazil
- Apache Tomcat Developers in Brazil
- Relational Databases Specialists in Brazil
- GitLab Specialists in Brazil