Hire the best Apache Spark Engineers in Sao Paulo, BR

Check out Apache Spark Engineers in Sao Paulo, BR with the skills you need for your next job.
  • $60 hourly
    Senior data engineer with product analytics and data science background, having worked with Fortune 500 companies (Procter & Gamble, Merck, Anheuser-Busch), as well as top-notch data-driven startups. Skilled in translating complex business problems into data solutions, designing data pipelines, providing high-quality data for data-driven insights and decision-making, as well as building KPIs, conducting statistical analyses, and creating impactful visualizations.
    Problems I'm good at solving: • Data Warehousing and Analytics • ETL / ELT data pipelines • SQL query tuning • Data Modeling and Database Design • Reporting • Data Analysis • Data Cleaning, Pre-Processing • Data Visualization • NLP problems
    I have a bachelor's in engineering from the top LATAM university (Universidade de São Paulo) with a track record of supporting organizations across various industries, including remote hiring, real estate, and consumer goods.
    Skills and Expertise: ✅ SQL ✅ Python
    Databases: ✅ Snowflake ✅ Redshift ✅ BigQuery ✅ Athena ✅ Trino ✅ Postgres ✅ MySQL
    Big Data Cloud Technologies: ✅ Amazon Web Services – AWS Certified (Redshift, Athena, S3, Lambda, Glue ...) ✅ Google Cloud Platform
    Other Data Engineering Tools: ✅ dbt ✅ Airflow ✅ Fivetran ✅ Git, Gitlab, and Github ✅ Rundeck ✅ Docker
    Data Visualization: ✅ Looker (LookML Expert) ✅ PowerBI ✅ Metabase ✅ Looker Studio (Data Studio)
    Data Science and Machine Learning: ✅ scikit-learn, pandas, etc ✅ NLP analysis ✅ Spark ✅ Databricks ✅ Hex ✅ Jupyter Notebooks
    User Behavioral Analytics: ✅ Snowplow ✅ Indicative ✅ Heap ✅ Amplitude ✅ Google Analytics
    Apache Spark
    Amazon Redshift
    BigQuery
    dbt
    ETL Pipeline
    Looker
    Data Analysis
    Data Modeling
    Data Visualization
    Business Intelligence
    Data Warehousing
    Snowflake
    Machine Learning
    Python
    SQL
  • $10 hourly
    • Data Engineer with 5 years of experience in development and data modeling. • Collaborative and motivated professional who always helps the team and focuses on results. • Enthusiastic about world-changing technologies and always studying geographical technologies.
    Apache Spark
    Apache Kafka
    AWS Lambda
    Google Cloud Platform
    Amazon S3
    Apache NiFi
    BigQuery
    Apache Hadoop
    pandas
    Amazon Web Services
    PySpark
    Python
    Java
    PHP
    SQL
  • $20 hourly
    Experienced in data science and analysis, mainly using Python and a little Scala. Familiar with the main technologies and frameworks used for that purpose, such as pandas, Keras, TensorFlow, Spark, and scikit-learn.
    Apache Spark
    PySpark
    Analytics
    Mathematics
    Microsoft Power BI
    Database
    ETL Pipeline
    Microsoft Azure
    Deep Neural Network
    SQL
    Machine Learning
    Data Science
    Python
    Deep Learning
    Machine Learning Model
  • $43 hourly
    I have experience with Data Science, mainly focused on the financial industry. I can help you identify business needs and develop models from scratch, from data acquisition to monitoring performance. I have experience developing risk models for car loans, collection models, and default amount forecasting.
    Apache Spark
    Jira
    Visualization
    Data Analysis
    RESTful API
    Heroku
    Artificial Intelligence
    Data Visualization
    SQL
    Deep Learning
    Machine Learning Model
    Machine Learning
    Natural Language Processing
    SAS
    Python
    Data Science
  • $50 hourly
    I hold a degree in Computer Science and a postgraduate degree in Data Science, with an emphasis on analysis, development, and information technology. I am organized, enjoy working in teams, constantly strive to stay updated with emerging technologies, play chess, play music in my free time, and am a big fan of Star Wars.
    Apache Spark
    REST API
    C#
    Deep Learning
    Keras
    Python Scikit-Learn
    NumPy
    pandas
    Python Script
    Apache Hadoop
    Apache Spark MLlib
    PySpark
    Machine Learning
    R
    Python
  • $100 hourly
    In the last 4 years, I have developed data and cloud solutions for top companies in the finance, marketing and education industries. I'm experienced in data engineering, cloud engineering and data analytics. The tech I have worked with the most is Python, AWS, SQL, Spark and Terraform, but I always love to check out cool new stuff! I have designed, architected and developed large scale data lakes, data pipelines for analytics, ETL, ELT and backend services, following DevOps and Software Engineering best practices. I will help your business with freelance and consulting services such as: • design and development of data lakes • adoption and deployment of Databricks • architecture of cloud solutions on AWS • infrastructure as code (IaC) • CI/CD pipelines • DevOps best practices • code review • data analytics and statistical modeling • team building • general consulting on data, cloud and software engineering I'm always eager to hear about interesting and challenging projects!
    Apache Spark
    Data Analytics
    Apache Airflow
    Big Data
    Data Engineering
    Databricks Platform
    Data Science
    Amazon Redshift
    Software Consultation
    Software Architecture & Design
    AWS Glue
    AWS Lambda
    Python
    SQL
  • $27 hourly
    I have experience with: - Automation of data-based processes - Exploratory data analysis - ETL development - Data visualization with Power BI, Tableau, Data Studio - Data mining tools like KNIME - Programming languages like Python, Java, JavaScript and SQL - Databases such as MySQL, SQLite, Hive and others - Data cloud platforms - Spark, Databricks
    Apache Spark
    Microsoft SQL Server
    Transact-SQL
    Databricks Platform
    PySpark
    Bash
    ETL Pipeline
    Data Cleaning
    Data Analysis
    KNIME
    Python
    Microsoft Power BI
    SQL
  • $50 hourly
    I develop data-driven solutions for business applications. I particularly enjoy identifying problems, discovering insights, validating them with data, and implementing solutions, especially when they have a positive influence on the business and its users.
    Apache Spark
    Apache Airflow
    Google Cloud Platform
    AWS Lambda
    AWS Glue
    PySpark
    Plotly
    Terraform
    Amazon Redshift
    SQL
    Microsoft Power BI
    pandas
    Python
  • $20 hourly
    Bilingual professional with a degree in Applied and Computational Mathematics from UNICAMP. Currently working as a data engineer at Santander in Brazil, but seeking opportunities in data science abroad. Experience in data pipelines, data manipulation and modeling, mathematics, analytics and software development. Strong analytical and problem-solving skills. Passionate about applying mathematical concepts to real-world problems. Looking for exciting challenges and the chance to contribute to international data projects.
    Apache Spark
    Git
    Scrum
    ETL
    Deep Learning
    Databricks Platform
    pandas
    Microsoft Azure
    Database Modeling
    Data Extraction
    Data Science
    SQL
    Python
    C
    Mathematics
  • $15 hourly
    Passionate Data Scientist and AI professional with expertise in Python, Machine Learning, Deep Learning, NLP, Databases, and BigQuery. Skilled in promoting fairness and inclusiveness in language models. Fluent in English and Portuguese. Skills * Python, MATLAB, C/C++, Linux * SQL * Machine Learning, Deep Learning, Data Analysis * Keras, PyTorch, TensorFlow, scikit-learn, NumPy * Natural Language Processing (NLP) * BigQuery, Spark
    Apache Spark
    Geospatial Data
    Microsoft Power BI
    Seismic
    Artificial Intelligence
    PySpark
    BigQuery
    Python
    Apache Spark MLlib
    Machine Learning
    Machine Learning Model
  • $15 hourly
    I have extensive experience in leading teams of professionals to understand customer needs and deliver efficiently. I have managed AI processes, flows, and dialogues on platforms such as PVA, Azure Cognitive Service, and Custom Question Answering, achieving positive results through Microsoft Power Platform tools. My expertise includes data ingestion supervision, automation, and team management using agile methodologies, resulting in improved data loading processes and a positive impact on society. Specializing in structured and unstructured database analysis, I implement Collection models for credit recovery. Using statistical techniques, managing indicators, developing logistic regression models, and automating data processes, I lead the development of BI reports for key clients. I am proficient in SQL, MySQL, ERP, and PostgreSQL, ETL processes using Spark, Python, Shell Script, Pentaho, PowerCenter, BCP, BULK, and have developed.
    Apache Spark
    R Hadoop
    Databricks Platform
    Microsoft PowerApps
    Microsoft Power Automate
    Microsoft Power BI
    Product Development
    Data Extraction
    Python
    Jenkins
    MySQL
    MariaDB
    MongoDB
    PostgreSQL
    SQL
  • $20 hourly
    Pharmacy graduate, currently studying in a postgraduate program in Data Science & Machine Learning, and specialized in Quality Management and Patient Safety and Clinical Pharmacology. Accumulating experience within the Data Science field. Analyst in the Patient Safety Core Team, responsible for implementing and managing processes, risk management, clinical protocols for stroke and venous thromboembolism, as well as internal and external audits. Eight years of experience dedicated to enhancing patient care quality and safety through accreditation systems. Substantial expertise in implementing best practices focused on quality and patient safety. Additionally, a strong background in creating indicators with their respective technical specifications, critical analysis, and definition of action plans and utilizing risk management tools, patient safety practices, clinical protocol management, and performance monitoring. In the role of Clinical Pharmacy Team Leader, improved the ability to analyze performance, develop action plans for achieving strategic results, evaluate adverse events, address product quality deviations, and oversee improvement projects. Relevant Skill Set: Quality and Safety in Pharmaceutical Products (Q&SP) Risk Management Clinical Protocol Management Patient Safety Accreditation Systems Performance Monitoring Data Science and Machine Learning Data Modeling with Hadoop Data Processing DataFrame Apache Spark (Python, SQL) Transformations RDDs MLlib and Streaming Machine Learning Algorithms for Big Data Preparation and Processing Kafka and Amazon Kinesis Environments Dashboard (Tableau and PowerBI)
    Apache Spark
    Microsoft Power BI
    R
    Apache Spark MLlib
    Python
    Machine Learning
    Data Science
    Machine Learning Model
  • $10 hourly
    Unleashing the potential of your data, I specialize in seamlessly merging Analytics Engineering and Data Engineering. From crafting dynamic Extract, Transform, and Load (ETL) pipelines to designing user-friendly Star Schema models, I'm dedicated to empowering non-technical stakeholders. I transform raw data into actionable insights, making information accessible and impactful. Tech Toolbox: Python | SQL | AWS | Apache Airflow | Git | Databricks
    Apache Spark
    Data Warehousing & ETL Software
    Docker
    Git
    SQL
    ETL
    Data Ingestion
    Google Cloud Platform
    Databricks Platform
    Python
    Data Modeling
    API
    Analytics
    Amazon S3
    Apache Airflow
  • $20 hourly
    - Bachelor's degree in Information Systems. Experience: - T-SQL development (procedures, views, functions, complex queries, jobs, indexes, temporary tables). - Development in VB6 and .NET (C# and ASP.NET). - Reports with Crystal Reports and Excel. - First-level support for SAP ECC, BW and BPC. - Development and ETL in Python. - Advanced/fluent English skills and experience as a sales representative abroad. - Part of an analytics team, handling data ingestion and ETL through Spark, Data Factory, SSIS and Python, and streaming with EventHubs and Spark Streaming, all loading into a Hive database. Knowledge and Courses: - Structured data using BI tools and concepts such as DW, SSIS, SSAS and SSRS. - Development of Machine Learning using Python (supervised and unsupervised learning, preprocessing, extraction of knowledge from images and videos using Deep Learning, and from text). - Big Data ecosystem: Hadoop (architecture, HDFS, MapReduce, YARN and blocks), data ingestion (Sqoop, Pig and Hive) and data processing (Hive and Spark). #python #analytics #sql #english #etl #businessintelligence #bigdata #dataengineer
    Apache Spark
    Real Time Stream Processing
    Microsoft Azure
    Apache Hive
    PySpark
    Apache Kafka
    PostgreSQL
    SQL
    pandas
    Python

How hiring on Upwork works

1. Post a job (it’s free)

Tell us what you need. Provide as many details as possible, but don’t worry about getting it perfect.

2. Talent comes to you

Get qualified proposals within 24 hours, and meet the candidates you’re excited about. Hire as soon as you’re ready.

3. Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

4. Payment simplified

Receive invoices and make payments through Upwork. Only pay for work you authorize.

How do I hire an Apache Spark Engineer near Sao Paulo on Upwork?

You can hire an Apache Spark Engineer near Sao Paulo on Upwork in four simple steps:

  • Create a job post tailored to your Apache Spark Engineer project scope. We’ll walk you through the process step by step.
  • Browse top Apache Spark Engineer talent on Upwork and invite them to your project.
  • Once the proposals start flowing in, create a shortlist of top Apache Spark Engineer profiles and interview.
  • Hire the right Apache Spark Engineer for your project from Upwork, the world’s largest work marketplace.

At Upwork, we believe talent staffing should be easy.

How much does it cost to hire an Apache Spark Engineer?

Rates charged by Apache Spark Engineers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.

Why hire an Apache Spark Engineer near Sao Paulo on Upwork?

As the world’s work marketplace, we connect highly skilled freelance Apache Spark Engineers with businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Apache Spark Engineer team you need to succeed.

Can I hire an Apache Spark Engineer near Sao Paulo within 24 hours on Upwork?

Depending on availability and the quality of your job post, it’s entirely possible to sign up for Upwork and receive Apache Spark Engineer proposals within 24 hours of posting a job description.