Hire the best PySpark Developers in France

Check out PySpark Developers in France with the skills you need for your next job.
  • $60 hourly
    Coming from an academic background, I was inclined towards a career in research. After spending a few years as a research engineer in telecom network optimization, I embarked on a transition towards Cloud and Data. I specialize in Data and Cloud engineering with a strong interest in the DevOps approach. My main focus is on designing and building data platforms in the cloud. I am also interested in designing and deploying machine learning solutions.
    Featured Skill Pyspark
    Terraform
    Apache Airflow
    Databricks MLflow
    Databricks Platform
    Amazon Web Services
    Machine Learning Model
    Machine Learning
    Kubernetes
    Docker Compose
    Docker
    NoSQL Database
    SQL
    PySpark
    Apache Spark
    Python
  • $40 hourly
    Senior Flutter Engineer, developed and maintained advanced mobile and web applications using Flutter & Native with Microsoft Azure & Firebase. My work focuses on enhancing user engagement and building new features. With a background as a Data Scientist and Engineer at Amadeus, I bring expertise in software solutions, machine learning, and data processing. I also share my knowledge as a Udemy instructor, having taught over 100,000 students in Flutter and AI development.
    Featured Skill Pyspark
    PySpark
    Artificial Intelligence
    JavaScript
    CSS
    HTML
    Azure DevOps
    GitHub
    Kotlin
    Figma
    Node.js
    Java
    Django
    Python
    Dart
    Flutter
  • $75 hourly
    Smaïl Darbane | Data Scientist & ML Engineer | AI Expert 🎓 Education: I hold a specialized Master's Degree in Data Science for Customer Knowledge from ENSAI and an Engineering Degree in Signal Processing & Artificial Intelligence from INP-ENSEIRB-MATMECA. Additionally, I completed a Master's in Data Science and Artificial Intelligence at the Polytechnic University of Madrid. 🔬 Professional Experience: Currently a Data Scientist at Air France KLM, I develop predictive models focused on revenue management and demand forecasting. Previously at TotalEnergies and Fujitsu France, I implemented both supervised and unsupervised algorithms for customer segmentation and fraud detection, as well as data generation and anomaly detection in financial data. 💻 Technical Skills: Proficient in Python (Pandas, PySpark, PyTorch, TensorFlow, Keras), Spark, SQL, R, and adept at CI/CD practices (Jenkins, Rancher, Bamboo) within cloud environments (GCP, Azure, AWS). 🌍 Languages: Fluent in French and Arabic, proficient in English (TOEIC 900), and intermediate Spanish. I am passionate about leveraging data science to tackle complex problems and create significant business value. My approach combines solid technical expertise with a deep understanding of business needs, enabling me to turn analytical challenges into innovative solutions.
    Featured Skill Pyspark
    Artificial Intelligence
    Big Data
    GitHub
    Neural Network
    LLM Prompt
    LLM Prompt Engineering
    Data Analysis
    PySpark
    Machine Learning Model
    Data Science
    Machine Learning
    XGBoost
    R
    Deep Learning
    Python
  • $80 hourly
    Whether you are looking for a reliable or innovative solution and struggling to find the right candidate, choose me. I will use all 15 years of my experience, including the last four as an ML Engineer, to help you get the job done quickly and deliver high-quality results. Let my clients speak: "Excellent service. He finished work on time and under the budget." "Grigory went above the scope required. He completed it 100%, accommodated the changes I needed along the way, and executed everything perfectly. His expertise in the field resulted in him being an amazing consultant to help me achieve the outcomes of the project required. I already have future projects lined up, and Grigory will be my only go-to guy moving forward." "Excellent job, so far the best contractor I’ve worked with on here." See you soon!
    Featured Skill Pyspark
    Time Series Analysis
    Excel Formula
    TensorFlow
    Natural Language Processing
    PySpark
    Exploratory Data Analysis
    Data Science
    Data Collection
    Microsoft Excel
    Tableau
    Python
    Data Scraping
  • $120 hourly
    An applied mathematician specialized in machine learning, computer vision, and agile management, I have been working for the past six years on big data, machine learning, and their applications in an industrial context as a tech lead and senior developer. I usually act as a tech lead with data science and data engineering teams, working on code, technical management, and setting up development and CI/CD processes. I often work in collaboration with network, security, and architecture teams. Experienced with on-premise, SAS/Cloud, and hybrid setups, I can help refine and implement solutions by analyzing business and operational needs to identify quick-win opportunities, relevant architecture and development patterns, and large-scale IT evolution.
    Featured Skill Pyspark
    Microsoft Azure
    Agile Software Development
    Google Cloud Platform
    Deep Learning Modeling
    Git
    CI/CD
    Terraform
    PySpark
    AWS Glue
    Data Modeling
    Data Engineering
    Python
    Apache Airflow
    Computer Vision
    Machine Learning
    Data Science
  • $30 hourly
    I am a mid-senior Web Developer with almost five years in the field. My aim is to design, develop, test and deploy high-quality web applications on scalable infrastructure (including AWS services). Communication is the foundation of everything I do: I always try to understand client requirements, and I am extremely motivated to learn new things quickly. I am versatile in the tools I can use and make sure to choose the right ones to get the job done. I guarantee daily communication and updates on projects. Scope and workload will be discussed thoroughly and work will follow accordingly. Looking forward to working with you! Notable skills: - Databases: SQL • NoSQL • MySQL • Oracle - OS: Linux • Windows Servers - Blockchain: Ethereum • Solidity - Development: Java • C • C++ • Python - Version control systems: Git • Subversion - Web Development: HTML • CSS • JS • Node • Angular • PHP - Soft Skills: Leadership • Conception and analysis • Team management • Coordination • Creativity - Unit and e2e tests Certifications: - Kubernetes for developers - AWS Fundamentals Specialization - AWS Fundamentals: Building Serverless Applications - AWS Cloud Technical Essentials - AWS Fundamentals: Migrating to the Cloud
    Featured Skill Pyspark
    API
    Kubernetes
    GraphQL
    ETL Pipeline
    Angular
    Amazon Web Services
    System Administration
    DevOps
    Apache Hadoop
    Unit Testing
    Git
    Test-Driven Development
    PySpark
    Laravel
    Python
  • $100 hourly
    Senior Data Scientist | AI & MLOps With 5 years of experience as a Data Scientist, I’ve had the privilege to work on a wide range of impactful projects, including: • GenAI: Developed a chatbot by fine-tuning an LLM to assist the customer service department of an automotive company. • Time Series Forecasting: Predicting future trends to optimize decision-making. • Fraud and Money Laundering Detection: Implementing robust systems to detect anomalous activities and uncover fraud networks. • Recommender Systems: Creating advanced models using images and texts for personalized user experiences. • Computer Vision: Developing solutions for element recognition and analysis in videos. • Esports Talent Scouting: Designing innovative systems to identify and recruit new talent in the esports industry. --- What I bring to your organization: Expertise in solving complex data challenges. A results-driven approach to ensure business impact. The ability to turn raw data into clear, actionable strategies. Stack: Python, Spark, SQL, Django, Dataiku, Google Cloud Platform, Git, SKLearn, PyTorch, Pandas Languages: French (native), English (fluent) If you're looking for someone who can leverage your data to drive growth and innovation, I’d be delighted to collaborate with you!
    Featured Skill Pyspark
    Git
    PySpark
    Deep Learning
    Machine Learning
    Python
  • $120 hourly
    As a freelance data engineer and ML engineer with 8 years of experience in the retail and telecommunications industries in Paris and Brussels, I am an expert in designing and implementing data-driven solutions that help businesses make informed decisions. My technical expertise includes machine learning operations (MLOps), software engineering, Docker, and cloud-based systems such as Azure. I am also proficient in programming languages like Python, which allows me to develop robust and scalable data solutions. In addition to my professional experience, I hold a PhD in a related field and a degree from Telecom ParisTech, a prestigious Grande École, where I gained a solid foundation in data engineering, machine learning, and software development. I have also volunteered my skills for various data-driven social impact initiatives and mentored junior data engineers and ML engineers to help them develop their skills and improve their problem-solving abilities.
    Featured Skill Pyspark
    BigQuery
    Snowflake
    PySpark
    dbt
    Docker
    Azure DevOps
    Python
  • $40 hourly
    A data-oriented developer for more than 5 years, I have made many applications in the context of professional and personal projects. My field of expertise includes the creation and optimization of data pipelines but also the creation of web and mobile applications. Enthusiastic about working on projects of all kinds, I would be delighted to put my skills at the service of your company and contribute to the realization of innovative and stimulating projects.
    Featured Skill Pyspark
    Node.js
    PySpark
    Apache Airflow
    Google Cloud Platform
    Kubernetes
    Docker
    AWS Application
    AWS Glue
    Python
  • $60 hourly
    I am a French flamenco guitarist who likes making predictions with AI on soccer games. I am a senior Palantir Foundry and Skywise integrator and developer. I use PySpark and Python to create strong data analytics and machine learning models. I can help you digitize your company or processes by implementing automation with robotic process automation. I can help you in each part of your project: - Feasibility study - Specification - Proof of concept - Deploying the technology - MLOps
    Featured Skill Pyspark
    Data Analysis
    Data Science
    PySpark
    Big Data
    ETL Pipeline
    Computer Vision
    Machine Learning
  • $55 hourly
    Are you looking for a Data Engineer to join your team? Would you like to develop robust data pipelines for your customers and yourself? Then you've come to the right place. I'm a Data Engineer with 3+ years of experience, specializing in Cloud services (GCP/AWS). I help companies of all sizes, from large groups to specific customers, strengthen their teams for a variety of data projects. My background combines technical expertise and agile methodology, enabling me to work on both one-off and recurring projects. I also attach great importance to transparency with my customers: at the end of each day, I send them a report on all the tasks I've carried out. I can support you from A to Z: → Data Engineering → Data analysis and processing → CI/CD implementation → Data Visualization Want to talk about your project? Contact me directly on the platform.
    Featured Skill Pyspark
    Data Warehousing
    Data Visualization
    ETL
    CI/CD
    DevOps
    Apache Airflow
    PySpark
    Amazon Web Services
    Google Cloud Platform
    Data Engineering
    Git
    Bash
    Python
    SQL
  • $100 hourly
    I am a dedicated and skilled Data Engineer with a passion for transforming raw data into actionable insights that empower businesses to make informed decisions. With a strong foundation in data architecture, pipeline development, and cloud technologies, I bring innovative solutions to complex data challenges. My goal is to optimize data workflows, enhance system performance, and ensure data reliability across diverse platforms.
    Featured Skill Pyspark
    Python
    GitHub
    Git
    CI/CD
    Cloudera
    Databricks Platform
    Apache Spark
    PySpark
  • $80 hourly
    I am a data engineer and developer. I use Apache Spark, Scala, Python, Databricks, cloud platforms (AWS/Azure/GCP), CI/CD, and streaming jobs.
    Featured Skill Pyspark
    Java
    Azure App Service
    Azure Cosmos DB
    Azure DevOps
    Databricks Platform
    Microsoft Azure
    Git
    Python
    Scala
    Apache Spark
    PySpark
  • $35 hourly
    Career Objective: As a dedicated and autonomous data scientist, I have 4 years of hands-on experience building robust models and delivering actionable insights. I am comfortable collaborating with technical and non-technical audiences to drive business value with data-driven solutions. With a strong background in providing end-to-end data science solutions, I am prepared to improve operational efficiency and customer satisfaction. I am quite flexible in my tools and can operate on various cloud systems, programming languages and visualization tools.
    Featured Skill Pyspark
    Econometrics
    SQL
    PySpark
    Python
    Microsoft Power BI
    Azure Machine Learning
    Forecasting
    Data Engineering
    Cloud Computing
    Business Intelligence
    Data Visualization
    Artificial Intelligence
    Machine Learning Model
    Data Mining
    Data Analysis
  • $50 hourly
    Data & Analytics Engineer with 5 years of experience and a proven track record in designing and optimizing large-scale data platforms. Skilled in building robust batch and streaming pipelines. Passionate about delivering efficient and secure data solutions, enabling data-driven decisions across organizations. Eager to leverage data engineering expertise to power AI-driven products and solutions.
    Featured Skill Pyspark
    Data Processing
    Microsoft Power BI
    SAS
    Databricks Platform
    Snowflake
    Microsoft Azure
    Python
    PySpark
    SQL
    Data Engineering
    Data Analytics
    ETL
  • $125 hourly
    Data architect specializing in Snowflake, Databricks, dbt, GitLab, and AWS solutions, I have helped many companies put in place robust, durable technical solutions suited to their needs. My preferred approach is to support teams through every stage, from the initial study phase to delivery of the service in production, with post-delivery support. My goal is to make teams autonomous in deploying and operating the solutions. I also provide the expertise needed to guarantee security and quality of service for users over the long term. My services therefore include: * Assessment and diagnosis of the existing solution * Design of a modernization strategy * Help with technology selection * Architecture definition * Implementation and migration * Process automation * Training * Mentoring and coaching
    Featured Skill Pyspark
    Amazon ECS
    Docker
    GitLab
    Database Architecture
    Data Engineering
    Data Cloud
    Cloud Architecture
    Databricks Platform
    PySpark
    Snowflake
    dbt
    ETL
    Data Extraction
  • $25 hourly
    I'm a senior Data Scientist with 5 years of experience in consulting and in the insurance business. I can be valuable to you when it comes to: - Cleaning and transforming data - Building predictive models - Scraping data from the web - Writing technical articles - Building visualization dashboards on Power BI, Metabase, or using Python code - Automating repetitive tasks using Python scripts Let’s keep in touch!
    Featured Skill Pyspark
    pandas
    Data Analysis
    Twitter/X API
    Google APIs
    Computer Science
    Automation
    Selenium
    PySpark
    Data Science
    Machine Learning Model
    Metabase
    Web Scraping
    Microsoft Power BI
    Python
    SQL
  • $12 hourly
    Data Analyst | Data Engineer I’m Abderrahim Loudiyi, a dedicated Data Analyst with a strong background in data science, data cleaning, and report generation. I specialize in cleaning and organizing complex datasets, ensuring that they are ready for accurate analysis and actionable insights. Proficient in tools such as Excel, Python, and SQL, I can help you clean up messy data, run analyses, and create clear, concise reports to meet your needs. With a keen eye for detail and a passion for turning data into meaningful information, I am committed to delivering results that are on time, accurate, and tailored to your project requirements. Whether it's cleaning up datasets or providing in-depth analysis, I’m here to help.
    Featured Skill Pyspark
    Data Extraction
    Data Engineering
    Data Entry
    Python
    PySpark
    Apache Airflow
    ETL
    Microsoft Excel
    Snowflake
    Tableau
    Qlik Sense
    Microsoft Power BI
    Data Analytics
  • $5 hourly
    I'm a Data Engineer and Data Applications Developer with over 3 years of experience working on big data platforms, including Spark and Hadoop. I specialize in building modern, interactive data applications using Streamlit and Python, helping clients turn complex data into clear, actionable insights. I also work with cloud technologies like Snowflake and Azure to deliver scalable, secure, and efficient data solutions. I'm committed to high-quality results, clean code, and open communication throughout every project.
    Featured Skill Pyspark
    Data Cloud
    Scrum
    SaltStack
    Apache Hadoop
    Apache Kafka
    Apache NiFi
    Snowflake
    SQL
    PySpark
    Data Ingestion
    Python
    Streamlit
  • $10 hourly
    Hello! 👋 I'm Mehdi, a data engineer with around 2.5 years of hands-on experience building data pipelines, dashboards, and even AI prototypes. I’m passionate about turning raw data into clear, useful insights — whether it’s for tracking business performance or preparing datasets for machine learning. I also have a strong background in Python development — whether it’s building ETL scripts, automating data workflows, or creating APIs and data tools from scratch. What I’m good at: . ETL pipelines with Python, Spark, Databricks . Dashboarding (Power BI, Grafana) . Cloud data (Azure, AWS S3, Delta Lake) . Data annotation, cleaning, and formatting . Python development for automation and backend scripts What I’ve worked on recently: ✅ Dashboards for a real estate company to track KPIs and team performance ✅ Real-time IoT platform to reduce equipment downtime using Grafana & InfluxDB ✅ AI agents trained in Unity for industrial simulations I like to keep things clean, efficient, and on time. If you need someone who can bring structure to your data or automate your workflows in Python, I’d love to hear about your project.
    Featured Skill Pyspark
    Microsoft Azure
    PySpark
    Databricks Platform
    Dashboard
    Web Scraping
    REST API
    Data Annotation
    ETL
    Data Visualization
    Data Analysis
    Microsoft Power BI
    SQL
    Python
  • $30 hourly
    I'm a Data Scientist and Mechanical Engineer with experience in developing machine learning and deep learning models for various applications (time series forecasting, speech to text, human-machine interfaces, sensor monitoring, ...). I also have experience in developing digital twins (FE simulations, DoE, Sensitivity Analysis, Calibration and Validation). I can help.
    Featured Skill Pyspark
    R
    Streamlit
    PySpark
    TensorFlow
    Python Scikit-Learn
    Deep Learning
    Computer Vision
    NLP Tokenization
    Generative AI
    MLOps
    Machine Learning
    SQL
    Microsoft Power BI
    Python
    Data Science
  • $25 hourly
    Seasoned Data Science and Analytics professional with more than 10 years of demonstrated experience in leading multiple machine learning model development and production projects in the financial services sector. Hands-on experience in providing analytical solutions to drive better customer experience, risk mitigation and revenue growth across multiple geographies (Asia Pacific, Africa, Europe, Americas). Expertise in aligning data solutions to the core of strategic decision making for senior executives while working in an agile framework. Statistical and Machine Learning Algorithms: XGBoost, Gradient Boosting, Random Forest, SHAP, Logistic Regression, Matrix Factorization, Collaborative Filtering, K Means, Decision Tree, NLP, Time Series. Tools: • Python: pandas, dask distributed, scikit-learn, xgboost, statsmodels, shap, lightfm, abydos, plotly, dash, hyperopt • PySpark: SQL, Pandas API, ML • Databricks: Feature Store, Experiments, Model Registry, Delta Lake, Workflow, .dbx, Compute • Azure Machine Learning (Microsoft Certified Azure Data Scientist) • Azure DevOps • R: data.table, dplyr, scorecard, glm, lubridate • SAS, SQL
    Featured Skill Pyspark
    Time Series Forecasting
    Gradient Boosting
    Computer Vision
    Natural Language Processing
    Supervised Learning
    Deep Learning
    Artificial Intelligence
    PySpark
    MLOps
    Databricks Platform
    Microsoft Azure
    Python
    Machine Learning Model
    Data Analytics
    Data Science
  • $15 hourly
    Skills: Linux, Git, MySQL, Java, Data Analysis, Machine Learning, Python, Deep Learning, SQL, TensorFlow, Talend, Power BI, AWS, PySpark, Kubernetes
    Featured Skill Pyspark
    PySpark
    Deep Learning
    Python
    SQL
    Amazon Web Services
    Google Cloud Platform
    Tableau
    Microsoft Power BI
    Talend Data Integration
    Oracle
    Analytical Presentation
    Machine Learning
    Data Analysis
    ETL
  • $65 hourly
    Passionate about Data professions and backed by five years of experience in various technological projects, I am constantly in search of challenges that allow me to develop my skills.
    Featured Skill Pyspark
    Microsoft Power BI
    Ansible
    Jenkins
    SQL
    Databricks Platform
    Microsoft Azure SQL Database
    Azure DevOps
    Scala
    PySpark
    Python
    Machine Learning Model
    Analytical Presentation
    ETL
    Artificial Intelligence
    ETL Pipeline
  • $10 hourly
    As a freelance Data Science & Analytics Engineer, I transform complex data into actionable insights with advanced technical expertise: ETL Pipelines & Data Processing: Build scalable ETL pipelines using Python, Apache Airflow, and PySpark. Dashboard Development & Visualization: Create interactive dashboards with Power BI, Streamlit, and custom JavaScript solutions. CI/CD & Automation: Implement CI/CD pipelines with GitLab and automate workflows using Power Automate. Cloud & Database Technologies: Manage secure, scalable data storage with Azure Cosmos DB, Snowflake, and Hive. Web Scraping & Data Integration: Extract and integrate data from diverse sources using advanced web scraping techniques.
    Featured Skill Pyspark
    R Shiny
    GitLab
    NLP Tokenization
    ETL Pipeline
    Machine Learning
    Data Analysis
    Azure Cosmos DB
    JavaScript
    Microsoft SQL Server
    SQL
    MongoDB
    Hive
    PySpark
    Python
    Microsoft Power BI Data Visualization
  • $25 hourly
    Certified Data Engineering Senior Analyst with 3+ years of experience in designing and implementing scalable data solutions, ETL pipelines, and cloud-based workflows. Backed by 7+ years of overall professional experience, including a strong focus on Databricks, AWS, and Python for data transformations and automation. Certified in AWS, Azure, and Databricks, with a proven track record of optimizing data processing for business outcomes. Passionate about Python, AI and ML, with experience developing Generative AI (GenAI) proofs of concept.
    Featured Skill Pyspark
    Azure Cognitive Services
    Amazon S3
    Amazon EC2
    Linux
    Bash
    Databricks Platform
    AWS Development
    AWS Glue
    PySpark
    Python
    Artificial Intelligence
    ETL Pipeline
    ETL
    Data Extraction
  • $50 hourly
    I help businesses, researchers, and public institutions turn complex data into actionable insights. With a PhD in Econometrics and a Data Science specialization, I bring 10+ years of experience in statistical modeling, forecasting, and economic analysis. ✅ My key skills include: • Time series modeling (ARIMA, SARIMA, VAR, VECM) • Machine learning (Random Forest, XGBoost, NLP, classification) • Econometric analysis (panel data, regressions, impact evaluation) • Data processing & visualization (Python, R, SQL, Power BI, Tableau) • Forecasting tools for economic trends, prices, or labor market dynamics I’ve delivered impactful work for clients such as Renault, research institutions, and public policy centers (e.g., CEET), and I’ve led data science projects involving Spark, Docker, and cloud technologies. If you’re looking for a reliable and insightful data expert to build forecasting models, analyze your data, or develop automated dashboards — let’s talk!
    Featured Skill Pyspark
    PySpark
    SQL
    Python
    Analytical Presentation
    Data Mining
    Data Visualization
    Forecasting
    Econometrics
    Artificial Intelligence
    Machine Learning
    Data Analysis

How hiring on Upwork works

1. Post a job

Tell us what you need. Provide as many details as possible, but don’t worry about getting it perfect.

2. Talent comes to you

Get qualified proposals within 24 hours, and meet the candidates you’re excited about. Hire as soon as you’re ready.

3. Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

4. Payment simplified

Receive invoices and make payments through Upwork. Only pay for work you authorize.