Hire the best Pyspark Developers in England

Check out Pyspark Developers in England with the skills you need for your next job.
  • $85 hourly
    I am an experienced full stack data and backend engineer. My background and skills include: - Expert in Python, SQL and NodeJS - Certified AWS Cloud Practitioner and studying for exams in Certified Developer, Solutions Architect, SysOps Administrator and Data Analytics - Two MScs in Mathematics and Data Science - Databases (Postgres, PostGIS, Aurora, DynamoDB, MySQL, Mongo, Redis, Snowflake, Redshift, Neo4j) and ORMs - Cloud Infrastructure (AWS, GCP, Terraform) - APIs (FastAPI, Flask, Express, GraphQL) - Orchestration (Airflow, Kubernetes) - Pub/Sub + Queuing (Kafka, RabbitMQ) - Version control Git - CI/CD (Docker, Github Actions, Jenkins) - GIS (Postgis, Shapely, GDAL, H3) - Web crawling (Scrapy, custom crawlers in Python/Node/Go) - PySpark/AWS EMR/AWS Batch I have worked in data and tech for 4 years including full time roles as a Data Scientist, Data Engineer, Backend Engineer and Head of Data. In a previous life I worked investment banking in Sales and Trading for 4 years. Previous projects include: - Designing and building a client a booking engine to allow them to expand into the reservation and beauty treatment business (GCP, Python, Postgres, Redis and FastAPI). - Assisted a bottling company in expanding their tech capabilities by moving them from spreadsheets into the cloud and developing APIs to automate their business with other partners (Python, Postgres, AWS, FastAPI). - Refactored an employers crawling system to make it more efficient. Previously each data entity was crawled on a regular frequency, but the vast majority of entities very rarely changed leading to unnecessary crawling and resource use. The refactor took into account the changes in the data and how often they occurred to predict the next optimal time to crawl resulting in a 15% reduction in cloud costs (AWS, Postgres, NodeJS, Python, RabbitMQ). - Developed a streaming change data capture pipeline handling 50 million unique payloads per day resulting in a +15% reduction in total cloud costs while allowing live data to be available to customers (AWS, Python, NodeJS, RabbitMQ, Postgres, Snowflake, Airflow) - Created a data architecture to allow aggregation of geospatial time series for any possible geographic polygon across +20bn global data points in sub-second time (AWS, Clickhouse, Python, FastAPI, Postgres, Uber H3, Redis) - Created an autonomous on-demand Excel and PDF reporting system to allow a sales team to generate their own reports from data stores with no required input from developers (AWS, Python) - Developer multiple machine learning models running production including an age/gender classification for faces in photos (Python, Keras, GCP), entity resolution system combining tabular, text and image embeddings to deduplicate +30mm listings across multiple provider platforms (AWS, Python, PyTorch, RabbitMQ, Neo4j, Postgres)
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Amazon Web Services
    Terraform
    RabbitMQ
    Flask
    Machine Learning
    RESTful API
    PostgreSQL
    Node.js
    Snowflake
    PySpark
    Apache Kafka
    Docker
    Apache Airflow
    Python
    SQL
  • $50 hourly
    As a seasoned Data Scientist and Technical Product Manager, I bring extensive experience in Financial Crime Risk and Credit Risk management, coupled with deep proficiency in Python, Spark, SAS (Base, EG, and DI Studio), Hadoop, and SQL. Transitioning into freelancing, I am eager to leverage my skills to contribute to diverse projects. While Upwork's guidelines restrict sharing direct links to external profiles, I am happy to provide a detailed portfolio from my LinkedIn upon request.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Mining
    Big Data
    Data Science
    Fraud Detection
    Data Analysis
    PySpark
    SAS
    Credit Scoring
    Apache Hadoop
    SQL
    Python
  • $50 hourly
    Data Scientist | AI Specialist | Python, SQL, Spark, MS Excel, Tableau, Power BI, Github, Docker, Airflow, LLM | +5 Years of Experience ◾️ Who I am? I am a full-stack data scientist with over 5 years of experience, helping businesses solve issues through data-driven approaches. Also, I've been recognized both as an industry leader and emerging talent by the UK Global Talent Program and as a top 1% mentor by ADPList (the largest online mentorship community). ◾️ Why work with me? ✅ I am reliable, trustworthy, and professional. ✅ I put your needs first and provide tailored solutions to ensure the finished product is a great fit. ✅ I have experience working across multiple industries with startups to global $2 billion turnover clients. ✅ I have over 5 years of industry experience across Data and AI. ◾️ More About Me 🔗 LinkedIn: younes-sandi/ 🔗 ADPList: adplist.org/mentors/younes-sandi 🔗 Github: github.com/Unessam 🔗 Website: ds4technology.com/ ◾️ Accepting • Data Analysis • Data Visualization • Web Scraping • Data Modeling • ETL • Data Manipulation • Machine Learning • Data Engineering • Automation and Pipeline Development • Model Deployment • LLM Development • MLOps • Tutoring • Mentorship ◾️ Tools • Python • SQL • Pyspark • Microsoft Azure • AWS • GCP • Tableau • Power BI • Google Analytics • Excel • Docker • Airflow • MLflow • Github • OpenAI • LangChain ◾️ Projects • Predictive Modeling • Recommendation Systems • Segmentation and Clustering • Path Analysis • LLM Integration • AI Integration • Business Automation • Sales & Demand Prediction • Customer Segmentation • Customer Attribution Modeling • Customer Churn/ Retention Prediction • Predictive Maintenance Modeling • Anomaly Detention and Modeling • NLP and Sentiment Analysis • Survival Analysis and Modeling • A/B Testing • Impact Analysis • Performing Complex Statistical Test
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    AI Consulting
    OpenAI API
    LLM Prompt Engineering
    Big Data
    Survival Analysis
    Predictive Analytics
    Data Analysis
    PySpark
    Machine Learning
    Data Science
    Recommendation System
    Deep Learning
    Computer Vision
    SQL
    Python
  • $40 hourly
    With 6+ years of experience across legacy and modern technology stacks, I specialize in solving complex data challenges and driving data modernization for businesses. My product-led approach combines software engineering best practices with a deep understanding of DevOps to streamline processes and foster collaboration. I bring strong expertise in Agile and Waterfall methodologies, having delivered impactful solutions across diverse industries, including: • Energy Liaison • Banking • Telecom (Big Data Reporting & Automation) • Consumer Electronics (Data Warehousing) • Consulting Key Skills & Tools: • ETL Pipelines & Data Warehousing: Advanced SQL (SSIS, SRSS), Microsoft Fabric, Airflow, Azure Synapse, Databricks • Data Frameworks & Automation: FastAPI, Flask, Terraform, Great Expectations, DuckDB • Business Insights: Advanced Excel (Dashboards, Formulas, Macros, Financial Analysis) • System Design: Normalization (2NF, 3NF), Lakehouse Architecture Selected Project Highlights: • Built a Data Platform integrating Databricks, Azure DevOps, and Terraform for seamless data management. • Developed a CI/CD Pipeline leveraging Terraform, YAML, and PowerShell for automated deployment. • Deployed a Fabric Data Lakehouse using AWS S3, Lambda, and Azure Data Pipelines. • Designed a Process Management Framework with MSSQL, PySpark, and DuckDB for task orchestration. • Created a Pipeline Monitoring & Audit Solution using Azure API, Python, and Postgres. • Engineered a robust Data Quality Framework combining Python, Great Expectations, and FastAPI. With a proven track record of delivering value-driven solutions, I am your go-to expert for data engineering, automation, and report optimization. Let’s collaborate to bring your ideas to life and unlock your data’s full potential.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Microsoft Excel
    Terraform
    Docker
    Apache Airflow
    dbt
    Data Lake
    Microsoft Power BI
    Fabric
    Databricks Platform
    Azure DevOps
    Normalization
    Miro
    Python
    PySpark
    SQL
  • $50 hourly
    Data Analysis | Market Research Analyst | Go-To-Market Strategy I am seasoned Data Analytics Expert with a passion for transforming raw data into actionable insights. With a robust background in statistical analysis, machine learning, and database management, I bring a unique blend of technical proficiency and analytical acumen to every project. My experience includes 10 years in the field, where I've successfully Proficient in languages such as Python and R, I have a proven track record of developing predictive models, creating insightful visualizations via Tableau, Power BI and communicating complex findings to both technical and non-technical stakeholders. I thrive in dynamic environments, utilizing my problem-solving skills to tackle intricate data challenges. My commitment to continuous learning ensures that I stay at the forefront of emerging technologies and industry trends. I look forward to contributing my expertise in data analytics to [current or prospective employer/clients, driving data-driven decision-making and making a meaningful impact. 🎇Using MS excel Power BI, Tableau for regular data update of accounts and data visualization 🎇Running pre defined SQL statements to gather the data using Hive. 🎇Cultivated and strengthened lasting client relationships using effective communication and consumer behavior. Thank you for reviewing my profile. Are you interested in working with me? Send me an invite. I will be happy to have a quick 20 - 30 min call about your project.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Databricks Platform
    PySpark
    Business Intelligence
    Data Extraction
    Microsoft Excel
    SQL
    Google Analytics
    Looker Studio
    Market Research
    Social Media Audience Research
    Media Analytics
    Data Visualization
    Microsoft Power BI
    Dashboard
    Data Analysis
  • $60 hourly
    I'm a data scientist with a Master's in Analytics and 3 years of in-industry experience. I have experience in all areas of data science but specialise in: - Developing and deploying machine learning models - Natural language processing - Analysing and visualising data with interactive dashboards - Creating clear, well documented and reusable python code - AWS Certified Cloud Practioner Get in touch and find out how I can help!
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Analytics
    GitHub
    Algorithm Development
    Network Analysis
    Analytics
    PySpark
    SQL
    Tableau
    Data Science
    Python
    Machine Learning Model
    Deep Learning
    Machine Learning
    Natural Language Processing
    Amazon SageMaker
  • $40 hourly
    Data Engineer with over 5 years of experience in developing Python-based solutions and leveraging Machine Learning algorithms to address complex challenges. I have a strong background in Data Integration, Data Warehousing, Data Modelling, and Data Quality. I excel at implementing and maintaining both batch and streaming Big Data pipelines with automated workflows. My expertise lies in driving data-driven insights, optimizing processes, and delivering value to businesses through a comprehensive understanding of data engineering principles and best practices. KEY SKILLS Python | SQL | PySpark | JavaScript | Google cloud platform (GCP) | Azure | Amazon web services (AWS) | TensorFlow | Keras | ETL | ELT | DBT | BigQuery | BigTable | Redshift | Snowflake | Data warehouse | Data Lake | Data proc | Data Flow | Data Fusion | Data prep | Pubsub | Looker | Data studio | Data factory | Databricks | Auto ML | Vertex AI | Pandas | Big Data | Numpy | Dask | Apache Beam | Apache Airflow | Azure Synapse | Cloud Data Loss Prevention | Machine Learning | Deep learning | Kafka | Scikit Learn | Data visualisation | Tableau | Power BI | Django | Git | GitLab
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Engineering
    dbt
    ETL
    Chatbot
    CI/CD
    Kubernetes
    Docker
    Apache Airflow
    Apache Kafka
    PySpark
    Machine Learning
    Exploratory Data Analysis
    Python
    SQL
    BigQuery
  • $60 hourly
    Results-driven AWS Certified DevOps Engineer and Data Engineer with 10+ years of experience designing, implementing, and optimizing scalable cloud infrastructures. Specialized in the automation of CI/CD pipelines, data pipelines, and serverless architectures on AWS. Expert in Python, PySpark, AWS CloudFormation, Terraform, and AWS Glue, with a proven track record of improving deployment efficiency and reducing costs. Experienced in cloud security, infrastructure as code, and collaborating with data science teams on machine learning projects
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Amazon Athena
    AWS Glue
    AWS Fargate
    Amazon Cognito
    PySpark
    Google Cloud Platform
    CI/CD
    Serverless Computing
    AWS Lambda
    Flask
    Docker
    SQL
    Python
    pandas
  • $51 hourly
    🏆 Multi-Award Winner with Big5 company ex-clients in UK,Europe and Caribbean 🎯 Digital & Data Analytics Strategist. 📈 Expert to help you driving the Digital Transformation in any Industry. 🏆 Worked with biggest mobile networks and finance clients in UK and Europe and 30million+ customer and data systems. 📈 Process Transformation, Data-Driven Decision Making Enabler. Developing and implementing a successful Business Intelligence strategy should not feel daunting with expert guidance and support from the early stages. Thats where I can help you. For start-ups, small enterprises, and corporate companies, I specialize in developing scalable Business Intelligence and Data Analytics solutions. Using the most recent tools and technologies, I will work hard to develop you a high-performing BI solution. I am well versed in the tools & technologies listed below. (Expert level & Practitioner) -- Visualization Tools: Microsoft Fabric, Tableau, Data Studio, Power BI, QlikView, BIRST, Jaspersoft and Cognos. -- Cloud ETL: Funnel.io, Supermetrics, Fivetrans, Stitch, Snowflake, Synapse -- Cloud Platforms Amazon AWS, GCP, Microsoft Azure, Microsoft Fabric -- Microsoft Fabric, Redshift, Google Cloud Platform, Google Data Studio, Looker Studio, Google BigQuery, SQL, ETL Pipelines, PowerBI I can assist to turn your data or digital transformation project into a money-spinner. 💰 I would love to arrange an inital consultation call with you to discuss about your ideas for maximizing your business performance. ════════════ SERVICES ════════════ 🟢 End 2 End Data Architecture Strategy and development according to your key business priorities 🟢 ETL Data Pipelines from any data source into AWS, Azure and GCP 🟢 Creating complex SQL queries 🟢 Beautiful looking dashboards (in Power BI, Google Data Studio or Tableau) 🟢 Business Analysis, i.e., providing actionable insights to solve business problems 🟢 Transition from Google Sheets/Excel reports into automated data dashboards. 🟢 Interactive dashboards 🟢 Creating reports and dashboards I've worked with big telecom and finance companies having 1 billion+ revenue andd 10million+ transactions per day. I have worked with companies that asked me to work on several KEY business performance indicators such as ROI, CPC, CPA, AOV, CAR, and COGS. Feel free to schedule a call with me to talk about iyour project. Kind Regards, Savneet
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Warehousing
    PySpark
    .NET Core
    ETL Pipeline
    Business Intelligence
    Microsoft Power BI
    Database Design
    Data Visualization
    Microsoft Azure
    Python
    Angular
    C#
    SQL
  • $100 hourly
    💬 "Every month, I spend hours manually pulling reports instead of focusing on our strategy" 💬 "I just want to see my team's performance without having to juggle different spreadsheets" 💬 "End-of-month reporting shouldn't feel like assembling a thousand-piece jigsaw puzzle" 💬 "I just need the figures to reconcile. Why is it such a hassle to get consistent data?" If you find yourself nodding to any of these, you're in the right place. I'm Ayub, and I specialise in streamlining data and reporting processes, so you can focus on what truly matters: growing your business. Let's make your data work for you, not the other way around. 𝗜𝗡𝗧𝗥𝗢 With 7+ years in the data and analytics space, I've collaborated with the likes of Meta, HelloFresh, Capgemini, and several thriving startups. 𝗦𝗨𝗖𝗖𝗘𝗦𝗦 𝗦𝗧𝗢𝗥𝗜𝗘𝗦 Online consumer services business: Worked closely with senior management to gather reporting requirements and developed a suite of Tableau reports following data visualisation best practices. These dashboards allowed everyone in the business to finally automate and track business KPIs with ease. ⭐️ Testimonial: "Ayub is exemplary in his work and delivery. He is quick in understanding the exact requirement, his planning is meticulous and he has an eye for details. He is very good with data visualization and his dashboards have made it easy for our organization to make sense of numbers. I enjoyed working with Ayub and would love to work with him in future as well." E-commerce agency: Built a data pipeline to extract and load live tracking and price history data and built dashboards in Tableau, Power BI, Google Data Studio, and Klipfolio. These dashboards served is used as an analytics offering by the business to their clients to consolidate and present their clients data in a compact and easy-to-digest set of dashboards. ⭐️ Testimonial: "I've worked with Ayub for over a year on some complex data and data visualisation projects in Tableau, Power BI and Klipfolio. I've found him to be very competent and an excellent problem solver, as well as responsive and efficient. Looking forward to working with him again in the future!" Drop me a message anytime to discuss your challenges. All the best, Ayub
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Management
    Amazon Redshift
    ETL
    PySpark
    Amazon S3
    BigQuery
    PostgreSQL
    Data Vault
    Data Modeling
    Apache Airflow
    Apache Spark
    Data Warehousing
    dbt
    Amazon Web Services
    Google Cloud Platform
    Terraform
    Cloud Engineering
    Snowflake
    SQL
    Python
    Data Engineering
  • $30 hourly
    Hi there! I have over 4 years of experience in Data Engineering and Data Analytics. I use Python as my daily driver, and I regularly work with technologies and frameworks like SQL, Azure Databricks, Azure Data Factory, Azure Synapse Analytics and PowerBI. I can help you with tasks like Data Extraction, Data Cleaning, Data Transformation, Data Analysis and Data Visualisation. Feel free to reach out if you'd like to discuss your project with me! Languages - Python, SQL Cloud Tools - Azure Databricks, Azure Data Factory, Azure Synapse Analytics, Azure Data Lake Storage Data Processing, Transformation and Analysis - Apache Spark, PySpark, Pandas Data Visualisation - PowerBI Data Storage Formats - CSV, Microsoft Excel, Google Sheets, Parquet Others - Jupyter Notebook, ipynb
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Algorithm Development
    Data Management
    Java
    Data Analysis
    Data Structures
    Resume
    Interview Preparation
    Candidate Interviewing
    Machine Learning
    Data Science
    Career Coaching
    PySpark
    Apache Spark
    Python
    SQL
  • $25 hourly
    Your One-Stop Solution for Web Development, Data Analysis, and Quality Assurance I’m a versatile and highly skilled Web Developer, Data Analyst, and QA Specialist dedicated to helping businesses thrive with tailored, high-quality solutions. Whether you need a modern website, actionable insights from data, or reliable software testing, I’m here to turn your vision into reality. 🚀 Why Choose Me? ★ Web Development: I create stunning, responsive websites using HTML, CSS, JavaScript, React.js, PHP, and SQL. From sleek portfolios to business websites and full-stack applications, I deliver solutions customized to meet your unique needs. ★ Data Analysis: I transform raw data into actionable insights with Python, SQL, and Power BI. Whether it’s cleaning data, uncovering trends, or designing interactive dashboards, I empower businesses to make smarter, data-driven decisions. ★ Quality Assurance: I specialize in manual and automated testing, with expertise in Android and iOS mobile app testing to ensure seamless performance across devices. Using tools like Selenium, Postman, and modern app testing frameworks, I guarantee your software is bug-free, user-friendly, and meets the highest quality standards across platforms. ★ Virtual Assistance: Need a productivity boost? I offer efficient administrative support, including email management, calendar scheduling, data entry, and research, so you can focus on growing your business. 🎯 What I Guarantee: ✅ High-quality work delivered on time and within budget. ✅ Clear, regular communication to keep your project on track. ✅ A client-focused approach to ensure your satisfaction at every stage. Let’s work together to bring your vision to life—whether it’s building a website, analyzing data, or perfecting your software. Get in touch, and let’s start creating something amazing today!
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Visualization
    Data Wrangling
    Data Extraction
    Power Query
    JavaScript
    Web Development
    Microsoft Azure SQL Database
    Data Modeling
    Microsoft Excel
    PySpark
    Microsoft Power BI
    Python
    SQL
    GitHub
    Database
  • $20 hourly
    Dynamic and results-driven Cloud Data Engineer with a proven track record in designing and implementing end-to-end batch and streaming data pipelines. Proficient in orchestrating data workflows within multi-cloud environments, combining technical expertise with a commitment to optimizing data-driven solutions for enhanced business outcomes. Put your faith in me!
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Agile Software Development
    Terraform
    Amazon Web Services
    Google Cloud Platform
    Apache Spark
    Apache Kafka
    PySpark
    Apache Airflow
    SQL
    Python
  • $20 hourly
    🏆 Most recent achievements: ✅ LinkedIn made me a "Top Data Engineering Voice" for my contributions to Data Analytics & Data Engineering ✅ Teaching Python, SQL, Power BI to over 33,000 data enthusiasts on TikTok, YouTube, LinkedIn etc ✅ Saved my clients ~$500k in computing costs building custom data tools with Python, SQL, Databricks etc ✅ I have a YouTube channel with over 1.87k+ subscribers where I walk through data projects step by step (titled “Stephen | Data”) 💻 What do I do? I help companies build data products that generate the ROI they're after. I've done this using Python, SQL and Spark for over 7+ years in data engineering. Here are some of the tools + resources I can design and build from start to finish: 📍 data strategies 📍 data workflows 📍 data quality frameworks 📍 automation + augmentation tools ...among other solutions tailored to your data team's needs. Show me your data challenges and I'll create the solutions for them. 🌐 Other things to note: ✅ Response time: <24 hours ✅ Availability: Immediately (most of the time) Feel free to check out my portfolio and online handles for more information about me
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    pandas
    Data Ingestion
    Data Transformation
    Data Extraction
    Data Analytics
    Data Engineering
    Data Warehousing & ETL Software
    ETL
    Databricks Platform
    PySpark
    SQL
    Python
  • $50 hourly
    OBJECTIVE: Data Scientist with 5+ years of experience executing data-driven solutions to increase efficiency, accuracy, and utility of internal data processing. Experienced at creating data regression models, using predictive data modeling, and analyzing data mining algorithms to deliver insights and implement action-oriented solutions to complex business problems. AREAS OF EXPERTISE Data Science |Machine Learning| Management | Visualization TECHNOLOGIES Python, PySpark, Numpy, Pandas, R, GCP, Big Query, Azure Databricks, Tableau, Excel STATISTICS & APPLICATIONS OF STATISTICS Statistical Methods, Exploratory Data Analysis, Factor Analysis, Principal Component Analysis, Machine Learning (Regression - Linear/Logistic/Lasso/Ridge/Elastic net, Decision Trees, Ensemble Learning - (Bagging/Random Forest/Ada Boost/Gradient Boost/XGBoost, KNN, Naïve Bayes, SVM)
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Tableau
    Matplotlib
    PySpark
    BigQuery
    AWS Glue
    Python
    Dashboard
    ETL Pipeline
    Analytical Presentation
    Data Extraction
    Data Analysis
    Artificial Intelligence
    Machine Learning Model
    Machine Learning
    ETL
  • $60 hourly
    Persistent problem-solver, always in pursuit of simple, clever, and optimal solutions. Versatile Data Specialist with a PhD in Electrical Engineering and several years of experience in both research and industry across different fields. Strong analytical and communication skills; adept at solving real world problems, both independently and as part of a team. - Python - Scala - ETL - Spark
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Problem Solving
    Analytical Presentation
    PySpark
    Apache Spark
    Scala
    Data Extraction
    Data Analysis
    Data Mining
    ETL Pipeline
    ETL
    Python
  • $70 hourly
    I’m a data engineer experienced in designing and implementing scalable data pipelines, integrating APIs, and delivering actionable insights for businesses. Whether you’re looking to optimize workflows, ensure data quality, or enable predictive analytics, I can help. - Proficient in Azure Data Factory, Databricks, PySpark, SQL, and Google BigQuery - Skilled in API integration, Terraform for infrastructure versioning, and data governance - Expertise in building real-time event-driven pipelines and automating workflows I believe in maintaining clear and regular communication to ensure project success, so let’s stay connected!
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Machine Learning
    Machine Learning Model
    Artificial Intelligence
    Terraform
    ETL Pipeline
    Azure DevOps
    Amazon Web Services
    Google Cloud Platform
    PySpark
    Python
    Data Science
    Data Engineering
  • $100 hourly
    Dedicated and results-driven Data Engineer with a focus on delivering cost-efficient and scalable solutions. Proven expertise in implementing Continuous Integration/Continuous Deployment (CI/CD) methodologies to streamline development processes and significantly reduce cycle time.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Problem Solving
    SAS
    Database Management
    Database
    PySpark
    BigQuery
    Snowflake
    CI/CD
    GitHub
    Kubernetes
    Terraform
    Python
  • $65 hourly
    Experienced data scientist with 5+ years of experience in analyzing large-scale biological datasets, leveraging machine learning, and statistical modeling in cancer research. Strong background in genomics and computational biology to generate novel biological insights. Adept at managing complex projects and collaborating with interdisciplinary teams. Seeking opportunities to contribute to transforming patient's health and wellbeing with digital biology. Committed to responsible AI practices. Specialization: Data Analysis, Machine Learning, Computational Biology, Cancer Research, AI in HealthTech, Content Writing, Full Project Management
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Big Data
    Writing
    Statistical Analysis
    Dimensionality Reduction
    SQL
    R
    PyTorch
    PySpark
    TensorFlow
    pandas
    Python
    Artificial Intelligence
    Data Mining
    Machine Learning
    Data Analysis
  • $50 hourly
    Senior Data Scientist and ML Engineer with 6+ years of experience building production-grade ML solutions. Proven track record of driving business impact, including developing customer acquisition models that increased conversion rates by 11%. * Expert in Python, PySpark, and SQL * End-to-end project ownership from requirements gathering to deployment * Strong communicator focused on delivering clear insights.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Python
    PySpark
    ETL
    SQL
    MLOps
    Data Science
    Cluster Analysis
    Audience Segmentation
    Predictive Modeling
    Statistical Analysis
    Cloud Engineering
    Cloud Computing
    MLflow
    Machine Learning Model
    Databricks Platform
  • $50 hourly
    I am an Experienced Principal Consultant - Data Platforms with a proven track record of designing and managing large-scale, high-performance data infrastructures. My expertise spans database architecture, cloud platforms (AWS & Azure), and data engineering, ensuring optimized solutions tailored to meet unique business needs. With a strong foundation in SQL performance tuning, ETL processes, and business intelligence systems, I excel in creating scalable, reliable, and secure data ecosystems. I specialize in leading cross-functional teams, driving strategic technical initiatives in cloud data migration, advanced analytics, and data security. My hands-on approach ensures the seamless implementation of data pipelines, high system availability, and compliance with governance standards. Whether it's optimising legacy systems or implementing cutting-edge cloud solutions, I bring a blend of technical expertise and leadership that guarantees successful project outcomes. Let’s collaborate to take your data strategy to the next level.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Big Data
    pandas
    Data Lake
    Data Modeling
    Microsoft Azure SQL Database
    Microsoft Azure
    PostgreSQL
    Microsoft SQL Server
    Python
    PySpark
    SQL
    Data Engineering
    Data Analytics & Visualization Software
    Data Warehousing & ETL Software
    Database Administration
  • $75 hourly
    I am an experienced and commercially focussed Tableau Developer with considerable experience of both insight analytics and BI development, providing deep, actionable direction to senior stakeholders. I am competent in dealing with a range of stakeholders, up to board level, and am flexible in my approach, being able to adapt quickly to surroundings and work at pace as required. KEY ACHIEVEMENTS * Led the migration to Tableau, and developed Tableau dashboard suites, for global entities such as Awaze.com, Booking.com, Wombat Finance, Next, E.ON, WHSmith and WorldRemit, enabling real-time insights for senior stakeholders. * Analytics and experimental lead for Booking.com Summer Peak campaign; resulting in incremental bookings uplift of 1.5% for 2.1m partners. * Introduced Next ROAS for marketing spend optimisation, resulting in efficient utilisation of over £100m advertising spend. * Implemented a personalised next best action approach for E.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    ETL Pipeline
    ETL
    Data Analysis
    SAS
    Databricks Platform
    Microsoft Excel
    Microsoft Power BI
    dbt
    Python
    PySpark
    Snowflake
    SQL
    Tableau
  • $75 hourly
    I am a highly experienced AI Scientist, Data Scientist, and Machine Learning Engineer with a deep passion for leveraging advanced data analysis, machine learning, and artificial intelligence to solve real-world problems. With 25 years of expertise, I specialize in extracting actionable insights from complex data sets, developing predictive models, and driving data-driven decision-making across diverse industries. My proficiency in statistical modeling, data architecture, and machine learning empowers businesses to transform data into strategic assets that fuel growth and innovation.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    R
    PySpark
    Python
    Predictive Modeling
    Deep Learning
    Machine Learning
    Data Warehousing
    Artificial Intelligence
    Data Science
    Business Analysis
    Data Visualization
    Microsoft Power BI
    Tableau
  • $30 hourly
    Data Engineer with over 4 years of professional experience developing ETL/ML pipelines, APIs, app backends, SQL databases using python. Also know about developing Machine learning models.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    PySpark
    Artificial Intelligence
    Flask
    PyTorch
    Keras
    Python
  • $50 hourly
    I specialize in guiding and supporting projects related to all things data. I can help with building efficient data systems and pipelines, and aligning these with your objectives.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Microsoft Azure
    Amazon Web Services
    Cloud Computing
    Data Visualization
    Data Analysis
    Software Development
    Databricks Platform
    Database Architecture
    PySpark
    Apache Spark
    SQL
    Java
    Scala
    Python
    Data Engineering
  • $60 hourly
    Experienced AI Developer and Data Analyst | Python | R | SAS | SQL | Machine Learning | AI With nearly seven years of extensive experience in the IT sector, I specialize in data analytics, machine learning, and AI development, particularly in Python, R, SAS, and SQL. I am adept at developing and implementing AI-driven solutions, with a proven track record of delivering impactful results in various industries, including healthcare, environmental sustainability, and HR analytics. Key Highlights: AI Development: Successfully developed personalized AI reduction strategies using Retrieval Augmented Generation (RAG) on Microsoft Azure, driving environmental sustainability and enhancing user engagement. Data Analytics: Expert in data preprocessing, machine learning, and time series forecasting, with a strong background in creating predictive models that support strategic decision-making. Healthcare Analytics: Implemented overpayment detection systems that significantly reduced financial losses and auditing costs, enhancing the efficiency of healthcare operations. HR Analytics: Designed and deployed predictive models for employee attrition and sentiment analysis, enabling proactive retention strategies and improving workplace satisfaction. I hold a Master's degree in Artificial Intelligence from the University of Surrey, where I focused on developing foundational models for air pollution forecasting. My ability to combine deep technical expertise with strong problem-solving skills allows me to deliver innovative solutions that meet client needs and drive business growth. I am passionate about leveraging AI and data analytics to solve complex challenges and am committed to continuous learning and professional development. Let's connect to discuss how I can contribute to your next project.
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Retrieval Augmented Generation
    Generative AI
    PySpark
    Data Analytics & Visualization Software
    Artificial Intelligence
    SAS
    SQL
    R
    Python
    Data Science
    Machine Learning
  • $15 hourly
    Microsoft Azure | SQL | Synapse | Databricks | Azure Data Factory | Logic App | ETL Flow Processing | Power BI | Bigdata | Hive | Impala | Python | pySpark
    vsuc_fltilesrefresh_TrophyIcon Pyspark
    Data Engineering
    Big Data
    Microsoft Azure SQL Database
    Microsoft Azure
    PySpark
    Microsoft Power BI
    Apache Spark
    Databricks Platform
  • Want to browse more freelancers?
    Sign up

How hiring on Upwork works

1. Post a job

Tell us what you need. Provide as many details as possible, but don’t worry about getting it perfect.

2. Talent comes to you

Get qualified proposals within 24 hours, and meet the candidates you’re excited about. Hire as soon as you’re ready.

3. Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

4. Payment simplified

Receive invoices and make payments through Upwork. Only pay for work you authorize.

Trusted by 5M+ businesses