Hire the best Apache Spark Engineers in England
Check out Apache Spark Engineers in England with the skills you need for your next job.
- $45 hourly
- 5.0/5
- (7 jobs)
Data engineer with extensive commercial experience in designing and building cloud native data solutions. Experience & Skills: Python application development ✅ Developing ETL and ELT applications ✅ Reading and writing files ✅ Data manipulation using Pyspark and Pandas ✅ Developing applications on Docker ✅ Writing pytest test cases SQL/Data Warehousing ✅ Snowflake, BigQuery, RDS (PostgreSQL, MySQL, Aurora) ✅ Data warehousing and modelling ✅ Writing complex queries and metrics ✅ Creating dbt models Infrastructure ✅ Serverless architecture design ✅ Event driven architecture design using SNS and SQS ✅ AWS Glue applications with event trigger or cron schedule, including crawlers and Athena table integration ✅ AWS ECS tasks and services to run dockerized applications ✅ AWS Batch jobs to run dockerized applications ✅ AWS Lambda functions with event trigger or cron schedule ✅ Static website hosting on S3 with CDN ✅ AWS EMR to run Apache Spark applications ✅ All infrastructure is provisioned using Terraform Monitoring and alerting ✅ Job monitoring and alerting using Cloudwatch metrics and Grafana CI/CD ✅ CircleCI, GoCD, GitLab CI/CD Version Control ✅ GitHub, GitLabApache SparkAWS LambdaTerraformSnowflakeData IngestionGrafanaSQLAWS GlueAmazon ECSPythondbtCI/CDData ModelingApache Hadoop - $55 hourly
- 5.0/5
- (11 jobs)
- Rich Academic Pedigree: PhD in Data Science and Machine Learning from the University of Surrey, UK, complemented by a Postdoctoral Research in Artificial Intelligence at King's College London. - Decade-Long Experience in AI: Boasting over 10 years of hands-on experience, particularly in machine learning, statistical data analysis, and AI applications spanning intelligent transportation systems, smart energy management, online learning analytics, and public healthcare. - Expert in MLOps and Generative AI: Demonstrated excellence in deploying machine learning models with MLOps principles, and leveraging generative AI techniques, notably with GPT-4, for AI-driven tools and conversational solutions. - Strategic Leadership Role: Currently spearheading as the Head of Data Science & Innovation at The Open University, driving innovation and setting benchmarks in AI-driven strategies and executions. - Renowned Scholar: Proven track record in research with significant publications in high-impact journals; recognized for extracting valuable insights from big data in both academic and commercial settings. - Collaborative Spirit: A history of thriving in interdisciplinary teams for various research and commercial projects, ensuring optimal outcomes and impactful innovations. - Cloud Computing Aficionado: Strong proponent of cloud-based solutions, boasting hands-on proficiency with platforms like Microsoft Azure and Google Cloud Platform. - Python & Pyspark Maestro: A decade of mastery over Python and Pyspark, underlining a robust technical foundation.Apache SparkArtificial IntelligenceStatistical AnalysisMicrosoft AzureData Science ConsultationPython Scikit-LearnData SciencePythonDatabricks PlatformApache Spark MLlibAzure Machine LearningMachine LearningDeep Learning - $60 hourly
- 5.0/5
- (28 jobs)
Skilled Data Engineer and Analyst with experience on multiple programming languages (Python,SQL,Golang,Scala) and multiple platforms. I specialise in building data pipelines that take data from source to destination and can process data as needed in the pipeline. Currently focused on using Talend & Pentaho Data Integeration (PDI) as tools of choice but equally comfortable creating bespoke ETL solutions using other software or writing data processing scripts using bash,python etc. Proficient in databases and setting up Data Warehousing solutions. Finally, long experience building Data Dashboards using R Shiny, Pentaho Server and PowerBI.Apache SparkData ManagementDatabase DesignGraphQLNeo4jScalaGolangPostgreSQLData ScrapingMySQLETL PipelinePythonSQL - $90 hourly
- 5.0/5
- (15 jobs)
🩺 𝗣𝗵𝗗 & 𝗠𝗗. 𝟴+ 𝘆𝗲𝗮𝗿𝘀 𝗼𝗳 𝘀𝘁𝗮𝘁𝘀 𝗼𝗯𝘀𝗲𝘀𝘀𝗶𝗼𝗻. 📊 𝟭𝟬𝟬+ 𝗽𝗿𝗼𝗷𝗲𝗰𝘁𝘀 𝗺𝗲𝘁𝗶𝗰𝘂𝗹𝗼𝘂𝘀𝗹𝘆 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲𝗱. 𝗔𝗹𝘄𝗮𝘆𝘀 𝗼𝗻-𝘁𝗶𝗺𝗲. 𝗔𝘁𝘁𝗲𝗻𝘁𝗶𝗼𝗻 𝘁𝗼 𝗱𝗲𝘁𝗮𝗶𝗹 𝗮𝗻𝗱 𝗲𝘅𝗰𝗲𝗽𝘁𝗶𝗼𝗻𝗮𝗹 𝗾𝘂𝗮𝗹𝗶𝘁𝘆 𝗳𝗼𝗰𝘂𝘀. 𝗠𝘆 𝗴𝗲𝗻𝗶𝘂𝘀 𝘇𝗼𝗻𝗲 𝗶𝘀 𝗲𝘅𝘁𝗿𝗮𝗰𝘁𝗶𝗻𝗴 𝗶𝗻𝘀𝗶𝗴𝗵𝘁𝘀 𝗺𝘆 𝗰𝗼𝗺𝗽𝗲𝘁𝗶𝘁𝗼𝗿𝘀 𝗺𝗶𝘀𝘀. 🎯 Hi I’m Amer and I excel at supporting US 🇺🇸 and worldwide 🌎 based healthcare projects with all of their biostatistician, research and data analyst needs. I’m currently focused on working on long-term projects. ☑️ As a result of working together you can expect with the data provided or collected I can tell you exactly what was learned that was statistically significant or not. 📞 Please invite me to your project on Upwork if you would like to schedule a complimentary consultation call together. 📞 ❝ 𝘿𝙧 𝘼𝙢𝙚𝙧 𝙞𝙨 𝙖 𝙧𝙤𝙘𝙠𝙨𝙩𝙖𝙧 𝙙𝙖𝙩𝙖 𝙨𝙘𝙞𝙚𝙣𝙩𝙞𝙨𝙩 𝙖𝙣𝙙 𝙥𝙝𝙮𝙨𝙞𝙘𝙞𝙖𝙣 𝙗𝙞𝙤𝙨𝙩𝙖𝙩𝙞𝙘𝙞𝙖𝙣. 𝙄𝙩 𝙬𝙖𝙨 𝙖 𝙥𝙡𝙚𝙖𝙨𝙪𝙧𝙚 𝙬𝙤𝙧𝙠𝙞𝙣𝙜 𝙬𝙞𝙩𝙝 𝙮𝙤𝙪 𝘿𝙧 𝘼𝙢𝙚𝙧 𝙖𝙣𝙙 𝙬𝙚 𝙡𝙤𝙤𝙠 𝙛𝙤𝙧𝙬𝙖𝙧𝙙 𝙩𝙤 𝙘𝙤𝙣𝙩𝙞𝙣𝙪𝙞𝙣𝙜 𝙩𝙝𝙞𝙨 𝙧𝙚𝙡𝙖𝙩𝙞𝙤𝙣𝙨𝙝𝙞𝙥. 𝘿𝙚𝙡𝙞𝙫𝙚𝙧𝙚𝙙 𝙤𝙣 𝙩𝙞𝙢𝙚, 𝙨𝙘𝙤𝙥𝙚 𝙖𝙣𝙙 𝙫𝙚𝙧𝙮 𝙬𝙚𝙡𝙡 𝙘𝙤𝙢𝙢𝙪𝙣𝙞𝙘𝙖𝙩𝙚𝙙. 𝘼 𝙩𝙧𝙪𝙚 𝙚𝙭𝙥𝙚𝙧𝙩 𝙞𝙣 𝙝𝙞𝙨 𝙛𝙞𝙚𝙡𝙙. ❞Apache SparkSASMachine LearningPredictive AnalyticsStataData AnalysisClinical TrialClinical Trial Management SystemMedical EditingStatistical AnalysisQuantitative AnalysisResearch MethodsData ScienceRPython - $85 hourly
- 5.0/5
- (36 jobs)
I work with ambitious innovators and entrepreneurs to design, prototype and build AI powered applications, fast. After leading the AI solutions engineering team at a Silicon Valley AI Startup, backed by Google Ventures, I know how to take your vision and turn it into AI solutions that result in real profits. After working in development for 10 years, I’ve seen 3 problems with how other providers typically run projects... 1. They overwhelm clients with technical jargon 2. They say "yes" to everything, over-promise, under deliver and overrun 3. They delay projects by focusing on shiny objects instead of real business results When this happens: - Projects take way longer than planned - Projects cost way more than quoted - Time is wasted on alignment In reality, finding a partner with the end to end practical experience needed to design and launch AI products is harder than you’d think... But, when your AI project succeeds the pay off is immense I've seen it first hand... Here's a few reasons that I'm the right person to turn your great idea into an roaring success: ✅ Top 1% on Upwork as a Top-Rated Plus and Expert Vetted Freelancer ✅ 9+ Years in Full Stack Mobile and Web Development ✅ Helped 26+ Businesses Scale with my Solutions ✅ Google Ventures Startup Experience: 2+ Years of Cutting-Edge AI Work ✅ IBM Certified Expertise in Enterprise Design Thinking ✅ London Based And here's some practical examples of the outcomes my solutions have achieved: The last startup I was at raised over $50m from Google and Sequoia and was valued at $400m. They just got bought by Steve Wozniak's company (Apple co-Founder) Since then I have: + Helped 2 startups raise 7-figures with clickable web app prototypes + Built an entire warehouse management platform for a US logistics company expanding to Europe. It syncs with their US operations and shipping providers + Prototyped an app for Canada's leading security tech company. They used it to land the biggest contract in company history. Then, I built them a full stack web application to visualise 1.8m crimes + Built a web app for a Supermajor energy trading team to monitor 136 petrochemical sites with connected car data and alert them to shutdowns that would move commodity prices + Developed an AI pipeline to detect deforestation across the whole of Indonesia (1.9m sqkm) for the worlds biggest CPG company + Developed a web app for an industrial insurance company to analyse thousands of invoices using OCR and a fine tuned AI model, freeing up 200+ hours per claim + Built a web MVP for a startup that uses gen AI to create thousands of personalised videos with AI avatars in multiple languages + Engineered a PDF AI extraction internal web platform for a US real estate law firm. Freeing them time to focus on more important tasks + Worked with a Japanese Bank to develop an application using cellular footfall data and financial inputs, to optimise how much cash to leave in US ATMs + Built an app for a London property company that automated their legal letter generation and streamlined surveys saving 60+ hours per week And, there’s more. Lots more... Feel free to have a look at my case studies and customer reviews below. With 10 years in engineering, and 2.5 years leading the AI solutions engineering team at a Google Ventures startup... I've delivered multi-million dollar projects for Fortune 500 companies So, why does this matter for your project? I know how to turn your idea into a functional AI application, fast. This means you get a solution that works In the shortest time possible Without any surprises Plus, I bring industry leading Silicon Valley expertise to your team What's more, I can prototype your idea in just 5 days... Why? Because nothing helps you get buy-in, investment and your first customers like a beautifully crafted, clickable prototype Here is how I work with clients: 1. You get hours of research before I even give you a proposal 2. You get access to top silicon valley talent, immediately 3. You get business results, fast Typically the first step is a quick call In the call we'll talk through your project goals and some discovery questions After the call, I'll give you a full project plan, scope and timelines and you can decide if you want to move forward If you're interested, let me know and I will share my calendar to find a time that works for us to meet.Apache SparkETL PipelineData LakeData WarehousingApache AirflowData EngineeringFlaskFastAPIDjangoDatabricks PlatformApache KafkaAmazon Web ServicesArtificial IntelligenceAPI IntegrationPython - $80 hourly
- 5.0/5
- (5 jobs)
I am a programmer fluent in Python, SQL, and Cloud Technologies. I have a lot of experience working with data at scale including the design of automated pipelines and data warehouses. I have implemented self-service BI tools and designed bespoke reporting dashboards. I also have a lot of experience building automated scripts to scrape data from the internet and automate daily tasks. Completed Projects: - Data integrations with eCommerce API's (Walmart DSV, Amazon SP-API) - PDF text extraction via Python and OCR - Webapp created to interact with the text extraction script - API created to interact with the text extraction script - Fullstack Django WebApp created for a sport prediction gameApache SparkDjango StackData WarehousingLookerCloud ComputingApache BeamAPIBig DataComputing & NetworkingApache AirflowGoogle SheetsScriptingJavaScriptAutomationPython - $120 hourly
- 5.0/5
- (3 jobs)
World-class expert in clouds, storages, data platforms, and high-performance systems, having led teams and core projects at AWS S3 and Apple. US Inventor. Lately, I have been focusing primarily on Clouds, Data Platforms & Data Governance, but fluent in the entire big data stack, including DevOps methologies, high-performance JVM installations (incl Spring), databases and Data Science. I emphasize reliability, clearly written processes, and agile turnaround with all stakeholders. I am now bootstrapping my own B2B startup, where I handle CustDev, Product Management, GTM, complex UX, enterprise search with AI and everything else. I am not that proficient in these areas, but I am learning something new every day. As a hedge against startup turbulence, I am happy to share my core competencies with companies through Upwork!Apache SparkArtificial IntelligenceCloud ArchitectureBig DataData AnalysisApache HadoopAmazon S3Distributed ComputingAI BotPythonWeb ApplicationJavaAmazon Web Services - $150 hourly
- 5.0/5
- (1 job)
I'm an experienced Machine Learning Engineer and have worked as a Data Engineer and Data Scientist in corporate companies, most recently building a better MLOps workflow. * Experienced python developer, comfortable working with container services (Docker, k8s) * Used to working as a consultant/contractor from previous role * Looking forward to working with youApache SparkGitDockerKubeflowMLOpsMachine LearningJavaScalaC++TensorFlowPython - $100 hourly
- 5.0/5
- (5 jobs)
⏩ Ready to transform your tech infrastructure and drive measurable results? With 20+ years of experience, I deliver scalable cloud, data, and web solutions that cut costs and accelerate growth. My expertise lies in simplifying complex technologies, driving revenue growth, and boosting efficiency through innovative strategies. Notable achievements include 20x throughput improvements and 25% cost reductions across healthcare, finance, and biotech sectors. As Infopoly’s co-founder and CTO, I empower clients to outpace competition and future-proof their operations. I work directly with decision-makers to align technology with business goals, ensuring every solution maximises ROI and positions businesses for long-term success. Services: 🚀 I combine deep technical expertise with a strategic mindset, ensuring each project delivers tangible business outcomes. ☁ Cloud Optimisation (AWS, Azure & GCP): Secure, scalable cloud environments that boost performance. 📊 Data Analytics: Actionable insights through Power BI and Tableau. 🌐 Web Technologies & Development: Full-stack development and responsive applications that drive user engagement and business growth. 🌀 Agile Team Leadership: Delivering projects on time with cross-functional teams. 🎯 Tech Alignment: Ensuring tech initiatives match business goals. Track Record: ⭐ 5.0 Rating across long-term UpWork contracts for data integration, warehousing, and cloud projects. 💬 Client Testimonial: "Infopoly enabled a technology step change for EptivA, automating algorithms and modernizing infrastructure. Their cloud-first strategy achieved 20x higher throughput, accelerating market success." 🤝 Partnered with healthcare, finance, and biotech sectors. 💰 Delivered $200K+ in UpWork projects; external engagements exceed $2M. ⚙️ Cut operational costs by 25% through cloud automation. 🔒 Built long-term client relationships grounded in transparency and results. Let's Connect: 📅 Let's connect! I'm ready to help you unlock growth – book a 30-minute call this week to strategise and drive your next project forward.Apache SparkServerless ComputingDockerApache KafkaScalaPythonKubernetesAmazon Web ServicesAgile Software Development - $80 hourly
- 4.9/5
- (35 jobs)
I specialize in Data analysis, Artificial Intelligence projects, Web development, and creating processes pipelines. I have a computer engineering background and graduated with a PhD in Machine Learning, Neural networks. Summary - Data analysis, Machine learning and Artificial intelligent skills with specialization of neural networks (Python, NumPy, Keras, TensorFlow, R, Java, Deeplearning4j); - Strong back end development experience with wide skill set (Java (Spring, Hibernate, Jackson, Microservices, Spring Security)); - Front end development experience (Javascript (AngularJS, ReactJS, ExtJS), Bootstrap); - MySQL/ SQL/Mongo, MVC, Git / Subversion toolset, Maven, Gradle; - Project management and supervisory experience (Agile, Scrum, Kanban - Jira, redmine); - Academic background; - High self-organization and discipline. My strong points are Creativeness, Punctuality and Responsiveness. The sphere of my interests is Deep Learning, Convolutional Neural Network, Recurrent Neural Network projects.Apache SparkProject ManagementDeep LearningArtificial Neural NetworkData ScienceArtificial IntelligenceTensorFlowJavaSpring FrameworkPython - $90 hourly
- 5.0/5
- (3 jobs)
Leading 4 programs as lead big data architect and lead data architect and many project as technical lead in Banking for 14 years. I have end to end big data, DW architecture and implementation experience with stakeholder management. - Lead/architect consultancy for data lake & analytics in Sainsbury’s, TUI, Collinson group, ... - AWS solution architecture, AWS EMR, S3, RDS, CloudFormation, VPC, IAM with GDPR PII/PCI compliance. - Azure solution architecture. Azure Synapse, Azure Gen2/Blob data lake, Data factory, sql pools, Databricks, Jupyter Notebooks, Logic apps. - AWS Glue, Athena, AWS Lambda serverless - Big Data (Spark, Hadoop, YARN, Nosql, Hive, Sqoop, Flume, Kafka, R) 3+ years. - In depth knowledge in dimensional&relational data modeling and data warehousing, 13 years - Data Science with MSC degree (Weka, Python, Pandas, numpy, nltk) 5+ years - Analysis, design and implementation of DW&mining system - 11+ years. - Designing end to end in-house CRM 360 degree customer view, Salesforce systems integration as solution architect - 5 years. - Planning and coordinating data migration of a banking system. - Designing banking migration infrastructure, configuring migration simulation environment, sql, pl/sql tuning migration, coordinating end of day batches of bank. - Data mining, customer segmentation, propensity modeling, churn modeling, - Oracle OFSA, datamart design, data quality, ODS, financial reporting. Skills: - Hadoop, Oracle Nosql, Apache SOLR, Pig,Hive,Spark. - Python: flask, django rest, numpy, dask - Oracle ODI (Sunopsis/10g/11g/12C),PL/SQL - Reporting: OBIEE, - Shell scripting, XML,XSD - Oracle: PL/SQL, SQL, DB Tuning - IBM DB2, SQL Server, AS/400, SYBASE, MySqlApache SparkData ManagementETL PipelineDatabase ArchitectureData ScienceBig DataAmazon S3AWS Glue - $100 hourly
- 5.0/5
- (11 jobs)
💬 "Every month, I spend hours manually pulling reports instead of focusing on our strategy" 💬 "I just want to see my team's performance without having to juggle different spreadsheets" 💬 "End-of-month reporting shouldn't feel like assembling a thousand-piece jigsaw puzzle" 💬 "I just need the figures to reconcile. Why is it such a hassle to get consistent data?" If you find yourself nodding to any of these, you're in the right place. I'm Ayub, and I specialise in streamlining data and reporting processes, so you can focus on what truly matters: growing your business. Let's make your data work for you, not the other way around. 𝗜𝗡𝗧𝗥𝗢 With 7+ years in the data and analytics space, I've collaborated with the likes of Meta, HelloFresh, Capgemini, and several thriving startups. 𝗦𝗨𝗖𝗖𝗘𝗦𝗦 𝗦𝗧𝗢𝗥𝗜𝗘𝗦 Online consumer services business: Worked closely with senior management to gather reporting requirements and developed a suite of Tableau reports following data visualisation best practices. These dashboards allowed everyone in the business to finally automate and track business KPIs with ease. ⭐️ Testimonial: "Ayub is exemplary in his work and delivery. He is quick in understanding the exact requirement, his planning is meticulous and he has an eye for details. He is very good with data visualization and his dashboards have made it easy for our organization to make sense of numbers. I enjoyed working with Ayub and would love to work with him in future as well." E-commerce agency: Built a data pipeline to extract and load live tracking and price history data and built dashboards in Tableau, Power BI, Google Data Studio, and Klipfolio. These dashboards served is used as an analytics offering by the business to their clients to consolidate and present their clients data in a compact and easy-to-digest set of dashboards. ⭐️ Testimonial: "I've worked with Ayub for over a year on some complex data and data visualisation projects in Tableau, Power BI and Klipfolio. I've found him to be very competent and an excellent problem solver, as well as responsive and efficient. Looking forward to working with him again in the future!" Drop me a message anytime to discuss your challenges. All the best, AyubApache SparkData ManagementAmazon RedshiftETLPySparkAmazon S3BigQueryPostgreSQLData VaultData ModelingApache AirflowData WarehousingdbtAmazon Web ServicesGoogle Cloud PlatformTerraformCloud EngineeringSnowflakeSQLPythonData Engineering - $30 hourly
- 4.7/5
- (19 jobs)
Hi there! I have over 4 years of experience in Data Engineering and Data Analytics. I use Python as my daily driver, and I regularly work with technologies and frameworks like SQL, Azure Databricks, Azure Data Factory, Azure Synapse Analytics and PowerBI. I can help you with tasks like Data Extraction, Data Cleaning, Data Transformation, Data Analysis and Data Visualisation. Feel free to reach out if you'd like to discuss your project with me! Languages - Python, SQL Cloud Tools - Azure Databricks, Azure Data Factory, Azure Synapse Analytics, Azure Data Lake Storage Data Processing, Transformation and Analysis - Apache Spark, PySpark, Pandas Data Visualisation - PowerBI Data Storage Formats - CSV, Microsoft Excel, Google Sheets, Parquet Others - Jupyter Notebook, ipynbApache SparkAlgorithm DevelopmentData ManagementJavaData AnalysisData StructuresResumeInterview PreparationCandidate InterviewingMachine LearningData ScienceCareer CoachingPySparkPythonSQL - $80 hourly
- 3.0/5
- (22 jobs)
This is Amber Ameer, professionally providing IT services based on my skills and experience. I have 9+ years of experience building Data Lake, Data Warehousing & Business Intelligence solutions. Hands-on DevOps engineer & ETL developer. Development team lead experience from design, documentation, coding, testing & review, through to release & support. Experienced in data warehouse (Redshift) optimization projects. Good communication & networking skills. Adaptable, resourceful, and conscientious. Happy to work both independently and as part of a team. Core Strengths • Python • Certified Cloud Specialist (Amazon Web Services) • DevOps Specialization (Cloudformation, Codecommit, Codepipeline, CodeBuild, Code Deploy, Jenkins) • Dockers (Amazon ECS ECR, Fargate, Kubernetes) • Data Transformation (Glue, Spark, ETL) • Solving Problems • Continuous Learning & Improvement • Sharing Knowledge • Passionate about ITApache SparkData AnalysisData Science ConsultationDatabaseETL PipelineData MigrationETLBig DataDevOpsPostgreSQLDockerJavaAmazon Web ServicesPython - $50 hourly
- 0.0/5
- (0 jobs)
Currently a consultant working within the Data-Engineering domain. Also, a Software Engineer with over four years' experience working with Card payments and payment switches in Africa. Have a master's degree in Software Engineering for Financial Services at the University of Leicester. Have an interest mainly in financial technology, and looking to expand on my knowledge and skills by securing a job in Technology within the financial services industry or any other areas in Technology that will expose me to new technologies.Apache SparkBig DataApache MavenPayment ProcessingApache KafkaApache HivePythonScalaJavaAndroidAmazon Web Services - $51 hourly
- 0.0/5
- (1 job)
Big Data expert with 6+ years of experience, leading projects for 𝐖𝐢𝐳𝐳𝐀𝐢𝐫, 𝐂𝐢𝐬𝐜𝐨, 𝐂𝐨𝐨𝐩 and for successful Start-Ups 𝐁𝐫𝐨𝐲𝐚𝐥𝐢𝐯𝐢𝐧𝐠.𝐜𝐨𝐦 (leading organic meals e-commerce in Canada), 𝐌𝐞𝐝𝐢𝐤𝐢𝐭.𝐮𝐚 (largest telemedicine company in Ukraine, 𝟏𝟎𝟎 𝟎𝟎𝟎+ 𝗰𝗼𝗻𝘀𝘂𝗹𝘁𝗮𝘁𝗶𝗼𝗻𝘀 𝐦𝐚𝐝𝐞) 𝐖𝐡𝐚𝐭 𝐈 𝐝𝐨 𝐟𝐨𝐫 𝐜𝐥𝐢𝐞𝐧𝐭𝐬: ✅ Data Architecture - Designing scalable, resilient, and efficient big data architectures. ✅ Data Management - managing data lakes and data warehouses, optimizing data for query performance, ensuring data quality and governance (Unity Catalog, Data Fabric). ✅ Data Orchestration - Airflow, Data Factory, Databricks Jobs, Lambda, Azure Functions. ✅ Data Processing (ETL Pipelines Construction) - Spark, Glue, Athena, MapReduce, Databricks, DBT ✅ Real-Time Data Streaming & Processing - Fivetran, Apache Kafka, Spark Streaming (structured/unstructured), Flux, SQS, SNS, Service Bus, Event Hub, NiFy, ✅ Visualisations & Dashboarding - PowerBI, Tableau, Looker, Grafana, InfluxDB, Splunk, Kibana ✅ Data Analytics - Apache Hive, Apache Impala, HDinsights, Presto for SQL-based data querying and analysis. ✅ Data Quality - Great Expectations, DBT Testing, Python Decorator Testing, TestContainers ✅ Cloud Solutions - AWS, Azure, GCP 𝐒𝐨𝐦𝐞 𝐨𝐟 𝐦𝐲 𝐫𝐞𝐜𝐞𝐧𝐭 𝐩𝐫𝐨𝐣𝐞𝐜𝐭𝐬: - 𝐄𝐓𝐋 𝐩𝐢𝐩𝐞𝐥𝐢𝐧𝐞𝐬 for ensuring batch data processing within centralised data warehouse for a 𝐆𝐞𝐫𝐦𝐚𝐧 𝐥𝐨𝐠𝐢𝐬𝐭𝐢𝐜𝐬 𝐜𝐨𝐦𝐩𝐚𝐧𝐲. - 𝐃𝐚𝐭𝐚 𝐦𝐢𝐠𝐫𝐚𝐭𝐢𝐨𝐧 from Hadoop to AWS infrastructure for 𝐅𝐨𝐫𝐭𝐮𝐧𝐞 𝟓𝟎𝟎 𝐞𝐧𝐭𝐞𝐫𝐩𝐫𝐢𝐬𝐞. - 𝐒𝐩𝐚𝐫𝐤 𝐒𝐭𝐫𝐞𝐚𝐦𝐢𝐧𝐠 application for KPI calculations & data enrichment, enabling detection of network vulnerabilities at the 𝐔𝐒-𝐛𝐚𝐬𝐞𝐝 𝐭𝐞𝐥𝐞𝐜𝐨𝐦. - Unique 𝐝𝐚𝐭𝐚 𝐚𝐫𝐜𝐡𝐢𝐭𝐞𝐜𝐭𝐮𝐫𝐞 𝐥𝐨𝐠𝐢𝐜 for a pharmaceutical research centre in the US - 𝐀𝐈-𝐝𝐫𝐢𝐯𝐞𝐧 𝐝𝐚𝐭𝐚 𝐩𝐥𝐚𝐭𝐟𝐨𝐫𝐦 to perform advanced analytics for the largest groceries 𝐫𝐞𝐭𝐚𝐢𝐥𝐞𝐫 𝐢𝐧 𝐃𝐞𝐧𝐦𝐚𝐫𝐤. 𝐌𝐲 𝐭𝐫𝐚𝐜𝐤 𝐫𝐞𝐜𝐨𝐫𝐝: - 10+ major projects in 𝐋𝐨𝐠𝐢𝐬𝐭𝐢𝐜𝐬, 𝐀𝐯𝐢𝐚𝐭𝐢𝐨𝐧, 𝐑𝐞𝐭𝐚𝐢𝐥/𝐄-𝐜𝐨𝐦𝐦𝐞𝐫𝐜𝐞, 𝐓𝐞𝐥𝐞𝐜𝐨𝐦, 𝐇𝐞𝐚𝐥𝐭𝐡𝐜𝐚𝐫𝐞 industries - 𝐔𝐒𝐃 𝟓𝟎𝟎𝐤+ in project value delivered to Enterprise-level clients - Recognised '𝐁𝐞𝐬𝐭 𝐓𝐞𝐜𝐡𝐧𝐢𝐜𝐚𝐥 𝐒𝐨𝐥𝐮𝐭𝐢𝐨𝐧' by WizzAir in 2023. - Led 𝟓+ 𝐩𝐞𝐫𝐬𝐨𝐧 𝐞𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐢𝐧𝐠 𝐭𝐞𝐚𝐦𝐬 for both end-to-end projects and technical parts of large project - Strong business acumen ✅ 𝗣𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗶𝗻𝗴 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲𝘀 𝗮𝗻𝗱 𝗖𝗼𝗿𝗲 𝗧𝗲𝗰𝗵𝗻𝗼𝗹𝗼𝗴𝗶𝗲𝘀 - Python, Scala, Java, SQL, C#, C, C++, Bash Scripting, PowerShell ✅ 𝐃𝐚𝐭𝐚𝐛𝐚𝐬𝐞𝐬 - MySQL - SQL (MS SQL, Oracle, PostgresSQL, BigQuery) - MS SQL - NoSQL - MongoDB - PostgreSQL - Redis - TimescaleDB - Snowflake ✅ 𝐀𝐝𝐝𝐢𝐭𝐢𝐨𝐧𝐚𝐥 𝐂𝐥𝐨𝐮𝐝 𝐒𝐞𝐫𝐯𝐢𝐜𝐞𝐬: - AWS (S3, SageMaker, Cognito, EKS, ECS, RDS, IAM) - Azure (Synapse, ADLS, Container Apps, Functions) - GCP (BigQuery, Hadoop, Blob Storage, SQL Server) 𝐑𝐞𝐯𝐢𝐞𝐰𝐬: ⭐⭐⭐⭐⭐ - 𝐍𝐢𝐦𝐚 𝐒𝐨𝐭𝐨𝐚𝐝𝐞𝐡, 𝐂𝐄𝐎 𝐨𝐟 𝐁𝐫𝐨𝐲𝐚 Nick has been an incredible asset to us at Broya. His data engineering expertise is not only profound but also practically applied to real-world problems. His work has significantly contributed to the success of our projects and data pipeline creations. He is a true professional and a pleasure to work with. ⭐⭐⭐⭐⭐ - 𝐑𝐮𝐬𝐥𝐚𝐧 𝐊𝐫𝐚𝐯𝐞𝐭𝐬, 𝐂𝐄𝐎 𝐨𝐟 𝐌𝐞𝐝𝐢𝐤𝐢𝐭 Working with Nick has transformed our approach to data management at 𝐌𝐞𝐝𝐢𝐤𝐢𝐭. His contributions were crucial in developing an efficient and automated data management system, complemented by real-time dashboarding, which has streamlined our operations and decision-making processes. Beyond his technical expertise, Nick's commitment to excellence has played a key role in enhancing our data handling capabilities and overall operational efficiency.Apache SparkCloud SecurityCloud ServicesApache KafkaDatabricks PlatformSQLETL PipelineData MigrationDatabase OptimizationDatabase ArchitectureData VisualizationData Quality AssessmentData AnalyticsData EngineeringBig Data - $20 hourly
- 5.0/5
- (8 jobs)
Dynamic and results-driven Cloud Data Engineer with a proven track record in designing and implementing end-to-end batch and streaming data pipelines. Proficient in orchestrating data workflows within multi-cloud environments, combining technical expertise with a commitment to optimizing data-driven solutions for enhanced business outcomes. Put your faith in me!Apache SparkAgile Software DevelopmentTerraformAmazon Web ServicesGoogle Cloud PlatformApache KafkaPySparkApache AirflowSQLPython - $35 hourly
- 0.0/5
- (0 jobs)
I am a seasoned Data Engineer with years of experience in designing, developing, and optimising data architectures. Passionate about transforming raw data into actionable insights, I excel at building robust and scalable data pipelines, implementing efficient ETL processes, and ensuring data integrity across diverse platforms.Apache SparkCloud SecurityBig DataPythonSQL ProgrammingSolution ArchitectureCloud ComputingData EngineeringAzure DevOps - $60 hourly
- 0.0/5
- (0 jobs)
Persistent problem-solver, always in pursuit of simple, clever, and optimal solutions. Versatile Data Specialist with a PhD in Electrical Engineering and several years of experience in both research and industry across different fields. Strong analytical and communication skills; adept at solving real world problems, both independently and as part of a team. - Python - Scala - ETL - SparkApache SparkProblem SolvingAnalytical PresentationPySparkScalaData ExtractionData AnalysisData MiningETL PipelineETLPython - $38 hourly
- 0.0/5
- (1 job)
Business Intelligence architect and developer with more than 10 years of experience in IT. Started as a hobby – programming small games and tools then turned into profession by programming POS systems, customising generic ETL systems to fit customer needs, turning business user stories into processes and dynamic reports. I have international experience in finding fraudulent activity, saving company's revenue, assisting marketing activities and giving important insights by presenting the data in a user-friendly way. My current academic experience in Big Data Science has given my also quite good experience in Cloud technologies (Azure, Amazon EC2), big data systems (Hadoop, Apache Spark) and Analysis tools (Python – scikit-learn).Apache SparkMicrosoft Power BISQLTerraformAWS AmplifyAWS GlueAmazon RedshiftAWS LambdaAmazon EC2SQL Server Integration ServicesJavaC#PythonApache HadoopTableau - $200 hourly
- 4.9/5
- (88 jobs)
A full-stack engineer with a background that lends itself to helping companies stay lean and connected whilst scaling up their customers and services. With 7 year's experience in providing DevOps solutions and services to Finance, Web3.0 and Data Analytics - I have been heavily involved in building scalable platforms for microservices on Kubernetes, migrating infrastructure and services to the cloud and creating build environments for growing teams of developers. Skills: Cloud migrations - AWS, Azure, GCP, DigitalOcean, on-premise, Hetzner Container orchestration - Kubernetes (k8s) Rancher, Docker Swarm, OpenShift Infra-as-code - Pulumi, Terraform, CloudFormation, Sceptre Continuous delivery/integration - Jenkins, DroneIO, Helm, Kubernetes, GoCD, Tilt, Earthly Database - Elasticsearch, MongoDB, MySQL, MSSSQL, Postgres Applications - Docker, Nginx, LAMP, CoreOS, Terraform, Tableau, MS Exchange, Nutanix, VMWare Horizon/vCenter, Kafka, Atlassian Jira/Confluence, Microsoft SQL Server, Microsoft Exchange CloudFormation, Hugo Networking: DNS, DHCP, VLANs, NAT, Cisco Switch/Firewall Languages: Strong - PowerShell, Bash, Python, YAML, JSON Intermediate - GoLang, NodeJS, JavaScript, HTML, CSS, C#, TSQL Basic - Haskel, OCaml, Rust Achievements (in the last 2 years): - Re-engineered SAAS architecture - migrating all production to microservices on Kubernetes reducing the company's total software expenditure by 40% - Developed Terraform templates to make an automated multi-cloud disaster recovery solution - Implemented build pipelines to allow developers to work with isolated and identical versions of dev, test, and prod - Product Owner and Scrum Master of an Agile software development project for iPhone app - Advocate the need for a transparent business vision by employing OKRs and helped to align cascading team OKRs down through the organisation - Automated the provisioning of on-premise Kubernetes clusters and build pipelines using Matchbox, Bash and Helm templating Qualifications and education: 2020 - Kubernetes Certified Applications Developer 2020 - Kubernetes Certified Administrator 2018 - AWS Certified Developer Associate 2018 - Agile Certified Practitioner 2008 - 1st class degree in Electronic Engineering and Cybernetics Please get in touch to if you think my background can be helpful to you.Apache SparkGrafanaAmazon ECSKubernetesDocker ComposeAmazon ECS for KubernetesContinuous IntegrationDockerJenkinsAmazon Web ServicesDevOpsTerraformMicrosoft Azure - $25 hourly
- 5.0/5
- (1 job)
I am a Fullstack developer with 5 years of experience. I have expertise in architecting and developing an anaytics product from scratch (fountain9 (.) com, Kronoscope), deploying services using nginx and Docker and contributed more than 80% of backend code. Expertise in multiple stacks including Backend development using Java and Python Django Framework, frontend development using React, ETL pipelines using Apache Spark. Expert knowledge in Database optimization at query level as well as configuration level. Implemented User and access management as well as Oauth integration with Social media (Google) Key Skills: • Backend development - Django Rest Framework (Python), Core Java and servlets, Pandas • Frontend: HTML/CSS, Javascript, React • ETL framework: Apache Spark (Using Scala, Python), Kafka connect, Apache NiFi • Databases: Postgres, MongoDB, Elasticsearch (searching), Cassandra • Other skills: Expertise in Ubuntu and CentOS, Cloud services AWS and GCP, HDFS, docker (compose, stack deploy, services using docker swarm), Git, MavenApache SparkETL PipelineLinuxReactElasticsearchMicroserviceDjangoJavaPostgreSQLMongoDB - $50 hourly
- 0.0/5
- (0 jobs)
I am a Microsoft-certified Data Analyst, Mechatronics and Robotics Engineer, and certified Azure Data Engineer. I work as a Technology and Analytics Consultant delivering Enterprise-scale Data Analytics solutions and Managed IT Services primarily for UK-based companies. Specializing in Cloud Infrastructure Management, Cloud Migration, ERP/CRM Development and Integration, Data Integration and ETL, Business Intelligence Reporting, and Power Platform Development, I have vast industry expertise in the following tools and technologies: 1. Cloud Infrastructure Management Amazon Web Services (AWS): AWS Management Console, AWS CloudTrail. Microsoft Azure: Azure Portal, Azure Automation, Azure Monitor. Google Cloud Platform (GCP): Google Cloud Console, Google Cloud Deployment Manager. Container Orchestration: Kubernetes, Docker Swarm. 2. Cloud Migration AWS: AWS Migration Hub, AWS Database Migration Service (DMS). Azure: Azure Migrate, Azure Integration Services, Azure Database Migration Service. GCP: Google Cloud Migrate. 3. ERP/CRM Development and Integration ERP Systems: SAP S/4HANA, Ellucian Cloud, Oracle ERP Cloud, Microsoft Dynamics 365. CRM Systems: Salesforce, Microsoft Dynamics 365 CRM, HubSpot. APIs and Middleware: RESTful APIs, Microsoft Power Automate. 4. Data Architecture and ETL (Extract, Transform, Load) ETL Tools: Azure Data Factory, Azure Synapse, SQL Server Integration Services (SSIS), Alteryx. Big Data Platforms: Apache Spark, Google BigQuery, AWS Redshift. Database Systems: SQL Server, MySQL, PostgreSQL, MongoDB. Data Warehousing: Snowflake, Microsoft Azure Synapse Analytics. 5. Business Intelligence Reporting BI Tools: Microsoft Power BI, Tableau, Looker. Data Visualization Libraries: Plotly, Matplotlib. 6. Power Platform Development Power BI: For data visualization and business intelligence. Power Apps: For building custom business applications. Power Automate: For workflow automation. Integration Tools: Microsoft Dataverse Connectors, Custom APIs. Development Tools: Visual Studio, Visual Studio Code, Azure DevOps, GitHub. Additionally, I am a Senior Data Analytics Lecturer at the UK's largest digital skills training provider and part-time Lecturer at the University of East London. I'm in the process of growing my skill set every day. As a seasoned IT professional based in London, I specialize in tackling the toughest challenges in the industry with precision and expertise. My approach is rooted in a deep understanding of cutting-edge technologies and a commitment to delivering bespoke solutions tailored to each client's unique needs. My clients over the years have ranged from startups to established enterprises, all praising my ability to not only solve immediate issues but also to anticipate future challenges and implement proactive strategies. This proactive approach has resulted in a track record of satisfaction and glowing testimonials from clients and colleagues alike. Delivering premium IT services in London requires a blend of technical acumen, innovative thinking, and a relentless dedication to quality. I pride myself on my meticulous attention to detail and my ability to communicate complex concepts in a clear, accessible manner. Whether it's optimizing infrastructure, enhancing cybersecurity measures, or developing scalable software solutions, I bring the highest level of professionalism and excellence to every project. Please reach out to discuss your project requirements, and together, we can transform your vision into reality.Apache SparkAzure Machine LearningSQLData ModelingETLData ScienceMicrosoft Power AutomatePostgreSQLMicrosoft AzureAnalytics DashboardMicrosoft Dynamics 365Data VisualizationMicrosoft PowerAppsBusiness IntelligenceMicrosoft Power BI - $13 hourly
- 0.0/5
- (1 job)
Aspiring Data Scientist looking for a role that challenges my ability and push me harder towards my goals. #python #datascienceApache SparkETL PipelineAnacondaDatabaseCore JavaABAPJupyter NotebookObject-Oriented ProgrammingData SciencePyTorchETLPythonJava - $50 hourly
- 0.0/5
- (0 jobs)
I specialize in guiding and supporting projects related to all things data. I can help with building efficient data systems and pipelines, and aligning these with your objectives.Apache SparkMicrosoft AzureAmazon Web ServicesCloud ComputingData VisualizationData AnalysisSoftware DevelopmentDatabricks PlatformDatabase ArchitecturePySparkSQLJavaScalaPythonData Engineering - $15 hourly
- 3.0/5
- (1 job)
Microsoft Azure | SQL | Synapse | Databricks | Azure Data Factory | Logic App | ETL Flow Processing | Power BI | Bigdata | Hive | Impala | Python | pySparkApache SparkData EngineeringBig DataMicrosoft Azure SQL DatabaseMicrosoft AzurePySparkMicrosoft Power BIDatabricks Platform - $25 hourly
- 0.0/5
- (0 jobs)
I am a full-stack developer with over 3 years of experience in building website, developing web applications and ERP systems. I recently completed a Master's degree in Big Data Analytics, gaining hands-on experience in data integration and big data technologies. I am now seeking software engineering or data engineering roles where I can leverage my diverse skill set to contribute effectively while continuing to expand my expertise.Apache SparkVue.jsOdooGitApache KafkaApache HadoopJavaScriptPythonWeb Development Want to browse more freelancers?
Sign up
How hiring on Upwork works
1. Post a job
Tell us what you need. Provide as many details as possible, but don’t worry about getting it perfect.
2. Talent comes to you
Get qualified proposals within 24 hours, and meet the candidates you’re excited about. Hire as soon as you’re ready.
3. Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
4. Payment simplified
Receive invoices and make payments through Upwork. Only pay for work you authorize.