Hire the best Data Preprocessing specialists

Check out Data Preprocessing specialists with the skills you need for your next job.
  • $80 hourly
    Certified: Microsoft Data Analyst Associate Certified: Azure Data Engineer Certified: MCSA Database Development Professional Experience: Web Development Agency: FortisureIT I currently serve as Solutions Director at FortisureIT where we implement data and automation solutions for organizations across the United States. My knowledge in the industry comes from years of experience working with clients to help develop data solutions that add value to their business. If you are interested in leveraging data to add gain insights and add value to your organization and are looking for a trusted partner then I would encourage you to reach out for a conversation. Some of the technologies that we work with are: - Power BI - Azure cloud services - SQL Server - Snowflake - Tableau - Python - Web development using PHP as well as node.js - And many more...
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    SQL Programming
    Data Cleaning
    Data Modeling
    ETL Pipeline
    Database Development
    Python
    Data Migration
    Automation
    Microsoft SQL Server
    SQL
    Microsoft Power BI Development
    Microsoft Azure
    Microsoft Azure SQL Database
    Microsoft Power BI Data Visualization
    Microsoft Power BI
  • $75 hourly
    As a highly experienced Director of Technology at People First, he possesses extensive background in numerous technical fields including Data Analytics, Data Science, Machine Learning, AI and Web/App Development. With of BA from the University of Chicago and a Master of Public Policy from The University of Chicago Harris School, he has the technical knowledge and business acumen to drive innovation and growth. Throughout his career, he has served as a Business Analyst at Loop, an insur-tech startup, Chief Marketing & Product Officer at Modern Reliance, and was an Entrepreneur at the Polsky Center for Entrepreneurship and Innovation at The University of Chicago. His achievements include winning 2nd place in the 2019 College New Venture Challenge with Modern Reliance and being a finalist in the 2018 Social New Venture Challenge from the Chicago Booth Rustandy Center with Gather Activism. He currently manages and leads a team of over 10 full-time management and intern level employees, demonstrating his exemplary leadership and communication skills. With his broad technical expertise and business acumen, he is a valuable asset to any organization or project seeking to drive innovation and growth.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    ETL
    Facebook Ads Manager
    API
    International Development
    Airtable
    Data Analytics
    Continuous Integration
    Tech & IT
    API Development
    Anaconda
    Continuous Improvement
    ETL Pipeline
    Artificial Intelligence
    Technical Project Management
    Product Development
  • $40 hourly
    ✔️Experienced Data Engineer, specializing in Data ETL, Machine Learning pipelines, building managed/serverless solutions on AWS, and AWS Cloud Architecture. I worked with high profile oragnizations, and customers in my career, including the following: ✔️ A top 5 organization in the automotive industry (Fortune 500) ✔️ A top 3 organization in the railway public sector (Fortune 500) ✔️ An organization in the top 20 Brands in Jewlery (2.7B Euros revenue per year) Main Competencies: ✔️Data Pipeline development ✔️Building Dashboards ✔️Cloud Architecture ✔️DevOps knowhow ✔️Stakeholder management ✔️Requirements Analysis ✔️Troubleshooting ✔️Knowledge transfer Main Technologies: ✔️Python, Jupyter ✔️SQL ✔️PySpark ✔️AWS S3, EMR Serverless, Glue, Athena, Redshift, SNS, EKS, EC2, VPC, CloudFormation ✔️Apache AirFlow, Kafka, NiFi ✔️Docker, Kubernetes Why work together? ✔️Clear understanding, and breakdown of requested services ✔️Correct, and timely delivery ✔️Responsiveness Please reach out to me, so that we can discuss how to address your business needs.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Amazon S3
    Amazon Athena
    Jira
    Jupyter Notebook
    PySpark
    AWS Glue
    AWS Lambda
    Data Integration
    ETL Pipeline
    JSON
    Data Extraction
    Amazon SageMaker
    Amazon Web Services
    Python
    Google Cloud Platform
  • $120 hourly
    With 12 years experience in Data Engineering, ETL/ELT, Business Intelligence and Dataviz, I am the perfect contractor to help your company capitalize on its data. I specialize in building Business Intelligence platforms for companies that want to develop a data-driven mindset. I thrive managing complex projects involving data collection, integration and visualization. My goal is to democratize data and empower users to make impactful decisions with dynamic reporting, no matter on which data literacy level they are. My skills include: - Databases: SQL, MySQL, Oracle, PostgreSQL, SQL Server. - Data integration: Fivetran, Stitch, Airbyte, Matillion, Azure Data Factory, Talend. - Data transformation: SQL, dbt, Python, Airflow, Dagster. - Data warehousing: AWS Redshift, BigQuery, Azure, Snowflake, Panoply. - Data visualization: Tableau, Microsoft Power BI, Looker, Metabase, Google Data Studio (now Looker Studio), Domo, AWS QuickSight, Klipfolio, QlikView/Sense, Mode, Superset. All work is completed by me, and you can be assured that I will not outsource to anyone else. I believe that clear, transparent and regular communication goes hand in hand with project success and I will always ask dozens of question to make it happen. I pride myself on providing the highest quality of work, and will never complete a project until the client is fully satisfied.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Data Integration
    Data Ingestion
    ETL Pipeline
    Database Design
    Data Warehousing
    ETL
    Business Intelligence
    Data Visualization
    Looker Studio
    Python
    Talend Open Studio
    Microsoft Power BI
    SQL
    Tableau
  • $30 hourly
    I am an analytics professional with 6+ years of experience in Analytics, Visualization, Machine Learning, ETL and Data warehousing in the domain of Financial processes and Resource Utilizations. Also, I have worked with end to end deployment of the machine learning models in the cloud. Skill set : ETL - SSIS, Azure Data Factory, Google Dataflow, Airflow Datawarehousing - SQL, MongoDB and snowlfake Visualization - Tableau and Power BI reports Programming Languages - Python, SQL, C# and Golang
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    SQL Server Integration Services
    Data Migration
    Azure DevOps
    Microsoft Azure
    Data Analysis
    SQL
    Data Visualization
    Microsoft Power BI
    Machine Learning
    pandas
    Machine Learning Model
    Python
    Azure Machine Learning
  • $60 hourly
    🏆 𝗧𝗢𝗣 𝗥𝗔𝗧𝗘𝗗 𝗣𝗟𝗨𝗦 - among the top 3% talent on Upwork 🏆 ⭐️ 𝟒𝟖+ happy clients ⭐️ 𝟑𝟎𝟎𝟎+ hours clocked A proficient Data Consultant possessing exceptional skills that can aid businesses in effectively storing and processing their data, subsequently converting it into valuable actionable insights and predictions. I'm a Computer Science (BSCS) graduate having 7+ years of experience as a Big Data Consultant in the financial, e-commerce, marketing, healthcare, real-estate and e-gaming sector. I have expertise in building end to end Big Data solutions along with development of Business Intelligence Semantic layers. I am an expert in designing and devising Data Strategy Plans and Data frameworks and have implemented Hadoop architecture, Oracle Cloud Infrastructure, Azure, GCP and AWS Cloud Data architecture, Databricks and Snowflake. I have hands-on experience with ETL implementation using tools such as Informatica Cloud (IICS), Informatica BDM services, SSIS, Talend and Fivetran. Also, have experience with Data Pipelines deployment on Docker and Kubernetes for multiple organizations. I can help automate your tasks and solve complex problems to get the most out of your data. My Technical Expertise are listed below: ◾ Big Data Stack (On Premises): Big Data solutions using Cloudera Hadoop, Denodo, Spark, Impala, Hive, Flink, Airflow, Kafka, Nifi Building ETL pipelines and writing ETL scripts (Alteryx, Informatica BDM, Informatica Power Center, SSIS, Talend, Fivetran) Deployment: Github Actions, Pulumi, Docker ◾ Big Data Cloud Technologies: Microsoft Azure: Azure Data Factory, Azure Synapse Analytics, Azure Databricks, Azure SQL DB, Azure Cosmos DB, Cloud function AWS: AWS Redshift, AWS RDS, AWS Glue, AWS Data Pipeline, AWS DataBrew, AWS Lambda, S3, EMR, Sage Maker GCP: Google Big Query, Google Air Table, Google Dataproc, Google Pub/Sub Databases: Microsoft SQL Server, PostgreSQL, Oracle, MongoDB, Cassandra and many more. ◾ Data Warehousing and Analytics: 1️⃣ Data Modelling 2️⃣ Reporting (SSRS) 3️⃣ Data Analysis using Pandas 4️⃣ Data Cleaning, Visualizations, Pre-Processing 5️⃣ Data Analytics BI Tools (ClickUp, Power BI, Amazon QuickSight, Tableau, Google Data Studio, Qlik, Looker, Lookml, Sisense, Zoho Analytics, Domo, Grafana, Mixpanel) 6️⃣ Chatbots and NLP problems ◾ Computer vision and object recognition: I have a strong technical skill-set when it comes to working with Convolutional Neural Networks. I have worked on varied set of tasks with computer vision like: pose-recognition, multi-object-multi-cam detection, human tracking etc. Feel free to reach out to me if you need any consultation :) Thanks
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    AWS Glue
    ETL
    Apache Spark
    dbt
    Amazon Redshift
    Databricks Platform
    Data Warehousing
    Apache Kafka
    Snowflake
    QlikView
    Python
    BigQuery
    Apache Hive
    SQL
    Data Visualization
  • $100 hourly
    Senior Data Scientist with NLP specialization with over 7 years of experience in both the software industry and academic research, where I have demonstrated a strong ability to develop, deploy, maintain, and optimize AI solutions. I take pride in delivering high-quality work in the AI industry, and my clients have praised my ability to provide expert advice, deliver clean and well-structured code, and offer suggestions for improvement. My technical skills extend beyond machine learning and NLP, as I have extensive experience with computer science concepts such as parallel computing, graph theory, clean coding, code optimization, and debugging. I am highly proficient in several programming languages, including Python, which I have mastered with a 5-star rating. I have also worked extensively with AWS, SQL, GCP, and PySpark. My fields of expertise in NLP are: ★ Text Classification ★ Topic Modeling ★ Chatbot Development ★ Search Engines ★ Named Entity Recognition ★ Text Similarity ★ Question Answering Tools that I'm an expert: ★ Python: scikit-learn, pandas, NumPy, TensorFlow/Keras, Streamlit, SciPy, NLTK, Gensim, spaCy, CoreNLP, OpenAI API, and much more! ★ Amazon Web Services (AWS): EC2, S3, Lambda, RDS, Batch, DynamoDB, RedShift, SQS, SNS, Glue, SageMaker, etc. ★ SQL: PostgreSQL, SQL Server, MySQL. ★ NoSQL: MongoDB, DynamoDB. ★ Vector Databases: Milvus, Pinecone, Weaviate. ★ Visualization: Streamlit, Tableau, PowerBI. Recent feedbacks: ▸ "Christian is a great developer, and asks relevant questions for the problems we give him, he's not just a "pair of hands" but a helpful advisor for improving your initial suggestion on how to solve the problem. He's very fast to iterate and to develop and delivers code with great quality." ▸ "Went above and beyond to help with code for a large project, and completed tasks quickly and efficiently. Understood exactly what was needed for the job and executed with precision. I will absolutely be working with him again in the future!"
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Generative AI
    Artificial Intelligence
    Search Engine
    Data Engineering
    Big Data
    PySpark
    Google Cloud Platform
    Amazon Web Services
    GPT-3
    ChatGPT
    Python
    Machine Learning
    Natural Language Processing
    Data Science
    Deep Learning
  • $50 hourly
    I am an expert python/machine learning programmer working in Germany My Qualifications are : ✅ Machine Learning Engineer (Germany) ✅ Master's of Embedded Systems (Germany) ✅ Certified TensorFlow Developer ✅ Certified Professional Data Analyst My Progress on Upworks : ✅ Top rated PLUS Freelancer ✅ 141 Tasks with excellent client feedback on current and finished tasks You can check my work history as a proof on my proficiency in : ✳ Python/Machine Learning Tasks ✳ Python/Machine learning Tutoring ✳ Technical Content Creator Let's do some amazing work together !
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    BigQuery
    Blog Writing
    Data Labeling
    Exploratory Data Analysis
    Teaching Programming
    Teaching Algebra
    Microsoft Azure SQL Database
    C#
    Python
    TensorFlow
  • $30 hourly
    I specialize in sharp, useful Tableau dashboards. I'm Cost Effective! You will get a superior ratio of talent for what you pay. ✅ Why Me ✅ 4x Viz of the Day 🏆 on Tableau Public ✅ Not just pretty charts, but actionable metrics that update ✅ 100% ORIGINAL WORK ✅ Analytics Masters at Georgia Tech ✅ Anything to Satisfy the Customer ❌ Other Freelancers ❌ Work copied from Tableau Public ❌ Few developers have talent for VOD ✔️ I GUARANTEE YOU WILL RECEIVE a 100% scalable, BEAUTIFUL, unique, USEFUL solution. Business dashboards 📊 Deliver data analytics solutions. Descriptive and Predictive analytics. Tools —————- SQL, R, Python, Tableau, Data Studio
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Looker
    Looker Studio
    Python
    SQL
    R
    Tableau
  • $40 hourly
    Personal skills • Quality-oriented. I am interested in researching best practices and creating solutions that are resilient beyond the current context. In the long term, 5-stars systems pay off the extra effort. • Communicative. I don't like waiting for 3 days to receive a reply, so I don't do that to others. You can expect same day feedback from me, at the very least a "I'll get back to you later on this one". • Critical thinker: If I think you’re wrong in you're requirements, I’ll tell you and suggest alternative solutions :) Other than that, I consider myself a friendly and approachable person, who loves to help my colleagues and clients whenever I can! Technical skills —— Data Engineering • Expertise in Python: I ranked in the top 15% out of 1.3 million people on the LinkedIn Python assessment (see portfolio). • SQL • ETL/ELT with Python, Databricks (Pyspark), DBT, Dagster, Airbyte and a lot of AWS services. • Python Google Style Guide. • Agile, Extreme Programming (XP) & Clean Code (and Google Python Style Guide) —— Cloud/DevOps • AWS: Batch, Step Functions, Glue, Athena, Boto3, Lambda, S3, EC2, IAM, KMS, SQS, etc. • Bash. Docker + ECS. CI/CD - Github actions. Terraform, SAM, CodePipeline.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Amazon S3
    Amazon API Gateway
    Terraform
    Tableau
    pandas
    AWS Lambda
    Google Cloud Platform
    Amazon Web Services
    dbt
    PySpark
    Databricks Platform
    Apache Spark
    Python
    Docker
  • $90 hourly
    Senior Data Engineer. Former Google, and McKinsey employee. Certifications: - Professional Data Engineer GCP - Databricks Certified Associate Developer for Apache Spark 3.0 Do you have complex data? Don't know what to do with your data? Don't you have any data? Do you have a complex problem? Do you want to optimize code? Do you need a quick, high quality and documented solution? If the answer to any of this question is "yes", then I am the person you are looking for. I am not an engineer that solves problem, I am a problem solver that does code. Since I was 10 years old I participated and won multiple mathematics and programming contests, nationally and internationally. I studied Mathematics and teach competitive math/programming since I was 16! (HackerRank profile: hec10r). Even more, I have worked as a Senior Data Engineer consultant at McKinsey and Google. I do have *proven* experience solving the hardest problems for some of the most important companies in the world. I have experience in multiple industries (insurance, O&G, agriculture, procurement) performing different Data Engineering tasks. I have strong communication and leadership skills that I have developed working as a consultant and leading Data teams. I feel comfortable working with different Data Engineer tools and frameworks. I have expertise with: Cloud providers: - Google Cloud Platform (Cloud Storage, BigQuery, Pub/Sub, Cloud SQL, Cloud Spanner, Composer, Dataflow, Looker) - Microsoft Azure (Data Factory, Databricks, SQL Server, Analysis Services, Blob Storage, PowerBI) - Amazon Web Services (S3, RDS, Lambda) Programming languages - Python (Pandas, NumPy, Kedro, Poetry, Anaconda) - Spark (PySpark) - SQL/NoSQL (BigQuery, Oracle, Postgres, MySQL, SQL Server, GraphQL) - JavaScript (NodeJs) Visualization tools - Python (Dash, Matplotlib) - Power BI (DAX guru :)) - Looker
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Google Dataflow
    API
    Data Processing
    Looker
    Data Analysis
    Business Intelligence
    Data Visualization
    Google Cloud Platform
    Microsoft Power BI
    Databricks Platform
    PySpark
    SQL
    Python
    Data Migration
    ETL Pipeline
  • $20 hourly
    Want to work with someone who has 16 years development and solution architect experience? Who always puts his client's needs first and provide them exactly what they want? Working with me you will, ⭐ STOP STRUGGLING with low sales conversion rate. ⭐ BE AT EASE with your website/mobile app standing out from your competitors. ⭐ NOT WORRY managing your whole project, I will take it from start to 100% completion. ⭐ SAVE MONEY and then reinvest it back into your business for more growth ⭐ WORKING with a full-stack software engineer to take care of all your website and mobile app requirements. I have been working with one of the leading clients here on Upwork and here is what some of them would like to say, ✔ ""It was awesome working with Umair. Great communicator and he has great ideas." ✔ "Umair is a dependable senior software engineer that provided his expertise in helping us build out our platform. Umair has strong technical and problem-solving skills. He has a good personality, can work with a team or by himself, communicates well, was available when we needed him, is self-motivated, and is dependable. I recommend Umair and would gladly work with him again." ✔ "Umair was a pleasure to work with!" ABOUT ME, I only take on the projects that I know I can take to the next level and complete them with 100% client satisfaction and deliverable. As you can see I have reached the TOP-RATED badge here by providing quality and on time work to my clients. I have a 70% client recall rate and 50% client referrals. Send me a massage and click the green "Schedule a meeting" button, choose 15-30 mins and I will confirm a timeslot.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    React Native
    Data Scraping
    Machine Learning
    Django
    Angular
    Python-Requests
    Node.js
    Python
    React
    ChatGPT
    NGINX
    API Integration
    Docker
    Flask
    Git
  • $80 hourly
    Kalyan Kuramana, Founder at Mrikal Studios building a product studio for high quality product engineers and growth companies. Previously, he was CTO at Begig (Tech Mahindra garage venture), founded an edtech startup. He has his expertise in building fact paced iterative MVPs, creating scalable architecture for growth companies and bringing personalisation and adaptability with self-learning algorithms. He holds Master’s and Bachelor’s degree from the IIT Kharagpur and was featured in Forbes Asia 30 Under 30 (Class of 2019).
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    AWS Fargate
    Apache Superset
    SQL Programming
    Flask
    AWS Lambda
    CI/CD
    Algorithm Development
    API Development
    API
    Python
    JavaScript
    AWS Amplify
    Node.js
  • $125 hourly
    I offer expert level analytics services to my clients. I am available M-F during normal business hours and will communicate early and often during our working relationship. Areas of expertise include: - Data Modeling | SQL | DBT - Database Management | Fivetran | Segment | Redshift | Snowflake - Data Visualization | Tableau | Looker Studio | Mode - Marketing Analytics | Google Analytics | BigQuery | Google Tag Manager - Product Analytics | Heap | Product Strategy & Roadmap | Competitive Research - Conversion Rate Optimization | A/B Testing | Optimizely - Advanced Analytics | Python | R | Statistics
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    dbt
    Amazon Redshift
    Optimizely
    Heap
    Snowflake
    Google Analytics
    Data Analysis
    SQL
    Google Tag Manager
    BigQuery
    Tableau
    Product Analytics
    R
    Marketing Analytics
    A/B Testing
  • $20 hourly
    Greetings! I'm Salah Sammari, a dedicated Data Scientist with a focus on Natural Language Processing. Having accumulated over two years of hands-on experience in the realm of AI and machine learning, I'm reaching out to offer my expertise for your AI-driven endeavors. Professional Snapshot: My journey began with a solid foundation in Computer Science Engineering from the Higher School of Engineers Esprims in Tunisia. Over the past two years, I've been privileged to work with distinguished organizations such as DNEXT Intelligence SA and UBIAI. In these roles, I've not only implemented advanced NLP solutions but also successfully navigated challenges in trading platform optimization and extended data science training to budding enthusiasts. Core Competencies: NLP & Machine Learning: Expertise in various techniques ranging from sentiment analysis, topic modeling to Named Entity Recognition (NER). I've extensively worked with transformer models such as GPT, BERT, and LayoutLM. Programming & Tools: Proficient in Python and SQL (Postgres) with a keen understanding of data science libraries like Pandas-Numpy, Matplotlib-Seaborn, and Scikit-learn. My skill set also includes cloud platforms like AWS and Snowflake. Project Highlights: From developing AI-driven solutions for content filtering and recommendation engines to building transformer-based chatbots and leveraging OCR techniques, I've overseen multiple projects that required innovative problem-solving and rigorous model fine-tuning. Collaboration & Training: My cross-functional collaboration experience ensures smooth project executions. Additionally, as a Data Science Trainer at Ruspina Training Center, I've mentored over 150 students in Python, machine learning, and NLP. What Drives Me: I thrive on challenges and continually seek opportunities to apply my skills in diverse scenarios. My rank as a Kaggle Master, standing in the top 1%, speaks volumes about my passion for pushing the boundaries of what AI can achieve. The blend of rigorous academia, practical applications, and my incessant drive to learn has shaped my holistic approach to problem-solving.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Recommendation System
    Hugging Face
    LLM Prompt Engineering
    GPT-3
    Chatbot
    Transformer Model
    Natural Language Processing
    Data Analysis
    Machine Learning
    Data Visualization
    Data Science Consultation
    Machine Learning Model
    Data Science
    Python
    Deep Learning
  • $50 hourly
    Greetings! I a m a research scientist with ten years expertise in machine learning, artificial intelligence, data analysis, and all kinds of data-driven modeling work. I help with data analysis, data visualization, classification/regression analysis, predictive modeling, machine learning, deep learning, statistical tests, computer vision, forecasting, and etc. I am professional in R and Python. I am here to help with your project and provide professional consultation.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Analytical Presentation
    Artificial Intelligence
    Data Analysis
    Visualization
    Information Analysis
    Python
    R
    Machine Learning
    Machine Learning Model
  • $95 hourly
    I represent the top 1% of Upwork talent, with expertise in modern software development , deep learning neural networks and modern AI development. Having partnered with numerous startups and innovation centers across the globe, I possess the skills and insights to elevate your product or service. My Core expertise includes: 🚀 Deep Learning: 🌟 Expertise in neural network architectures such as CNNs, RNNs, GANs, and Transformers. 🌟 Proficiency in frameworks like TensorFlow, PyTorch, and Keras. 🌟 Experience in training models on large datasets and fine-tuning pre-trained models. 🌟 Knowledge of regularization techniques, optimization algorithms, and loss functions. 🌟 Hands-on experience with GPU computing and parallel processing. 🚀 Computer Vision: 🌟 Skilled in image and video processing techniques. 🌟 Familiarity with object detection, segmentation, and tracking algorithms. 🌟 Experience in implementing facial recognition and motion analysis systems. 🌟 Proficiency in OpenCV, Dlib, and other relevant and popular Computer Vision Python tools. 🌟 Ability to develop and optimize algorithms for real-time image processing. 🚀 Generative AI: 🌟 Large Language Models (LLMs): Developing and fine-tuning GPT-3 and similar models for diverse applications. 🌟 Small Language Models (SLMs): Efficient model implementation for resource-constrained environments. 🌟 Agents (Assistants & Copilots): Building context-aware virtual assistants and AI copilots. 🌟 Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs): Expertise in creative and commercial applications. 🌟 Reinforcement Learning (RL) and RL from Human Feedback (RLHF): Developing models for decision-making, enhanced by user interaction. 🚀 Speech/Music Synthesis: 🌟 Expert in speech recognition/generation, text-to-speech, speech-to-text technologies. 🌟 Knowledgeable in AI-driven music generation, melody and rhythm synthesis. 🌟 Proficient in audio signal processing, feature extraction, using tools like WaveNet, DeepVoice. 🚀 Related Peripheral Skills 🌟Front-end Development for UIs: React, React-Native, Flutter, Typescript, JavaScript, Dart. 🌟 Backend Development for Servers: Node.js, FastAPI, GraphQL, SQL and NoSQL DBs. 🌟 Platform|DevOps: CI/CD pipelines, Github, Docker, AWS, GCP, Azure and Nvidia Cloud
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Hugging Face
    LangChain
    Llama 2
    Node.js
    TypeScript
    ChatGPT
    GPT-4
    Amazon SageMaker
    TensorFlow
    PyTorch
    Deep Learning
    Machine Learning
    Chatbot
    Python
  • $35 hourly
    Experienced Data Engineer Manager with over 5 years of expertise in the field. Passionate about creating andleading teams that deliver optimal and scalable big data solutions. Firmly believe that data forms the foundationfor any company seeking to evolve and innovate. Highly interested in leveraging microservices-based platforms,harnessing the capabilities offered by cloud platforms, and exploring hybrid solutions. Committed to staying at theforefront of technology advancements to drive innovation and maximize the potential of data-driven initiatives. Experience ---------------------------------------------------------------- Data Engineering Chapter Lead/Manager ---------------------------------------------------------------- Experienced Data Engineer Manager with a strong background in leading and managing cross-functional teams. Proven expertise in ensuring seamless data flow on a data analytics platform. Skilledin building data transfer channels between data sources, datalakes, APIs, and our analytics platform.Adept at leveraging Kubernetes to manage microservices and maintain a generic ingestion framework,primarily composed of Apache Airflow and Apache Spark. Experienced in managing various types ofdata ingestion, including scheduled-based, event-based, fixed ingestions, wildcard ingestions, batchingestion, micro-batching ingestion, and streaming ingestion. Strong background in implementing dataquality and data integrity pipelines for loaded data. Skilled in coordinating teams to develop optimal andscalable solutions for parallel data flows. Additionally, responsible for maintaining data connectivity andingestion solutions on the Google Cloud Platform (GCP) using the suite of Google services. Experience: - Lead and manage a mixed team of Product Owners, Data Engineers, and SREs as both a Team Leadand People Manager. - Coordinate data flow on a data analytics platform by establishing robust transfer channels betweendata sources, datalakes, APIs, and the analytics platform. - Utilize Kubernetes to manage microservices and maintain a generic ingestion framework consisting of Apache Airflow and Apache Spark. - Manage various types of data ingestion, including scheduled-based, event-based, fixed ingestions,wildcard ingestions, batch ingestion, micro-batching ingestion, and streaming ingestion. Implement data quality and data integrity pipelines to ensure the accuracy and reliability of loaded data. - Coordinate teams to develop optimal and scalable solutions for parallel data flows, enablingtheoretically unlimited data flow capacity. - Maintain data connectivity and ingestion solutions on the Google Cloud Platform (GCP) using a rangeof Google services. ---------------------------------------------------------------- Freelance Data Engineer ---------------------------------------------------------------- As a freelance Data Engineer, I have worked on various projects involving data scraping and ETLpipeline development in different regulated domains such as pharmaceuticals, real estate, and e-commerce. My primary technologies and tools included Python, Selenium, Pandas, PySpark, Bashscripting, HTML, CSS, JavaScript, Requests library, Chromium, Red Hat servers, and Databricks. Key responsibilities: Utilized Python, Selenium, and web scraping frameworks to automate the extraction and parsing of datafrom websites, ensuring data accuracy and consistency. Employed Pandas and PySpark for data processing and transformation tasks, handling large volumesof data efficiently and performing necessary data cleansing and aggregation. Created custom Bash scripts for automation and orchestration of data scraping processes, ensuringseamless execution and scheduling on Red Hat servers and Databricks clusters. Developed web interfaces using HTML, CSS, and JavaScript for clients to interact with the scrapeddata, providing intuitive and user-friendly data exploration and visualization. Leveraged the Requests library to interact with web APIs, fetching and integrating data from externalsources into the data pipelines. Orchestrated and maintained the data scraping pipelines, ensuring continuous data availability andtimely updates as per client requirements. Designed, improved, migrated, and maintained ETL pipelines for handling large volumes of data. Utilized Python, Spark, Airflow, Databricks, dbutils, and Pandas for efficient data processing,transformation, and integration tasks. Leveraged Azure Synapse and Azure Data Factory for orchestrating and scheduling ETL workflows in acloud environment. Developed Bash scripts for automation and orchestration tasks, facilitating the deployment andexecution of ETL pipelines. Utilized Azure Blob Storage and S3 storage as the primary data storage solutions, ensuring reliable andscalable storage for the ETL pipelines.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Voice-Over
    Data Scraping
    Apache Airflow
    Engineering & Architecture
    pandas
    Python
    Apache Spark
    Selenium WebDriver
  • $25 hourly
    I'm a rising Machine Learning Engineer and Artificial Intelligence(AI) major graduate. My experiences include Deep Learning, Machine Learning and Data science. I worked on Computer Vision projects that include: - Image Classificationon - Object Detection - Object Tracking - Object Counting My other skills include: - Building interfaces for yolo inference - Scarping data, cleaning data and modelling. - Building Interfaces and apps for scrapping and cleaning data. - Building APIs and Interfaces to serve Machine Learning models. - Building Machine Learning and Data Science apps using Streamlit. Lately, I've been building Computer Vision systems using Yolo. Skills: - Languages: Python - Frameworks: PyTorch, FastAI, Flask, Open-CV, Streamlit - Libraries: NumPy, Pandas, Scikit-Learn, Matplotlib - Version control: git
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Web Scraping
    RESTful API
    Artificial Intelligence
    SQL
    Flask
    Machine Learning
    Python
    Deep Learning
    Data Science
    Natural Language Processing
    PyTorch
    Computer Vision
    Keras
    Neural Network
    Artificial Neural Network
  • $75 hourly
    With a decade of experience as a Solution Architect and Data Engineer, I am a multi-certified expert in various technologies, including Azure Solutions Architect, DevOps Engineer, Azure Developer, Azure Administrator, Power Platform Developer and Kubernetes Administrator. Expertise Database / Storage: • Azure Synapse • Azure SQL Database • SQL Server DB • PostgreSQL • MySQL • Cosmos DB • Amazon Aurora • DynamoDB • Amazon RDS • Oracle Database • Data Lake Storage • Blob Storage • S3 Bucket • Elastic File System • Google Cloud Storage Integration / Analytics: • SSIS • Data Factory • Databricks • TSQL • Analysis Services • Azure Perview • API Management • EventGrid • Logic Apps • Service Bus • DB Migration • Synapse Analytics • Stream Analytics • EventHub • AWS Glue • Lambda • Athena • Kinesis Data Firehose • Quick Sight • AWS Data Pipeline • Serverless SQL Pool • IntelliJ Datawarehouse / Dashboards: • SQL Server • Azure SQL Data Warehouse • Azure Synapse Analytics • Amazon Redshift • BigQuery • PostgreSQL Data Warehouse • Power BI Services • Tableau Services • QuickSight • QlickSense • Qlickview • SSRS • Power BI Embedded Security / Networking: • Azure Active Directory • Azure Key Vault • Azure DNS • Azure Firewall • Azure DevOps • Load Balancer • Traffic Manager • VPN Gateway • Amazon Cognito • Amazon Inspector • AWS Artifact • AWS IAM • Elastic Load Balancing • Aws Direct Connect • GitHub I am eager to assist you in any way possible and am committed to establishing a long-term customer relationship. Please do not hesitate to let me know how I can be of service. Respectfully, Josh Nothing is impossible, the word itself says I’m Possible
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Data Analysis
    Azure DevOps
    ETL
    Microsoft PowerApps
    PySpark
    Azure DevOps Server
    Microsoft SQL Server
    DevOps
    Microsoft Azure SQL Database
    Microsoft Power BI Development
    Microsoft Power Automate
    Databricks Platform
    SQL Server Integration Services
    SQL
    Microsoft Power BI
  • $40 hourly
    Answering business questions using statistical learning and machine learning techniques on business data to uncover valuable insights and make informed decisions - that’s my expertise. As an experienced engineer with over 20 years in the field of IT, I can integrate statistics, and business vision synergistically to provide relevant, precise, and concise answers. Be sure to check out my portfolio for practical examples.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Machine Learning
    Data Science
    Microsoft Excel
    SQL
    Statistical Analysis
    Python
    Report
    Data Visualization
  • $100 hourly
    I am an AI & Data Analytics technology entrepreneur with 14+ years of expertise in executing and delivering innovative and cutting edge products. I specialize in working with the latest AI & data tech & and have a lot of experience in working in all areas of AI + Data. What I can do for you: 1. Go from 0 to 1 on your AI ambitions: Complete consulting-to-implementation experience in the areas of LLMs & AI projects. I can help define your AI roadmap, to developing prototypes and providing complete production support for your AI + data needs. 2. Develop and implement your ambitions from data: I have helped clients setup their entire data infrastructure from scratch - from consulting on data extraction to deriving insights from the data. This can be a $0 data stack with open-source or an enterprise data stack with the best commercial products! 3. Automate parts (or whole!) of your business - I can help you navigate the way by developing solutions and tools that will automate your business by making use of AI and data analytics. 4. My work spans a wide range - from helping organizations get started with their AI & analytics ambitions from the ground up, to working with large enterprises that are looking to develop highly targeted solutions in the area of Data + AI. Technologies: AI / LLM: OpenAI / Azure OpenAI, Google Vertex, AWS Bedrock, AWS Sagemaker, llama 2 AI tooling: Langchain, llamaindex Cloud: AWS, GCP and Azure Data Extraction: Airflow, Fivetran, Airbyte, Python scripts Databases / data warehouses: Snowflake, Redshift, RDS, Azure Fabric, Databricks Data Modeling: dbt, SQL Data Visualization: Apache Superset, PowerBI, Tableau, Metabase, Grafana
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Data Engineering
    Artificial Intelligence
    Visualization
    Chatbot
    Analytics
    Data Visualization
    Data Analytics
  • $100 hourly
    A Top 1% Expert Vetted Data Scientist & AI Chatbot Engineer on Upwork with a 100% Job Success Rate. Hello! I'm Manu Bhardwaj, a seasoned Data Scientist and AI Chatbot Engineer, renowned for my expertise in turning raw data into meaningful insights. I have made significant contributions to predictive analytics, particularly with groundbreaking research in the hedge fund industry, developing innovative investment strategies. With a profound mastery of Python, R, and SQL, I excel in building complex data models, enhancing ETL processes, and leveraging machine learning to its fullest potential. My career journey is highlighted by numerous awards and certifications, showcasing my commitment to continual learning and achieving excellence in the field of data science. Recently, I have pivoted towards the exciting domain of AI Chatbot Engineering, immersing myself in advanced OpenAI technologies like ChatGPT and GPT-4. My experiences with Claude and Large Language Models (LLMs) have significantly sharpened my skills in creating intuitive and engaging chatbots, effectively combining the art of conversation with the science of Generative AI. I am deeply passionate about devising data-driven solutions that not only elevate business performance but also drive forward the frontiers of innovation. Constantly seeking new challenges in data science, I am keen to explore uncharted territories and make impactful contributions to this vibrant and evolving field. Seeking a top-tier expert in Data Science and AI Chatbot Engineering? Connect with me on Upwork, where my record as a top 1% professional with a 100% job success rate speaks volumes about my capabilities. Let's embark on a journey together in the data and AI landscape, unlocking new possibilities and steering your business towards unprecedented success.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Natural Language Processing
    LangChain
    AI Bot
    AI Agent Development
    NLP Tokenization
    Deep Learning
    C++
    Data Mining
    Back-End Development
    Artificial Intelligence
    Database Programming
    Predictive Analytics
    Machine Learning
    Data Science
    Data Analysis
  • $30 hourly
    Senior Data Engineer with more than 4 years of professional experience in Cloud Data/Software Engineering. CERTIFIED - SNOWFLAKE SPECIALIST (SnowPro) CERTIFIED - AZURE SPECIALIST CERTIFIED - ACM ICPC I am an experienced data professional with a diverse background in data warehousing, ETL, and data integration. Currently working at North European IGaming compamy, I specialize in DWH development in BigQuery, building transformations using DBT and Airflow, and managing real-time and batch integrations in Google Cloud
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    ETL
    Data Modeling
    Google Cloud Platform
    Data Warehousing
    Microsoft Azure
    BigQuery
    dbt
    Snowflake
    Apache Airflow
    SQL
    Python
    Data Engineering
  • $35 hourly
    Dynamic professional with 8+ years of experience in Supply Chain, Sales and Operations Planning (S&OP), and IT Business Analysis in Financial Services. Currently pursuing an MBA in Supply Chain at Syracuse University and holding a PMP-PMI certification with a Microsoft Certified Power BI Data Analyst Associate credential. Recently obtained CPIM certification, demonstrating advanced knowledge in production and inventory management, and a commitment to excellence in supply chain optimization. Proven track record in enhancing operational strategies, leading diverse teams, and implementing innovative solutions in technology and supply chain management. 💹 ANALYTICS & VISUALIZATION EXPERTISE My proficiency in using Excel, SQL, and Power BI & Google Data Studio (Looker Studio) allows me to extract meaningful insights from complex data sets. I can analyze your data to identify trends, patterns, and opportunities for optimization. 💻SOFTWARE DEVELOPMENT AND BUSINESS ANALYSIS With my experience in Software Development as a Business Analyst, I can provide valuable insights into your project's requirements and ensure its success. ☑️ SOFTWARE MASTERY I have mastered a range of software tools including: 📌Office: Microsoft 365 and Google Suite 📌Project management: JIRA, Confluence, Trello, Notion, ClickUp, Airtable, and MS Project 📌Design: Visme.co, Canva, Balsamiq, and Figma 📌Diagram: Draw.io, Visio, and Miro ️🎯 COLLABORATION AND RESULTS-DRIVEN With my diverse skill set and professional acumen, I am confident in my ability to deliver high-quality results that exceed your expectations. Let's collaborate to achieve your project goals.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Wireframe & Prototyping Software
    Microsoft Office
    Manufacturing & Construction
    Supply Chain & Logistics
    Organizational Background
    Analytics
    Startup Company
    Business Process Management
    Jira
    Microsoft Power BI
    Process Flow Diagram
    ERP Software
    Fashion Merchandising
    Supply Chain Management
    Business Intelligence
  • $115 hourly
    I am a Senior Consultant with a Master's degree in Information Science & Technology with a specialized certificate in Business Analytics & Data Science. My experiences include building data pipeline using SQL and python and using visualization tools such as Tableau and Microsoft Power BI to create custom dashboards. I have experience using various tools and applications, including Excel, Tableau, Power BI, Snowflake, Alteryx, AWS, Visual Studio, SSIS, SSRS, Azure DevOps, Databricks, Azure Data Factory, SAP HANA, SAP Digital Board Room, MicroStrategy, and Sigma, and I am proficient in SQL, Python, and R. I am also willing to learn any new tool or technology to get the job done.
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    HTML
    Node.js
    React
    Databricks Platform
    Alteryx, Inc.
    CSS
    ETL
    Microsoft SQL Server
    Data Warehousing
    Data Analysis
    Microsoft Power BI
    JavaScript
    Microsoft Excel
    Python
    R
    SQL
    Tableau
  • $60 hourly
    I am Lenin Mishra. I have 7 years of experience in building modern data stack pipelines. Currently, I am leading the analytics engineering team of a multinational bank with teams spread over 6 countries. I specialise in the Modern Data Stack(MDS). Some characteristics of an MDS are:- 1. They have a fully managed ELT data pipeline like Airbyte or Stitch or Fivetran. 2. A columnar storage data warehouse like Redshift, Snowflake, or Big Query to store data. 3. A data transformation tool like DBT. 4. A BI tool like Tableau or Looker or some sort of data visualization platform. So, if you need someone to help you in any one of the areas, do hire me!
    vsuc_fltilesrefresh_TrophyIcon Data Preprocessing Specialists
    Data Migration
    ETL Pipeline
    Data Warehousing & ETL Software
    Data Warehousing
    PostgreSQL
    Database Architecture
    Amazon Redshift
    Snowflake
    Database Design
    Fivetran
    dbt
    Talend Data Integration
    AWS Lambda
    SQL
    Python
  • Want to browse more freelancers?
    Sign up

How it works

1. Post a job (it’s free)

Tell us what you need. Provide as many details as possible, but don’t worry about getting it perfect.

2. Talent comes to you

Get qualified proposals within 24 hours, and meet the candidates you’re excited about. Hire as soon as you’re ready.

3. Collaborate easily

Use Upwork to chat or video call, share files, and track project progress right from the app.

4. Payment simplified

Receive invoices and make payments through Upwork. Only pay for work you authorize.

Trusted by

How do I hire a Data Preprocessing Specialist on Upwork?

You can hire a Data Preprocessing Specialist on Upwork in four simple steps:

  • Create a job post tailored to your Data Preprocessing Specialist project scope. We’ll walk you through the process step by step.
  • Browse top Data Preprocessing Specialist talent on Upwork and invite them to your project.
  • Once the proposals start flowing in, create a shortlist of top Data Preprocessing Specialist profiles and interview.
  • Hire the right Data Preprocessing Specialist for your project from Upwork, the world’s largest work marketplace.

At Upwork, we believe talent staffing should be easy.

How much does it cost to hire a Data Preprocessing Specialist?

Rates charged by Data Preprocessing Specialists on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.

Why hire a Data Preprocessing Specialist on Upwork?

As the world’s work marketplace, we connect highly-skilled freelance Data Preprocessing Specialists and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream Data Preprocessing Specialist team you need to succeed.

Can I hire a Data Preprocessing Specialist within 24 hours on Upwork?

Depending on availability and the quality of your job post, it’s entirely possible to sign up for Upwork and receive Data Preprocessing Specialist proposals within 24 hours of posting a job description.

Schedule a call