Data Engineer – Validation & Quality

Posted 3 days ago

Worldwide

Summary

Data Engineer – Validation & Quality
About the Role
We are looking for a Data Engineer – Validation & Quality to ensure that every dataset inside Perceive Now is verifiable, accurate, and audit-traceable. You will architect quantitative validation frameworks, build contradiction and anomaly detection systems, and integrate automated evidence scoring mechanisms into our 25-layer data reasoning pipeline.
Responsibilities
Design and implement validation frameworks using Python (Pandas, NumPy, Polars) for data quality enforcement, schema validation, and field-level consistency checks.


Build contradiction-detection and reconciliation pipelines leveraging rule-based systems, cosine similarity, and statistical control models.


Develop automated confidence scoring models for each record or “Evidence Bundle,” integrating factors like source reliability, freshness, and duplication metrics.


Orchestrate validation jobs through Temporal / Airflow / Prefect, ensuring deterministic execution and full observability.


Automate checksum verification, schema drift detection, and data sampling across hundreds of data sources.


Create and maintain lineage graphs and quality dashboards in PostgreSQL, OpenSearch, and Grafana for continuous visibility.


Collaborate with Kernel and Governance pods to embed validation metadata and scoring outputs directly into evidence objects.


Ensure compliance with enterprise-grade data governance and security frameworks (SOC 2, GDPR, ISO 27001).


Required Qualifications
5+ years of experience in data engineering, MLOps validation, or data quality automation.


Strong expertise in Python (Pandas, NumPy, Polars), SQL, and ETL optimization.


Proficiency in PostgreSQL query optimization, window functions, and materialized views for performance tuning.


Experience designing data lineage and reconciliation frameworks using audit tables or time-versioned stores.


Hands-on with Airflow / Prefect / Temporal for scheduled and event-driven pipelines.


Working knowledge of OpenTelemetry, Prometheus, and Grafana for pipeline observability.

Deliverables
  • Preferred Skills
  • Familiarity with Data Quality (DQ) frameworks like Great Expectations / Soda Core.
  • Experience integrating checksum, PII masking, and encryption verification layers.
  • Understanding of semantic versioning, schema registry systems, and data governance catalogs (e.g., OpenMetadata, Amundsen).
  • Key Performance Metrics
  • Validation Accuracy ≥ 99 %
  • Schema Drift Detection Time < 10 min
  • False Positive Rate in Contradiction Detection < 2 %
  • More than 30 hrs/week
    Hourly
  • 6+ months
    Duration
  • Expert
    Experience Level
  • $17.00

    -

    $25.00

    Hourly
  • Remote Job
  • Complex project
    Project Type
Skills and Expertise
Mandatory skills
Algorithm Development
Nice-to-have skills
Python
Machine Learning
Tools
Python
Pandas
NumPy
Polars
Activity on this job
  • Proposals:20 to 50
  • Last viewed by client:3 days ago
  • Interviewing:
    20
  • Invites sent:
    30
  • Unanswered invites:
    6
About the client
Member since Oct 29, 2025
  • India
    Ludhiana10:22 PM

Explore similar jobs on Upwork

Snowflake
Database Design
Data Integration
Data Preprocessing
Data Transformation
Data Migration
Data Engineering
ETL Pipeline
SQL
Looker
Data Visualization
Scripting Language
Database University AssignmentsHourly‐ Posted 1 month ago
SQL
Database
Microsoft Excel
Database Design
Database Management
SQL Server Integration Services
Excel Macros
Excel Formula
Microsoft Power BI
Microsoft Excel PowerPivot
Power Query
Data Entry
Data Cleaning
Data Analytics
Data Extraction

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo