AI Data Engineer

Posted 2 weeks ago

Worldwide

Summary

Project description Join the Data Engineering team to contribute to the ongoing maintenance and improvement of an internal LLM-powered assistant that uses hosted LLM APIs and internal knowledge sources, with a focus on reliability, retrieval quality, and operational excellence. Responsibilities Maintain and enhance ingestion/enrichment pipelines for internal content (parsing/extraction, normalization, metadata enrichment, deduplication, and quality monitoring )Improve indexing and retrieval performance and quality (chunking/segmentation refinements, embedding/index update workflows, metadata filtering, caching) and support hybrid retrieval capabilities (vector + keyword/BM25 + metadata )Implement and maintain access-aware retrieval by propagating/enforcing document permissions through indexing and query-time filters, including audit logs and validation tests Improve source attribution so responses reliably point to the correct documents and sections in a consistent format .Extend and harden tool/workflow execution and automations (scheduled/trigger-based), including retries, timeouts, idempotency, concurrency controls, and run history Develop and maintain evaluation and regression testing (golden sets, automated scoring) and support structured comparisons across LLM providers/models as requiredOperate the platform in production: observability (logs/metrics/tracing), alerting, incident support, performance tuning, and cost controls, plus runbooks and handover documentation Skills Must have 8+ years of hands-on experience in Data Science and 5+ years in Machine Learning, with a proven track record, demonstrated through a robust portfolio of projects. Strong programming skills in languages such as Python and familiarity building ETL pipelines Expertise in SQL and experience with both relational (preferably Postgres) and NoSQL databases (Open Search or Elastic Search. Familiarity with AWS cloud platform and its services. Experience with version control systems (e.g., Git) and CI/CD pipeline s.Ability to build scalable infrastructure to embed and search very large number of documents. Ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines. Expertise in ML inference optimizations. Solid experience with Hybrid RAG, chunking/segmentation refinements, embedding/index update workflows, metadata filtering, caching, etc. Knowledge of network optimization for distributed ML training and inference. Understanding of distributed training patterns and checkpointing strategies. Strong English skills (B2 and higher) Strong verbal and written communication skill s.Ability to work independently and collaborate in a group. Nice to have Agile certification Oracle/Microsoft attestations and certifications Domain knowledge Trading and Capital Markets Languages - English: C1 Advanced

  • More than 30 hrs/week
    Hourly
  • 1-3 months
    Duration
  • Expert
    Experience Level
  • $20.00

    -

    $23.00

    Hourly
  • Remote Job
  • Ongoing project
    Project Type

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Learn more
Skills and Expertise
Mandatory skills
ETL Pipeline
SQL
Python
Activity on this job
  • Proposals:20 to 50
  • Last viewed by client:last week
  • Interviewing:
    4
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Apr 19, 2024
  • Poland
    Warsaw2:03 PM
  • $1K total spent
    1 hire, 0 active

Explore similar jobs on Upwork

Database University AssignmentsHourly‐ Posted 9 months ago
SQL
Database
Microsoft Excel
Database Design
Database Management
SQL Server Integration Services
Excel Macros
Excel Formula
Microsoft Power BI
Microsoft Excel PowerPivot
Power Query
Data Entry
Data Cleaning
Data Analytics
Data Extraction
AWS Glue
Apache Kafka
Python
HubSpot
Salesforce CRM
REST API
Node.js

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo