AI/RAG Developer & Data Engineer — Document Extraction Engine

Posted 6 days ago

Worldwide

Summary

Tech startup looking for an AI developer / data engineer to improve and scale the extraction engine of our existing AI document-processing software, DataXtrak. Important: the software already exists and works. We are not starting from scratch ,there is a functional prototype in place. We need someone to build on top of it, improve it and make it production-ready, not rebuild it from zero. Scope: -Enhance the existing document data-extraction pipeline (data engineering): make it handle large volumes reliably. -Improve / extend the AI / RAG system (retrieval-augmented generation) for more accurate, intelligent extraction. -Add strong data cleaning and structuring (handle messy, inconsistent or duplicate data and turn it into clean, reliable output — structured exports / spreadsheets). -Add an optimized database search system (fast, efficient querying and indexing across large datasets). -Set up the server architecture and deployment to move the existing software from desktop to a scalable, secure SaaS (servers, storage, APIs). -Ensure strong data security and confidentiality at every layer (we are a European company and must follow GDPR / European data-protection standards). Profile: Strong in Python, data engineering and document processing. Solid experience with LLMs / RAG systems. Experience in data cleaning and data quality (normalizing messy real-world data). Comfortable with databases and query optimization. Experience deploying secure, scalable server / cloud (SaaS) infrastructure. Mindful of data security and GDPR compliance. Able to work on and improve an existing codebase (not just greenfield projects). Engagement: remote, hourly. Goal: production-ready version before the end of August. Please share examples of similar AI / RAG and data-engineering projects you've built.

  • $7,000.00

    Fixed-price
  • Intermediate
    Experience Level
  • Remote Job
  • Complex project
    Project Type

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Learn more
Skills and Expertise
Mandatory skills
Machine Learning
Python
Activity on this job
  • Proposals:50+
  • Last viewed by client:1 hour ago
  • Interviewing:
    3
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Jun 13, 2026
  • France
    8:08 PM

Explore similar jobs on Upwork

Database University AssignmentsHourly‐ Posted 9 months ago
SQL
Database
Microsoft Excel
Database Design
Database Management
SQL Server Integration Services
Excel Macros
Excel Formula
Microsoft Power BI
Microsoft Excel PowerPivot
Power Query
Data Entry
Data Cleaning
Data Analytics
Data Extraction
AWS Glue
Apache Kafka
Python
HubSpot
Salesforce CRM
REST API
Node.js

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo