Machine learning engineer for AI chatbot development and document retrieval system

Posted last week

Worldwide

Summary

Job Description (AI Chatbot RAG system)

We are looking for a Machine Learning Engineer to build a RAG-based chatbot system using a local LLM that can answer questions from a collection of PDF documents.

Project Overview

We have a dataset consisting of multiple PDF files (mixed structure: text-heavy documents, notices, and tabular information inside PDFs). The goal is to build a system where users can ask natural language questions and get accurate answers grounded strictly in the content of these PDFs.

The system must:

  • Extract and process text from PDFs
  • Chunk and index the content for retrieval
  • Use embeddings + vector search for relevant context retrieval
  • Use a local LLM (not API-based) to generate answers
  • Ensure answers are strictly grounded in retrieved content (minimize hallucination)

Key Responsibilities

  • Design and implement a RAG pipeline for PDF documents
  • Build a robust PDF parsing and chunking system
  • Create embedding + vector database pipeline (FAISS, Chroma, etc.)
  • Integrate a local LLM for inference (Ollama, vLLM, Transformers, etc.)
  • Optimize retrieval quality and context selection
  • Ensure system is stable and production-ready (not just a demo script)

Required Skills

  • Strong experience with RAG systems
  • Experience with PDF parsing / document ingestion
  • Knowledge of embeddings and vector search
  • Experience with local LLMs (Ollama / vLLM / HuggingFace Transformers)
  • Strong Python skills

Nice to Have

  • Experience with hybrid retrieval (BM25 + vector search)
  • Experience handling noisy PDFs (OCR, scanned documents)
  • Understanding of chunking strategies and retrieval evaluation

Deliverables

  • End-to-end working RAG chatbot
  • PDF ingestion + preprocessing pipeline
  • Vector index setup
  • Local LLM integration
  • Simple documentation explaining architecture

How We Work

We prefer engineers who think in terms of system design, not just libraries.

Important Note

This is not a basic chatbot or prompt engineering task. We are specifically looking for someone with real experience building RAG-based document intelligence systems.
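
For applicants who want a concrete picture of the flow described above (chunk, index, retrieve, then answer only from the retrieved context), here is a minimal Python sketch. It is an illustration under stated assumptions, not the client's design: the function names (`chunk_text`, `embed`, `retrieve`, `build_prompt`) are hypothetical, the bag-of-words `embed` is a toy stand-in for a real embedding model and vector index such as FAISS or Chroma, PDF extraction is assumed to have already produced plain text, and the call to the local LLM is left out.

```python
import math
import re
from collections import Counter


def chunk_text(text, size=40, overlap=10):
    """Split text into windows of `size` words that overlap by `overlap` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


def embed(text):
    """Toy bag-of-words vector (assumption: replaced by a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Assemble a prompt that restricts the model to the retrieved context."""
    context = "\n\n".join(f"[{i + 1}] {c}" for i, c in enumerate(context_chunks))
    return (
        "Answer using only the context below. "
        "If the answer is not in the context, say you do not know.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )
```

In a real system the string returned by `build_prompt` would be sent to the local model (for example through Ollama or vLLM), and the linear scan in `retrieve` would be replaced by a vector index lookup, optionally combined with BM25 for hybrid retrieval.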

  • $150.00
    Fixed-price
  • Intermediate
    Experience Level
  • Remote Job
  • Ongoing project
    Project Type
Skills and Expertise
Mandatory skills
Artificial Intelligence
Activity on this job
  • Proposals: 15 to 20
  • Last viewed by client: 5 days ago
  • Interviewing: 2
  • Invites sent: 0
  • Unanswered invites: 0
About the client
Member since Aug 9, 2023
  • Pakistan
    9:27 AM
