Machine learning engineer for AI chatbot development and document retrieval system
Worldwide
Job Description (AI Chatbot RAG system) We are looking for a Machine Learning Engineer to build a RAG-based chatbot system using a local LLM that can answer questions from a collection of PDF documents. Project Overview We have a dataset consisting of multiple PDF files (mixed structure: text-heavy documents, notices, and tabular information inside PDFs). The goal is to build a system where users can ask natural language questions and get accurate answers grounded strictly in the content of these PDFs. The system must: Extract and process text from PDFs Chunk and index the content for retrieval Use embeddings + vector search for relevant context retrieval Use a local LLM (not API-based) to generate answers Ensure answers are strictly grounded in retrieved content (minimize hallucination) Key Responsibilities Design and implement a RAG pipeline for PDF documents Build a robust PDF parsing and chunking system Create embedding + vector database pipeline (FAISS, Chroma, etc.) Integrate a local LLM for inference (Ollama, vLLM, Transformers, etc.) Optimize retrieval quality and context selection Ensure system is stable and production-ready (not just a demo script) Required Skills Strong experience with RAG systems Experience with PDF parsing / document ingestion Knowledge of embeddings and vector search Experience with local LLMs (Ollama / vLLM / HuggingFace Transformers) Strong Python skills Nice to Have Experience with hybrid retrieval (BM25 + vector search) Experience handling noisy PDFs (OCR, scanned documents) Understanding of chunking strategies and retrieval evaluation Deliverables End-to-end working RAG chatbot PDF ingestion + preprocessing pipeline Vector index setup Local LLM integration Simple documentation explaining architecture How We Work We prefer engineers who think in terms of system design, not just libraries. Important Note This is not a basic chatbot or prompt engineering task. We are specifically looking for someone with real experience building RAG-based document intelligence systems.
$150.00
Fixed-price- IntermediateExperience Level
- Remote Job
- Ongoing projectProject Type
Skills and Expertise
Activity on this job
- Proposals:15 to 20
- Last viewed by client:5 days ago
- Interviewing:2
- Invites sent:0
- Unanswered invites:0
About the client
- Pakistan11:51 AM
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by