AI/LLM Expert & Instructor – Enterprise RAG, LoRA & Fine-Tuning Training (Remote)
Worldwide
Project Overview We are looking for an experienced AI/LLM Engineer and Technical Trainer to deliver an advanced training program for our engineering team focused on Enterprise LLMs, Retrieval-Augmented Generation (RAG), LoRA, and Efficient Fine-Tuning. The goal is to transform our technical team into experts capable of designing, deploying, and maintaining enterprise AI solutions using open-source LLMs. This engagement combines technical instruction, architecture guidance, hands-on labs, and mentoring. Training Scope Block 1 – Enterprise LLM & RAG Fundamentals Duration: 1–2 weeks Module 1: Embeddings & Vector Databases Topics include: How embeddings represent documents as vectors Semantic search concepts Vector similarity search Vector database architectures Storage options and tradeoffs Pinecone Weaviate Milvus Qdrant Indexing strategies Chunking techniques Metadata filtering Module 2: Retrieval & Generation Evaluation Teach best practices for evaluating RAG systems, including: Retrieval metrics NDCG MRR Recall Precision Generation quality metrics BLEU ROUGE Hallucination detection RAG benchmarking Evaluation pipelines Ground-truth datasets Module 3: Build an Internal Enterprise Chatbot (POC) Guide the team through building a production-style proof of concept: Enterprise documentation chatbot Private knowledge base Source citation and traceability Document ingestion pipeline RAG architecture Deployment within existing infrastructure Module 4: RAG Integration Platform Explain how to orchestrate RAG pipelines using a centralized platform, including: Model orchestration Prompt pipelines Retrieval workflows API integration Enterprise architecture Observability Block 2 – LoRA & Efficient Fine-Tuning Duration: 2–3 weeks Module 1: LoRA & QLoRA Fundamentals Topics include: Parameter-efficient fine-tuning LoRA architecture QLoRA Adapter-based training Low-rank matrices GPU memory optimization VRAM reduction Cost comparison: LoRA vs Full Fine-Tuning Performance tradeoffs Module 2: Dataset Preparation Teach best practices for: Dataset collection Cleaning Labeling Validation Balancing Support conversations FAQs Domain-specific documentation Synthetic data generation Module 3: Domain-Specific Adapter Training Hands-on training covering: Training LoRA adapters Financial domain Telecommunications Legal Healthcare Model evaluation Comparison against base models Module 4: Reusable Adapter Catalog Teach how to build reusable adapter libraries: Modular adapters Domain-specific adapters Adapter versioning Combining LoRA with RAG Multi-domain architecture Production deployment Desired Deliverables The selected expert will: Deliver live remote training sessions Prepare presentation materials Create hands-on labs Provide code examples Assist with architecture discussions Review participant exercises Answer technical questions Help build a production-quality Proof of Concept (POC) Required Skills We are looking for someone with strong experience in: LLMs Llama Mistral Qwen Gemma DeepSeek Open-source language models RAG LangChain LlamaIndex Haystack Hybrid Search Vector databases Embeddings Vector Databases Experience with one or more: Pinecone Weaviate Milvus Qdrant Fine-Tuning LoRA QLoRA PEFT Hugging Face Transformers TRL Unsloth (preferred) Axolotl (nice to have) ML Frameworks PyTorch Hugging Face Accelerate BitsAndBytes Deployment FastAPI vLLM TGI Docker Kubernetes (preferred) REST/gRPC APIs Evaluation Experience with: RAG evaluation BLEU ROUGE NDCG MRR Hallucination detection Benchmarking Nice to Have Enterprise AI architecture experience Production LLM deployment Multi-tenant AI platforms GPU optimization Model serving at scale Experience teaching engineers or delivering corporate training English fluency (Spanish is a strong plus) Engagement Remote Part-time / Contract Approximately 3–5 weeks Live sessions plus office hours Flexible scheduling To Apply Please include: A brief introduction about your experience. Examples of enterprise RAG or LLM projects you've built. Experience with LoRA/QLoRA fine-tuning. Experience teaching or mentoring technical teams. Links to GitHub, technical blog posts, publications, or conference talks (if available). Your proposed hourly rate and availability.
- Less than 30 hrs/weekHourly
- 1-3 monthsDuration
- ExpertExperience Level
$60.00
-
$500.00
Hourly- Remote Job
- Ongoing projectProject Type
Skills and Expertise
Activity on this job
- Proposals:50+
- Last viewed by client:2 hours ago
- Interviewing:0
- Invites sent:1
- Unanswered invites:1
About the client
- United StatesSan Francisco6:47 AM
- $13K total spent16 hires, 5 active
- 434 hours
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by