AI/LLM Expert & Instructor – Enterprise RAG, LoRA & Fine-Tuning Training (Remote)

Posted 3 hours ago

Worldwide

Summary

Project Overview We are looking for an experienced AI/LLM Engineer and Technical Trainer to deliver an advanced training program for our engineering team focused on Enterprise LLMs, Retrieval-Augmented Generation (RAG), LoRA, and Efficient Fine-Tuning. The goal is to transform our technical team into experts capable of designing, deploying, and maintaining enterprise AI solutions using open-source LLMs. This engagement combines technical instruction, architecture guidance, hands-on labs, and mentoring. Training Scope Block 1 – Enterprise LLM & RAG Fundamentals Duration: 1–2 weeks Module 1: Embeddings & Vector Databases Topics include: How embeddings represent documents as vectors Semantic search concepts Vector similarity search Vector database architectures Storage options and tradeoffs Pinecone Weaviate Milvus Qdrant Indexing strategies Chunking techniques Metadata filtering Module 2: Retrieval & Generation Evaluation Teach best practices for evaluating RAG systems, including: Retrieval metrics NDCG MRR Recall Precision Generation quality metrics BLEU ROUGE Hallucination detection RAG benchmarking Evaluation pipelines Ground-truth datasets Module 3: Build an Internal Enterprise Chatbot (POC) Guide the team through building a production-style proof of concept: Enterprise documentation chatbot Private knowledge base Source citation and traceability Document ingestion pipeline RAG architecture Deployment within existing infrastructure Module 4: RAG Integration Platform Explain how to orchestrate RAG pipelines using a centralized platform, including: Model orchestration Prompt pipelines Retrieval workflows API integration Enterprise architecture Observability Block 2 – LoRA & Efficient Fine-Tuning Duration: 2–3 weeks Module 1: LoRA & QLoRA Fundamentals Topics include: Parameter-efficient fine-tuning LoRA architecture QLoRA Adapter-based training Low-rank matrices GPU memory optimization VRAM reduction Cost comparison: LoRA vs Full Fine-Tuning Performance tradeoffs Module 2: Dataset Preparation Teach best practices for: Dataset collection Cleaning Labeling Validation Balancing Support conversations FAQs Domain-specific documentation Synthetic data generation Module 3: Domain-Specific Adapter Training Hands-on training covering: Training LoRA adapters Financial domain Telecommunications Legal Healthcare Model evaluation Comparison against base models Module 4: Reusable Adapter Catalog Teach how to build reusable adapter libraries: Modular adapters Domain-specific adapters Adapter versioning Combining LoRA with RAG Multi-domain architecture Production deployment Desired Deliverables The selected expert will: Deliver live remote training sessions Prepare presentation materials Create hands-on labs Provide code examples Assist with architecture discussions Review participant exercises Answer technical questions Help build a production-quality Proof of Concept (POC) Required Skills We are looking for someone with strong experience in: LLMs Llama Mistral Qwen Gemma DeepSeek Open-source language models RAG LangChain LlamaIndex Haystack Hybrid Search Vector databases Embeddings Vector Databases Experience with one or more: Pinecone Weaviate Milvus Qdrant Fine-Tuning LoRA QLoRA PEFT Hugging Face Transformers TRL Unsloth (preferred) Axolotl (nice to have) ML Frameworks PyTorch Hugging Face Accelerate BitsAndBytes Deployment FastAPI vLLM TGI Docker Kubernetes (preferred) REST/gRPC APIs Evaluation Experience with: RAG evaluation BLEU ROUGE NDCG MRR Hallucination detection Benchmarking Nice to Have Enterprise AI architecture experience Production LLM deployment Multi-tenant AI platforms GPU optimization Model serving at scale Experience teaching engineers or delivering corporate training English fluency (Spanish is a strong plus) Engagement Remote Part-time / Contract Approximately 3–5 weeks Live sessions plus office hours Flexible scheduling To Apply Please include: A brief introduction about your experience. Examples of enterprise RAG or LLM projects you've built. Experience with LoRA/QLoRA fine-tuning. Experience teaching or mentoring technical teams. Links to GitHub, technical blog posts, publications, or conference talks (if available). Your proposed hourly rate and availability.

Less than 30 hrs/week
Hourly
1-3 months
Duration
Expert
Experience Level
$60.00
-
$500.00
Hourly
Remote Job
Ongoing project
Project Type

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Learn more

Skills and Expertise

Mandatory skills

Python

Machine Learning

Activity on this job

Proposals:50+
Last viewed by client:2 hours ago
Interviewing:
0
Invites sent:
1
Unanswered invites:
1

About the client

Member since Apr 30, 2024

United States
San Francisco6:47 AM
$13K total spent
16 hires, 5 active
434 hours

Explore similar jobs on Upwork

Quantum Computing Consultant – High-Dimensional Combinatorial Opt…Hourly‐ Posted 3 weeks ago

Quantum Computing

Data Scientist (Mid-to-Senior) — Machine Learning & Predictive An…Hourly‐ Posted 4 weeks ago

Predictive Model

SQL

pandas

Data Science

Python

Machine Learning

Python Scikit-Learn

Deep Learning

Predictive Analytics

Data Analysis

How it works

Create your free profile
Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you want
Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securely
From contract to payment, we help you work safely and get paid securely.