AI/LLM Expert & Instructor – Enterprise RAG, LoRA & Fine-Tuning Training (Remote)

Posted 3 hours ago

Worldwide

Summary

Project Overview

We are looking for an experienced AI/LLM Engineer and Technical Trainer to deliver an advanced training program for our engineering team focused on Enterprise LLMs, Retrieval-Augmented Generation (RAG), LoRA, and Efficient Fine-Tuning. The goal is to transform our technical team into experts capable of designing, deploying, and maintaining enterprise AI solutions using open-source LLMs. This engagement combines technical instruction, architecture guidance, hands-on labs, and mentoring.

Training Scope

Block 1 – Enterprise LLM & RAG Fundamentals
Duration: 1–2 weeks

Module 1: Embeddings & Vector Databases
Topics include:
  • How embeddings represent documents as vectors
  • Semantic search concepts
  • Vector similarity search
  • Vector database architectures
  • Storage options and tradeoffs: Pinecone, Weaviate, Milvus, Qdrant
  • Indexing strategies
  • Chunking techniques
  • Metadata filtering

Module 2: Retrieval & Generation Evaluation
Teach best practices for evaluating RAG systems, including:
  • Retrieval metrics: NDCG, MRR, Recall, Precision
  • Generation quality metrics: BLEU, ROUGE
  • Hallucination detection
  • RAG benchmarking
  • Evaluation pipelines
  • Ground-truth datasets

Module 3: Build an Internal Enterprise Chatbot (POC)
Guide the team through building a production-style proof of concept:
  • Enterprise documentation chatbot
  • Private knowledge base
  • Source citation and traceability
  • Document ingestion pipeline
  • RAG architecture
  • Deployment within existing infrastructure

Module 4: RAG Integration Platform
Explain how to orchestrate RAG pipelines using a centralized platform, including:
  • Model orchestration
  • Prompt pipelines
  • Retrieval workflows
  • API integration
  • Enterprise architecture
  • Observability

Block 2 – LoRA & Efficient Fine-Tuning
Duration: 2–3 weeks

Module 1: LoRA & QLoRA Fundamentals
Topics include:
  • Parameter-efficient fine-tuning
  • LoRA architecture
  • QLoRA
  • Adapter-based training
  • Low-rank matrices
  • GPU memory optimization
  • VRAM reduction
  • Cost comparison: LoRA vs Full Fine-Tuning
  • Performance tradeoffs

Module 2: Dataset Preparation
Teach best practices for:
  • Dataset collection
  • Cleaning
  • Labeling
  • Validation
  • Balancing
  • Support conversations
  • FAQs
  • Domain-specific documentation
  • Synthetic data generation

Module 3: Domain-Specific Adapter Training
Hands-on training covering:
  • Training LoRA adapters: Financial domain, Telecommunications, Legal, Healthcare
  • Model evaluation
  • Comparison against base models

Module 4: Reusable Adapter Catalog
Teach how to build reusable adapter libraries:
  • Modular adapters
  • Domain-specific adapters
  • Adapter versioning
  • Combining LoRA with RAG
  • Multi-domain architecture
  • Production deployment

Desired Deliverables

The selected expert will:
  • Deliver live remote training sessions
  • Prepare presentation materials
  • Create hands-on labs
  • Provide code examples
  • Assist with architecture discussions
  • Review participant exercises
  • Answer technical questions
  • Help build a production-quality Proof of Concept (POC)

Required Skills

We are looking for someone with strong experience in:

LLMs
  • Llama
  • Mistral
  • Qwen
  • Gemma
  • DeepSeek
  • Open-source language models

RAG
  • LangChain
  • LlamaIndex
  • Haystack
  • Hybrid Search
  • Vector databases
  • Embeddings

Vector Databases
Experience with one or more:
  • Pinecone
  • Weaviate
  • Milvus
  • Qdrant

Fine-Tuning
  • LoRA
  • QLoRA
  • PEFT
  • Hugging Face Transformers
  • TRL
  • Unsloth (preferred)
  • Axolotl (nice to have)

ML Frameworks
  • PyTorch
  • Hugging Face Accelerate
  • BitsAndBytes

Deployment
  • FastAPI
  • vLLM
  • TGI
  • Docker
  • Kubernetes (preferred)
  • REST/gRPC APIs

Evaluation
Experience with:
  • RAG evaluation
  • BLEU
  • ROUGE
  • NDCG
  • MRR
  • Hallucination detection
  • Benchmarking

Nice to Have
  • Enterprise AI architecture experience
  • Production LLM deployment
  • Multi-tenant AI platforms
  • GPU optimization
  • Model serving at scale
  • Experience teaching engineers or delivering corporate training
  • English fluency (Spanish is a strong plus)

Engagement
  • Remote
  • Part-time / Contract
  • Approximately 3–5 weeks
  • Live sessions plus office hours
  • Flexible scheduling

To Apply

Please include:
  • A brief introduction about your experience.
  • Examples of enterprise RAG or LLM projects you've built.
  • Experience with LoRA/QLoRA fine-tuning.
  • Experience teaching or mentoring technical teams.
  • Links to GitHub, technical blog posts, publications, or conference talks (if available).
  • Your proposed hourly rate and availability.

  • Less than 30 hrs/week
    Hourly
  • 1-3 months
    Duration
  • Expert
    Experience Level
  • $60.00 - $500.00
    Hourly
  • Remote Job
  • Ongoing project
    Project Type

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Skills and Expertise
Mandatory skills
Python
Machine Learning
Activity on this job
  • Proposals: 50+
  • Last viewed by client: 2 hours ago
  • Interviewing: 0
  • Invites sent: 1
  • Unanswered invites: 1
About the client
Member since Apr 30, 2024
  • United States
    San Francisco, 6:47 AM
  • $13K total spent
    16 hires, 5 active
  • 434 hours

