You will get an Audit why the RAG / Agentic pipeline is not performing (Senior Expert)
Top Rated

Top Rated

Project details
You get an audit from a senior engineer (Ph.D.) with 20 years of production experience and 10 years in semantic search/RAG, focused on root causes, measurable quality, and practical fixes, not generic embeddings, models or prompt engineering advice.
• holistic view of the system failures: search, ranking, evaluation, and LLM behavior together.
• I look for root causes, not surface level tweaks. Focus on measurable retrieval quality, grounding, latency, and cost.
• Identify high‑friction, high‑value scenarios, why earlier attempts failed
• I bring engineering judgment from 20 years of building production systems.
• I've helped create dozens of practical search systems in production, across many domains: e-commerce, FMCG, telecom, legal, auto.
• I can communicate findings clearly to both technical and non-technical teams.
• Top rated, Expert-vetted on Upwork. Written and coached extensively on building search systems.
Feel free to reach out first if you have any custom requirements.
• holistic view of the system failures: search, ranking, evaluation, and LLM behavior together.
• I look for root causes, not surface level tweaks. Focus on measurable retrieval quality, grounding, latency, and cost.
• Identify high‑friction, high‑value scenarios, why earlier attempts failed
• I bring engineering judgment from 20 years of building production systems.
• I've helped create dozens of practical search systems in production, across many domains: e-commerce, FMCG, telecom, legal, auto.
• I can communicate findings clearly to both technical and non-technical teams.
• Top rated, Expert-vetted on Upwork. Written and coached extensively on building search systems.
Feel free to reach out first if you have any custom requirements.
AI Algorithms
Large Language Model, Transformer ModelAI Applications
AI Chatbot, AI Content Creation, AI-Enhanced Classification, AI-Enhanced Medical Imaging, AI-Generated Code, AIOps, Anomaly Detection, Automatic Speech Recognition, Conversational AI, Natural Language Generation, Natural Language UnderstandingAI Development Language
PythonAI Tools
GitHub Copilot, Hugging Face, NVIDIA AI Platform, PyTorchAI Models
BERT, ChatGPT, GPT-4, LLaMA, OpenAI Codex, WhisperWhat's included $750
These options are included with the project scope.
$750
- Delivery Time 4 days
- Number of Revisions 0
Optional add-ons
You can add these on the next page.
Followup
(+ 10 Days)
+$4,500Frequently asked questions
49 reviews
(48)
(1)
(0)
(0)
(0)
This project doesn't have any reviews.
MM
MANOJ M.
May 15, 2026
AI ML RAG Architecture eCommerce
D
Gaetan D.
Dec 10, 2025
Applied ML Framework Engineer (PyTorch Backend · Secure Computation Prototype)
Nishant was careful and methodical in his approach, taking the time to understand the requirements and implementing/testing the cryptographic components with solid attention to detail. Communication was clear and timely throughout the project, and I’d be happy to work with him again on future MPC/ML work.
ZH
Zitai H.
Jul 7, 2025
60 minute consultation
BM
Brendan M.
Jun 27, 2025
60 minute consultation
GS
Galen S.
Apr 8, 2025
60 minute consultation
Nishant was a great resource to get my team started in thinking about the design of our AI platform
About Nishant
Senior AI Architect | Agents, Search, RAG, LLM Workflows
100%
Job Success
Bangalore, India - 10:17 pm local time
Upwork: 10+ years | 70+ Projects | $500K+ delivered | Production ML across 10+ industries | Top-Rated Plus
Focus areas:
✨ architecting reliable Multi-Agent workflows
✨ making internal knowledge base more accessible
✨ automating repetitive work
Recent projects involve:
✨ Agentic search and RAG Agents across company data.
✨ Orchestrate reliable LLM Agent workflows: Claude, Codex, Skills, Langchain, Memory, Worktrees, state graphs.
✨ Migrate from costly cloud-based Voice platforms (Vapi, Retell, Bland) to cost-effective on-premise alternatives (Livekit, Pipecat). Optimize Latency and accuracy of ASR + LLMs + TTS pipelines.
✨ Building speech recognition for custom domains - Whisper, pyannote diarization, noisy multi-speakers. Whisper - domain-specific finetune, real-time, speech-text alignment, diarization.
✨ Training deep learning custom models.
✨ Mentoring teams, taking products to market
✨ Coaching: Multi-Agent systems, RAG, LLMs, Voice Agents. Claude, Codex.
Many companies treat building workflow Agents as a casual week-long project. This only leads to wasted time and resources. My article titled - Why your GPT + Vector Search demo won't make it to production - (linked in my portfolio) discusses a disciplined approach to building search agents.
--- Background --
I finished my Ph.D. from Carnegie Mellon and worked at IBM Research, building AI solutions and distributed systems tools. For almost a decade, I have helped many clients through the entire Machine Learning lifecycle, building wide variety of deep learning models for text, vision and speech problems. My projects include a mixture of creative, algorithmic problem solving and building across the tech stack.
I bring a very unique combination of skills to the table:
↪ I’m a senior AI engineer: Strategic problem solving with cutting edge AI tech.
- Reliable Multi-Agent Systems (Codex, Claude CLI/APIs, Skills, Memory)
- Voice Agents (Pipecat, Livekit, Vapi)
- LLM Agents (OpenAI, Claude, Gemini, DSPy, prompt engineering, LLM as judge)
- DeepResearch/RAG: Qdrant, ElasticSearch, Vespa, Pinecone, Weaviate
- model fine-tuning/inference, LoRA, RLHF (Transformers, GPT, ViT, ASR, CLIP),
- compression/optimization (Pytorch, TensorRT, Deepspeed, ONNX)
- cloud: Docker/AWS
- programming Stack: Python, Pytorch, FastAPI.
↪ As an Architect - I go from ambiguous specs to a clean, modular design of a complex system. Plan a system as a whole before optimizing the parts. Keep design simple. Mentor team.
↪ As a Researcher - I can explain and distill out consumable stuff from the most math dense, cutting-edge academic text. Published internationally over 20 years. Solving complex, multi-disciplinary problems is my core strength.
↪ As a Product Entrepreneur - I deeply care about the product strategy: who buys it, who consumes/uses it. I hustle to bridge the gap between cutting-edge tech and impactful products. I spot, lead and mentor deep tech talents.
--- Testimonials --
Please have a look at my portfolio to learn more about my past engagements and feedback.
🏆 Expert-Vetted, Top 1%
"Nishant is extremely knowledgeable about NLP and LLM incorporation into a project. He is very thoughtful and considered his responses carefully when answering my questions. He provided valuable insight and advice which will save significant time, effort and expense. His advanced expertise and wide-ranging experience, including academic publications as well as business consulting give him exceptional capability. I recommend Nishant highly. "
"Nishant is incredible to work with. He is a fount of machine learning wisdom and knowhow. His involvement with our project was instrumental in taking it to new heights."
Steps for completing your project
After purchasing the project, send requirements so Nishant can start the project.
Delivery time starts when Nishant receives requirements from you.
Nishant works on your project following the steps below.
Revisions may occur after the delivery date.
Analyze and Diagnose your RAG/Agentic search pipeline
- Analyze your pipeline - Diagnose issues - Provide recommendations and feedback
