You will get production-ready LLM inference server optimized for speed

Name: You will get production-ready LLM inference server optimized for speed
Availability: InStock

Christ Herve O.

4.6

Christ Herve O.

4.6

Project details

I specialise in deploying and optimising LLM inference servers for AI startups using vLLM and SGLang. Whether you need a quick deployment or a full production-grade inference stack, I handle the infrastructure, so your team can focus on building the product. I deploy on any cloud or serverless GPU platform, including Runpod, Modal, AWS and GCP.

AI Algorithms

Large Language Model, Transformer Model

AI Applications

AI Chatbot, AI Text-to-Speech

AI Development Language

Python

AI Tools

Hugging Face, PyTorch, TensorFlow

AI Models

LLaMA, Whisper

What's included

Service Tiers	Starter $150	Standard $300	Advanced $500
Delivery Time	3 days	5 days	10 days
Number of Revisions	1	2	3
AI Model Integration	-	-	-
Batch Normalization	-	-	-
Database Integration	-	-	-
Detailed Code Comments	-	-	-
Image Upscaling	-	-	-
MLOps	-
Model Deployment
Model Documentation	-	-	-
Model Monitoring	-
Model Testing & Optimization	-
Model Tuning	-	-	-
Natural Language Processing	-	-	-
NLP Tokenization	-	-	-
Pre-Training	-	-	-
Prompt Engineering	-	-	-
Setup File
Source Code

Optional add-ons You can add these on the next page.

Fast Delivery

+$50 - $200

Additional Revision

+$50

LoRA Adapter Integration (+ 1 Day)

+$100

Serverless GPU Deployment (+ 1 Day)

+$150

Speculative Decoding Setup (+ 1 Day)

+$100

Frequently asked questions

4.6

10 reviews

90% Complete

(9)

1% Complete

(0)

1% Complete

(0)

10% Complete

(1)

1% Complete

(0)

FastAPI Developer – Build AI MCP Gateway with Routing Logic & Fallback (Docker, AI APIs) Working with Christ on the FastAPI Developer – Build AI MCP Gateway with Routing Logic & Fallback (Docker, AI APIs) project was a productive and professional experience.

He delivered a solid implementation of the gateway module, showing strong command of FastAPI, routing logic, and Dockerized API structures. The code was organized, functional, and integrated well with the existing architecture.

Communication was consistent, and Christ responded promptly to feedback, implementing the required updates clearly and efficiently. His structured approach and technical understanding helped maintain a smooth workflow throughout the project.

Overall, a dependable and skilled developer who met all key deliverables with good quality and attention to detail. Recommended for complex API and backend automation projects.

Runpod Serverless Instance setup

Runpod Serverless Instance Setup Excellent communication - delivered exactly what was asked of him. I am about to give a second piece of work.

AI consulting and programming It is with great sadness that I end this contract and provide this feedback. Christ was great during the first three months of the contract. We made great progress, all was going very well. But, suddenly, he became unresponsive and blamed it on personal issues. I was understanding and gave him time, but week after week he promised to get work done and failed. The systems that were developed and once worked well stopped working (model hallucinating) and he was unable to fix them. I asked for documentation of his work, the delivery was poor. I prepared a set of questions related to the systems, few were poorly answered, most simply ignored. He ignored my questions and requests and worked whenever he wanted on whatever he wanted, no explanations given.
I don't understand what happened to him. It almost feels like he is a different person now.
My company spent five months and a good amount of money in this project and we are leaving mostly empty-handed. Without the systems working, a hallucinating model, and almost no documentation, we are back to square zero. More than money, we lost time to market.
This project was a big loss for us and I can't recommend Christ to anyone. I wish we could recuperate, at least, some of the work done.
I hope he finds his way back to a responsible, responsive and capable professional he was for a while. But, the way he is now, I'd not hire him again.

Python Developer Needed for Google Sheets Integration using ChatGPT Christ did an excellent job

About Christ Herve

Machine Learning/Software Engineer

81% Job Success

4.6 (10 reviews)

Yaounde, Cameroon - 1:47 pm local time

👋 I'm a Machine Learning/Software Engineer with 3 years in AI and 5 years in computer engineering, specializing in production-grade Python and Rust. I build end-to-end ML solutions—from fine-tuning transformers like BERT and T5 to deploying models with vLLM and Docker, crafting scalable data pipelines with Prefect, and integrating cutting-edge LLMs (OpenAI, Claude, Gemini). My expertise spans FastAPI-powered ML APIs, and workflow automation, all backed by clean, test-driven code and modern DevOps practices. Whether you need intelligent automation, model serving at scale, or robust API solutions, I deliver scalable systems that actually work. Ready to bring AI innovation to your project? Let's build something exceptional together! 🚀

Steps for completing your project

After purchasing the project, send requirements so Christ Herve can start the project.

Delivery time starts when Christ Herve receives requirements from you.

Christ Herve works on your project following the steps below.

Revisions may occur after the delivery date.

Requirement Analysis

Review your model, GPU specs, platform choice, and performance targets to plan the optimal setup.

Server Deployment

Deploy vLLM or SGLang on your chosen platform with proper configuration.

Review the work, release payment, and leave feedback to Christ Herve.

Select service tier

Starter$150

Standard$300

Advanced$500

Quick LLM Deploy

vLLM/SGLang server deployed and ready to serve.

Delivery Time 3 days
Number of Revisions 1
- Model Deployment
- Setup File
- Source Code

3 days delivery — Jul 3, 2026

Revisions may occur after this date.

Upwork Payment Protection

Fund the project upfront. Christ Herve gets paid once you are satisfied with the work.

You will get production-ready LLM inference server optimized for speed

Let a pro handle the details

Let a pro handle the details

Project details

AI Algorithms

AI Applications

AI Development Language

AI Tools

AI Models

What's included

Frequently asked questions

MS

AD

AD

AC

WD

About Christ Herve

Machine Learning/Software Engineer

Steps for completing your project

After purchasing the project, send requirements so Christ Herve can start the project.

Christ Herve works on your project following the steps below.

Requirement Analysis

Server Deployment

Review the work, release payment, and leave feedback to Christ Herve.

Select service tier

Quick LLM Deploy

You will get production-ready LLM inference server optimized for speed

Let a pro handle the details

Let a pro handle the details

Project details

AI Algorithms

AI Applications

AI Development Language

AI Tools

AI Models

What's included

Frequently asked questions

MS

AD

AD

AC

WD

About Christ Herve

Machine Learning/Software Engineer

Steps for completing your project

After purchasing the project, send requirements so Christ Herve can start the project.

Christ Herve works on your project following the steps below.

Requirement Analysis

Server Deployment

Review the work, release payment, and leave feedback to Christ Herve.

Select service tier

Quick LLM Deploy

Optional add-ons (5)