AI Proxy Layer Development
Worldwide
I’m building a proxy layer that sits in front of AI agents and cuts their inference cost 40–70%, automatically, without the agent changing its logic. The project involves complex technical tasks, requiring experience in AI and cost optimization. The ideal candidate will have a strong understanding of AI systems and be able to develop efficient solutions.

This is NOT an agent-building job. I’m not looking for agent workflows, memory stores, or MCP orchestration. I need the layer underneath that: a high-throughput proxy that normalizes traffic across providers, caches intelligently, routes each call to the cheapest model that holds quality, and proves it.

The deliverable I care about most: take one real agentic workload, instrument its current cost, apply the optimization stack, and produce a measured before/after (“cut compute N% with quality held, here’s the eval”). That number is the goal, not a polished UI.

You’re a fit if you’ve worked on or contributed to:
• vLLM, SGLang, LiteLLM, Portkey, or similar serving/gateway infrastructure
• LLM serving internals: KV/prefix caching, continuous batching, quantization, model routing
• Provider API normalization and tool-call handling across models
• Eval design for LLM output quality

You’re not a fit if your background is agent frameworks, RAG apps, or “I use the OpenAI API.” This role is about what happens below the API call.

To apply (required, or I won’t read it):
1. Link a specific piece of your work (GitHub PR, repo, project) that shows serving/gateway/caching/routing work. Not a portfolio site; the actual code.
2. In two sentences: how would you handle routing a tool-calling request from an OpenAI-shaped agent to an open-weight model whose tool-call format differs?
3. Skip the generic intro. Lead with #1 and #2.
- Hourly: Less than 30 hrs/week
- Duration: 1-3 months
- Experience Level: Intermediate
- Remote Job
- Project Type: Ongoing project
Activity on this job
- Proposals: 20 to 50
- Last viewed by client: 2 days ago
- Interviewing: 5
- Invites sent: 0
- Unanswered invites: 0
About the client
- USA, Franklin Square, 3:27 PM
- $2.8K total spent; 3 hires, 3 active