AI Proxy Layer Development

Posted 4 days ago

Worldwide

Summary

I’m building a proxy layer that sits in front of AI agents and cuts their inference cost 40–70% — automatically, without the agent changing its logic. The project involves complex technical tasks, requiring experience in AI and cost optimization. The ideal candidate will have a strong understanding of AI systems and be able to develop efficient solutions. This is NOT an agent-building job. I’m not looking for agent workflows, memory stores, or MCP orchestration. I need the layer underneath that: a high-throughput proxy that normalizes traffic across providers, caches intelligently, routes each call to the cheapest model that holds quality, and proves it. The deliverable I care about most: take one real agentic workload, instrument its current cost, apply the optimization stack, and produce a measured before/after — “cut compute N% with quality held, here’s the eval.” That number is the goal, not a polished UI. You’re a fit if you’ve worked on / contributed to: • vLLM, SGLang, LiteLLM, Portkey, or similar serving/gateway infrastructure • LLM serving internals: KV/prefix caching, continuous batching, quantization, model routing • Provider API normalization and tool-call handling across models • Eval design for LLM output quality You’re not a fit if your background is agent frameworks, RAG apps, or “I use the OpenAI API.” This role is about what happens below the API call. To apply — required, or I won’t read it: 1. Link a specific piece of your work (GitHub PR, repo, project) that shows serving/gateway/caching/routing work. Not a portfolio site — the actual code. 2. In two sentences: how would you handle routing a tool-calling request from an OpenAI-shaped agent to an open-weight model whose tool-call format differs? 3. Skip the generic intro. Lead with #1 and #2.

  • Less than 30 hrs/week
    Hourly
  • 1-3 months
    Duration
  • Intermediate
    Experience Level
  • Remote Job
  • Ongoing project
    Project Type
Skills and Expertise
Mandatory skills
AI Agent Development
Deep Learning
Activity on this job
  • Proposals:20 to 50
  • Last viewed by client:2 days ago
  • Interviewing:
    5
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Sep 11, 2025
  • USA
    Franklin Square3:27 PM
  • $2.8K total spent
    3 hires, 3 active

Explore similar jobs on Upwork

AI Agent Development
AI Implementation
Chatbot Development
Gen AI Developer (Contract)Fixed-price‐ Posted 1 month ago
AI Agent Development
Python
JavaScript
API
Node.js
Deep Learning
React
PostgreSQL

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo