Search Freelance Jobs on Upwork

Posted 4 weeks ago

Build Python orchestrator with Streamlit UI for parallel multi-LLM querying

Fixed price
Intermediate
Est. budget: $4,000.00

We need a senior Python developer to build a custom internal research tool for our healthcare AI consultancy. The tool runs 75 standardized queries across six AI platforms in parallel, captures structured responses and citations, and writes results to an Excel template for analysis.

What you're building

A Python application with three components:

- Six API adapters: OpenAI (ChatGPT), Anthropic (Claude), Perplexity, Google (Gemini), xAI (Grok), and SerpAPI (for Google AI Overviews). All hit the official APIs and return a normalized response object.
- Async orchestrator that fires all 75 queries in parallel across the six platforms, handles retries and rate limits, tags client domain and competitor mentions in real time, and writes results to a provided Excel template.
- Streamlit UI for click-to-run operation, with engagement metadata input, live progress monitoring, results preview, plus Google Drive upload and Slack notification integrations on completion.

Deliverables

- Six platform adapter modules with normalized response interface
- Async orchestrator with error handling and retry logic
- Excel writer (we provide the template)
- Streamlit dashboard
- Google Drive and Slack integrations
- README, setup guide, and operator documentation
- Test suite covering 70%+ of code paths
- All code in a GitHub private repo owned by us

What we provide at kickoff

- Detailed architecture document and UI specification
- AHS Excel scan template, production-ready
- Full 75-query taxonomy
- All six API keys, provisioned and shared via 1Password
- Sample completed scans for reference
- Direct Slack access to the project lead

Timeline and process

- 10 business days from kickoff to delivery
- Fixed price, paid in three milestones (30% / 40% / 30%)
- $500 bonus if delivered on or before business day 8
- Daily commits to the repo required
- Weekly Friday sync at 10am ET
- Mutual NDA signed before access to query taxonomy

Required skills

- 5+ years Python with strong asyncio experience
- Hands-on experience with at least 3 of: OpenAI API, Anthropic API, Perplexity Sonar API, Google Gemini API, xAI Grok API, SerpAPI
- openpyxl for Excel manipulation
- Streamlit for UI
- Google Cloud OAuth for Drive integration
- Slack API for notifications
- Strong Git/GitHub workflow

Out of scope

Query design, AI scoring synthesis, deck generation, mobile, and multi-user authentication. This is one focused tool with a clear endpoint.

Please answer in your proposal

- Walk us through a recent Python project that integrated 3+ third-party APIs. What broke and how did you fix it?
- Which of the six APIs in scope have you used in production?
- What's your approach to rate limits and transient failures across async API calls?
- What scope questions do you have before bidding?

Not a fit if

You've never integrated more than two LLM APIs, you're uncomfortable with fixed-price contracts, or you can't commit to daily commits on a 10-business-day timeline.
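For readers sizing up the orchestrator requirement in this posting, the following is a minimal sketch of one way to fan queries out across platforms with a per-platform concurrency cap and exponential backoff. It is not the client's architecture: `NormalizedResponse`, `TransientError`, `call_with_retry`, `run_scan`, `tag_mentions`, and the fake adapter are all hypothetical names invented for illustration, and no real provider SDK is called.

```python
import asyncio
import random
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class NormalizedResponse:
    """Hypothetical common shape that every platform adapter would return."""
    platform: str
    query: str
    text: str
    citations: List[str] = field(default_factory=list)
    error: Optional[str] = None


class TransientError(Exception):
    """Stand-in for a rate-limit (429) or server-side (5xx) failure."""


async def call_with_retry(adapter, platform, query, sem,
                          max_attempts=4, base_delay=0.5):
    """Call one adapter under a concurrency cap, retrying transient errors."""
    async with sem:  # the slot is held during backoff, which also eases pressure
        for attempt in range(1, max_attempts + 1):
            try:
                return await adapter(query)
            except TransientError as exc:
                if attempt == max_attempts:
                    # Give up and record the failure instead of raising.
                    return NormalizedResponse(platform, query, "", error=str(exc))
                # Exponential backoff with a little jitter.
                delay = base_delay * 2 ** (attempt - 1)
                await asyncio.sleep(delay + random.uniform(0, delay / 2))


async def run_scan(adapters, queries, per_platform_limit=5, base_delay=0.5):
    """Fire every query at every platform; results come back in task order."""
    sems = {name: asyncio.Semaphore(per_platform_limit) for name in adapters}
    tasks = [
        call_with_retry(fn, name, q, sems[name], base_delay=base_delay)
        for name, fn in adapters.items()
        for q in queries
    ]
    return await asyncio.gather(*tasks)


def tag_mentions(text, domains):
    """Return the domains that appear (case-insensitively) in the text."""
    lowered = text.lower()
    return [d for d in domains if d.lower() in lowered]


def make_fake_adapter(platform):
    """Illustrative adapter: fails once per query, then succeeds."""
    seen = set()

    async def adapter(query):
        if query not in seen:
            seen.add(query)
            raise TransientError("rate limited")
        return NormalizedResponse(platform, query,
                                  f"{platform} answer citing example.com")

    return adapter


if __name__ == "__main__":
    fakes = {"platform_a": make_fake_adapter("platform_a"),
             "platform_b": make_fake_adapter("platform_b")}
    for r in asyncio.run(run_scan(fakes, ["q1", "q2"], base_delay=0.01)):
        print(r.platform, r.query, r.error, tag_mentions(r.text, ["example.com"]))
```

Returning a `NormalizedResponse` with `error` set, rather than raising, keeps one failed call from cancelling the rest of the batch; a real build would also need provider-specific handling of rate-limit headers.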

Posted 2 weeks ago

AI Agent Infrastructure Consultant — Parallel Agents Optimization (Python/Node)

Fixed price
Expert
Est. budget: $150.00

**Overview**

We are a fast-growing SaaS company with a lean engineering team (~10 devs) utilizing a modern Python (FastAPI/Django) and Node.js backend, React frontend, and PostgreSQL stack. We have already deployed an initial multi-model agent stack—Claude Code, LiteLLM gateway, Git worktrees, and MCP integrations. We need an expert to run an intensive architecture review and optimization session for our current infrastructure.

We are not looking for someone to build a full-time, weeks-long project from scratch. Instead, we need a seasoned engineer who has shipped this exact type of infrastructure end-to-end to audit our setup, identify architectural gaps, and guide our team on hardened implementation.

This project must move fast. If your timeline is measured in weeks, please do not apply. We want someone who looks at this scope, jumps into a review session, and delivers actionable architectural guidance in days.

This starts as a focused, urgent consultation. However, we expect ongoing advisory work—follow-ups, architecture adjustments, and enhancement reviews—as the AI tooling landscape shifts. For the right engineer, this will turn into a recurring relationship. We are completely open to a fixed price per milestone or an hourly structure.

**What You Need to Have Actually Shipped and Can Review (Not Just Read About)**

* **Full Agentic Coding Harnesses:** The entire loop: orchestrator → subagent → CI gate → merge loops.
* **Isolation Layers:** Configured execution layers (such as E2B, Modal, or secure Docker runtimes) as isolated sandboxes for AI-generated code.
* **Parallel Claude Code Sessions:** Managed multiple simultaneous subagents on scoped tasks via Git worktrees.
* **Self-Hosted LiteLLM Gateways:** Routing to multiple models (Claude, GPT, Gemini, DeepSeek).
* **MCP Server Infrastructure:** Connected file system, PostgreSQL, Atlassian, and Slack tool layers for active agents.
* **Agent Framework Structures:** Used CLAUDE.md, COMMON\_MISTAKES.md, subagent role definitions, hook scripts, and settings.json.
* **Human-in-the-Loop Orchestration:** Built Plan Mode or equivalent approval gates before agent execution.
* **Multi-Agent Frameworks:** 7-agent feature factory patterns or frameworks like LangGraph, CrewAI, or Autogen.
* **Durable Workflow Engines:** Applied Temporal, n8n, or similar tools for long-running agent workflow execution.
* **Mechanical Quality Gates:** Treating CI green as the ultimate gate for agent output quality. \[[1](https://manveerc.substack.com/p/ai-agent-sandboxing-guide)\]

**Our Current Stack (What you are reviewing)**

* **Backend:** Python (FastAPI / Django) & Node.js (TypeScript)
* **Frontend:** React (Next.js)
* **Database & ORM:** PostgreSQL / Prisma / SQLAlchemy
* **Infrastructure:** Docker Compose, AWS (ECS/EKS)
* **CI/CD:** GitHub Actions / GitLab CI
* **AI Layer:** Claude Code with shared `.claude/` directory, CLAUDE.md, and LiteLLM gateway in Docker
* **MCP:** Atlassian (Jira/Confluence), custom PostgreSQL MCP server, Slack
* **Workflow Automation:** Temporal / n8n
* **QA Automation:** Playwright / Autonoma

**Scope of Work (Review & Advisory Only)**

1. **Comprehensive Audit:** Audit our current agent harness and identify architectural gaps against a production-grade standard.
2. **Sandbox Strategy Consultation:** Review our environment strategy to ensure highly secure, isolated execution runtimes for agent code runs.
3. **Workflow Hardening Review:** Evaluate our parallel agent workflow setup (Git worktrees, subagent role configs, hook scripts, and settings lockdown).
4. **CI Pipeline Integration Strategy:** Advise on wiring our sandbox execution layer into the existing CI pipeline so agent-executed code runs in clean snapshots, not live infra.
5. **Architectural Runbook:** Deliver an optimization report / documented standard that our backend lead can easily own and execute going forward.

**How to Apply**

Skip the generic pitch. Show us something real to be considered:

1. A GitHub repo, architecture diagram, or Loom walkthrough of an agentic harness you have actually shipped.
2. Specific tools from our stack you have personally configured (E2B, LiteLLM, Claude Code, etc.).
3. One sentence explaining the hardest problem you solved to get full agent loops running reliably.
4. Your availability to conduct this high-impact architectural review session this week.
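The posting's "parallel subagents via Git worktrees" setup can be pictured with a small planning helper. This is only a sketch under assumptions: the `agent/<task>` branch naming, the sibling-directory layout, and the function name `plan_worktrees` are hypothetical and not taken from the client's configuration. It builds `git worktree add` command lines without running them, so each scoped task gets its own checkout and branch.

```python
from pathlib import PurePosixPath


def plan_worktrees(repo_root, tasks, base_ref="main"):
    """Build one `git worktree add` argv per task (nothing is executed).

    Assumed layout: each worktree is a sibling directory of the main repo,
    named <repo>-<task>, on a new branch agent/<task> created from base_ref.
    """
    if len(set(tasks)) != len(tasks):
        raise ValueError("task names must be unique, one worktree per task")

    root = PurePosixPath(repo_root)
    commands = []
    for task in tasks:
        worktree_path = root.parent / f"{root.name}-{task}"
        commands.append([
            "git", "-C", str(root),
            "worktree", "add",
            "-b", f"agent/{task}",   # new branch for this subagent
            str(worktree_path),      # separate working directory
            base_ref,                # starting commit
        ])
    return commands


if __name__ == "__main__":
    for argv in plan_worktrees("/srv/app", ["auth-fix", "billing-tests"]):
        print(" ".join(argv))
```

Keeping the planner separate from execution makes it easy to review or log the commands before handing them to `subprocess.run`, which matters when an approval gate sits in front of agent actions.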

Posted last month

Python CLI Engineer — Open Source AI Agent Integration (MCP/LangGraph/CrewAI)

Hourly: $70.00 - $85.00
Expert
Est. time: 1 to 3 months, Less than 30 hrs/week

Overview

We're building an open-source CLI gateway for multi-agent AI orchestration — model-agnostic, MCP-native, and designed to bring any agent framework online with a single command. The repo is active, well-documented, and growing. We need an engineer to accelerate integration coverage and help attract open-source contributors.

The Work

- Build agent templates and runnable examples for LangGraph, CrewAI, and similar frameworks
- Add LLM provider support (Groq, Mistral, Gemini, etc.) to the Hermes runtime
- Write clean, contributor-friendly code that models good PR hygiene
- Submit work via fork → PR → merge workflow on GitHub

You Are

- Strong Python developer with CLI tooling experience
- Familiar with at least one of: LangGraph, CrewAI, LiteLLM, LangChain
- Comfortable with open source GitHub workflows (fork, PR, issues, reviews)
- Self-directed — you read docs, ask good questions, and don't wait to be unblocked

Nice to Have

- Experience with MCP (Model Context Protocol)
- Familiarity with SSE, OAuth 2.1, or agent credential management
- Prior open source contributions

Engagement

- Part-time to start, 20 hrs/week
- Fixed milestones per integration delivered
- Potential to grow with the project

To Apply

Share your GitHub profile and one example of open source work or a project that shows your Python and agent framework experience.

https://github.com/ax-platform/ax-gateway

Posted 3 weeks ago

Senior Software engineer with deep knowledge of AI/LLM integrations across multiple cloud platforms

Hourly: $70.00 - $85.00
Expert
Est. time: More than 6 months, 30+ hrs/week

I need an expert senior software engineer who can provide consulting services on best practices for integrating LLMs and AI into existing application workflows, e.g. using AI to extract data from a document as part of an ingestion pipeline.

Posted 3 weeks ago

Seeking Experienced Trainer – Agentic AI, RAG Systems, and LLM Applications – Remote – $1,100/day

Fixed price
Expert
Est. budget: $1,100.00

NobleProg is seeking an experienced AI Trainer to deliver a live, instructor-led remote training focused on helping technical professionals integrate Agentic AI and RAG systems into their existing workflows. This opportunity is designed for participants with strong technical backgrounds (Data Engineering and Workflow Automation) but limited formal AI experience, with the goal of applying AI to real-world systems rather than learning theory.

Engagement Details

- Location: Remote
- Duration: 2 days
- Audience: Data Engineers and Workflow Developers
- Participants: 4+
- Daily Rate: $1,100 per day

Course Scope

This training focuses on practical, hands-on development of AI-powered systems using Retrieval-Augmented Generation (RAG) and agent-based architectures. The course will follow a Core & Split approach, starting with shared foundational concepts, moving into role-specific deep dives, and concluding with an integrated session demonstrating how AI systems are built and applied across workflows and data pipelines.

NobleProg SOP - https://share.synthesia.io/a0788c6e-56d5-4da8-92c6-0d5c03ad6d52

Key Topics Include

- Practical introduction to LLM applications and AI system architecture
- Retrieval-Augmented Generation (RAG) design and implementation
- Data preparation, embeddings, and vector database concepts
- Agentic AI fundamentals (tools, decision-making, multi-step workflows)
- Orchestration frameworks such as LangChain, LangGraph, or similar
- Role-based applications: RAG pipelines for data engineers and AI-driven workflows for workflow developers
- End-to-end system integration (RAG + agents + automation)

Trainer Responsibilities

- Deliver engaging, instructor-led remote training with strong hands-on focus
- Translate AI concepts into practical applications for non-AI technical professionals
- Structure delivery using a Core & Split model to address different roles
- Provide real-world exercises aligned with data pipelines and workflow automation
- Facilitate an integrated session demonstrating how different components work together
- Prepare training materials (trainer retains ownership of content)

Required Qualifications

- Hands-on experience building LLM-based applications, including RAG systems and agent-based workflows
- Strong proficiency in Python and experience with APIs, data pipelines, or automation systems
- Experience with frameworks such as LangChain, LangGraph, or similar
- Proven experience delivering technical training to engineering audiences
- Ability to simplify AI concepts and connect them to real-world use cases

Nice to Have

- Background in data engineering, workflow automation, or solutions architecture
- Familiarity with MCP or emerging agent orchestration frameworks
- Experience designing modular or role-based training programs preferred
- Experience building production-grade AI applications preferred

https://docs.google.com/document/d/184VlJipyixkLNJ_HnP3aPt4YToedTUAlji_LxkuLhRU/edit?usp=sharing

Please review and approve this tentative outline. We will be meeting with the client to determine whether they prefer a 1-day or 2-day delivery format. The agenda may require some adjustments based on the client's specific objectives, technical background, and areas of interest, which can be finalized during the trainer-client consultation call.

Could you please review the proposed outline and let us know if you see any red flags, gaps, concerns, or topics that may require immediate attention? We would also appreciate any recommendations regarding scope, level of technical depth, hands-on exercises, or prerequisite knowledge that should be addressed before presenting this to the client. Thank you for your feedback.

How to Apply

Please include:

- A brief overview of your experience with Agentic AI and RAG systems
- Your experience delivering technical or AI-focused training
- Examples of AI systems or applications you have built
- Your approach to teaching participants without formal AI background
- Availability for remote delivery
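To make the course's RAG topics (embeddings, retrieval, prompt assembly) concrete, here is a toy sketch of the retrieval step. It deliberately swaps a real embedding model and vector database for bag-of-words counts and an in-memory list, so it runs with the standard library only; `embed`, `cosine`, `retrieve`, and `build_prompt` are illustrative names, not part of LangChain, LangGraph, or any course material.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase word counts (a real system would use a model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, chunks, k=2):
    """Assemble the retrieved context and the question into one prompt string."""
    context = "\n".join(f"- {c}" for c in retrieve(question, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    docs = [
        "Airflow schedules nightly pipeline runs.",
        "The cafeteria opens at noon.",
        "Failed pipeline runs are retried twice.",
    ]
    print(build_prompt("How are failed pipeline runs handled?", docs))
```

The same three-step shape (embed, rank, assemble) carries over when the toy pieces are replaced by a model-based embedder and a vector store.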

Posted 2 weeks ago

Software Engineer, AI Infrastructure

Hourly
Expert
Est. time: More than 6 months, 30+ hrs/week

The Role:

As a Software Engineer on our AI Infrastructure team, you will help design the core systems that power Prism AI’s generative AI platform. You will help build infrastructure and tools that ensure the reliability, performance, quality, and availability of our AI system. Our mission is to make Prism AI the most reliable and user-friendly generative AI platform in the world. You will partner closely with our cloud infrastructure team, product team, and performance team to deliver infrastructure that bridges the gap between our customers and the ultra-performant proprietary Prism inference engine.

Key Responsibilities:

- Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines
- Build and maintain core backend services such as the LLM CI/CD pipeline, control plane, and model serving systems
- Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers
- Build frameworks and safeguards to ensure Prism AI has the best model quality in the industry
- Collaborate with performance, training, and product teams to translate research and product needs into infrastructure solutions
- Participate in code reviews, technical discussions, and continuous integration and deployment processes

Minimum Qualifications:

- Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
- 3 years of experience in software engineering, with a focus on infrastructure or machine learning systems
- Strong programming skills in Python, Go, or a similar language
- Proven experience in ML infrastructure and tooling (e.g., PyTorch, MLflow, Vertex AI, SageMaker, Kubernetes)
- Basic understanding of LLM concepts (e.g., context length, disaggregated prefill, KV cache memory estimation)

Preferred Qualifications:

- 5+ years of experience in software engineering, with a focus on infrastructure or machine learning systems
- Experience with an open-source inference engine such as vLLM, SGLang, or TRT-LLM
- Contributions to open-source infrastructure or ML projects
- Experience in building large-scale ML/MLOps infrastructure
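As a quick illustration of the "KV cache memory estimation" item in the qualifications, the usual back-of-the-envelope formula multiplies 2 (keys and values) by layers, KV heads, head dimension, sequence length, batch size, and bytes per element. The sketch below applies it to a made-up model configuration; the numbers are hypothetical and do not describe any Prism AI model, and the estimate ignores allocator overhead such as paging or fragmentation.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim,
                   seq_len, batch_size=1, bytes_per_elem=2):
    """Estimate KV cache size in bytes.

    The leading 2 accounts for storing both keys and values;
    bytes_per_elem=2 corresponds to fp16/bf16 storage.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


if __name__ == "__main__":
    # Hypothetical config: 32 layers, 8 KV heads (grouped-query attention),
    # head dimension 128, one 8192-token sequence in fp16.
    size = kv_cache_bytes(32, 8, 128, seq_len=8192)
    print(f"{size / 2**30:.2f} GiB")
```

With this configuration each token costs 128 KiB of cache, so an 8192-token sequence comes to 1 GiB; doubling the batch size or context length doubles the figure.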

Posted 3 weeks ago

Tests, Security Audit for Credit Cards, Possible Refactor: Vibe-coded Social Graphify + Lovable

Hourly: $70.00 - $85.00
Expert
Est. time: 1 to 3 months, 30+ hrs/week

Add Tests, Security Audit for Credit Cards, Possible Refactor: Vibe-coded Social Graphify + Lovable. Do not use AI to write your proposal, or for any of your communication. I do not need AI or an AI detection tool to know. It's obscene. I can't trust Claude to write tests: whenever it makes a productive change, it slyly reverts to changing the tests to use mock data. So I need a human to implement them. Full-stack web dev: JS, Python, LLMs. Additional background or interest in Graph Theory, Graph Neural Nets, Graphical Probabilistic Models, Bayesian Neural Nets, Category Theory, Pre-Deep Learning Natural Language Processing (CFGs, etc.), Semantic Web Tech (RDF/OWL/XBRL), Library Sciences, Pre-LLM Machine Learning (+Stat/Econometrics/etc.), Federated Learning, and Crypto is appreciated. Do not use AI to write your proposal, or for any of your communication. I do not need AI or an AI detection tool to know. It's obscene.

Posted last week

Senior AI Solutions Engineer – Local + Azure/AWS GovCloud LLM Deployment – USA Only

Hourly: $40.00 - $55.00
Expert
Est. time: 3 to 6 months, 30+ hrs/week

Eligibility: This role is open to U.S. citizens only due to client security and compliance requirements. Please apply through this posting only — do not contact Data-Sleek directly regarding this position. Applications received outside this channel will not be considered and will be reported to Upwork.

Data-Sleek is looking for a Senior AI Solutions Engineer to lead our on-premise and government-cloud AI deployments. You will design, build, and deploy AI-powered data pipelines for clients who cannot use commercial cloud due to ITAR, CMMC, or other data residency constraints, beginning with a client in the aerospace and defense sector. Beyond this first engagement, you will become Data-Sleek's go-to engineer for AI deployments across defense and aerospace clients, building the practice rather than just executing a single project.

About Data-Sleek

Founded in 2020, Data‑Sleek® is a U.S.-based AI and data consulting firm that helps mid-market companies build the data foundation that AI actually runs on. We own the full path — data strategy, architecture, integration, warehousing, and AI implementation — so organizations can adopt AI with confidence, stay compliant, and scale, without first hiring an internal data team. Our distributed U.S. team (San Francisco, Los Angeles, Irvine, Dallas, Chicago, and New York) partners with clients across healthcare, finance, insurance, logistics, and technology, modernizing data platforms with best-in-class tools like Snowflake, dbt, Fivetran, Tableau, and AWS. Trusted by Fortune 500 institutions and growing companies alike, Data‑Sleek turns complex data into measurable outcomes — faster insight, lower cost, and AI projects that deliver.

About the Role

You will own the technical delivery of AI-powered data pipelines in restricted environments where commercial cloud is not an option. The immediate engagement centers on a Product Lifecycle Management (PLM) data migration: building a pipeline that connects to a client's SharePoint on a restricted Microsoft 365 government tenant, reads engineering documents, classifies and summarizes them, detects duplicates, and rates naming-convention compliance to produce a migration-readiness report. You will start on-premise, then help the client evaluate and move to government cloud for production.

Key Responsibilities

AI Pipeline Development

- Build AI pipelines that connect to a client's SharePoint on a government cloud tenant, read engineering documents, classify them by type, generate summaries, detect duplicates, and rate naming-convention compliance in support of PLM data migration.
- Catalog large document repositories and produce migration-readiness reports and Excel catalogs that give clients a clear, measurable picture of their data.
- Engineer document-parsing workflows across DOCX, PDF, and XLSX formats, including embedding generation and database operations.

On-Premise & Government Cloud Deployment

- Deploy on-premise first — a Mac Mini running Gemma via Ollama — standing up, serving, and tuning local inference infrastructure.
- Evaluate and migrate to production on Azure OpenAI (Azure Government) or AWS Bedrock (GovCloud) when the client is ready to scale.
- Keep deployments compliant within ITAR-sensitive, restricted-network boundaries throughout.

Architecture & Cost Advisory

- Produce cost models and architecture recommendations that help client IT teams make informed platform decisions based on measured data, not vendor pitches.
- Compare deployment options — local, Azure Government, and AWS GovCloud — on cost, performance, and compliance, and explain the trade-offs clearly.

Practice Building & Delivery

- Serve as Data-Sleek's go-to engineer for AI deployments across defense and aerospace clients.
- Build a reusable capability — a repeatable AI-solutions practice — rather than executing a single one-off project.

What You Bring

Required

- U.S. Citizen: U.S. citizenship is required and non-negotiable due to ITAR and client security and compliance requirements.
- Production LLM deployment: You have stood up inference infrastructure — not just called an API. You've handled model loading, memory constraints, failure modes, and throughput tuning in a real deployment.
- Local inference: Ollama, vLLM, llama.cpp, LM Studio, or TGI. You've served open-source models (Gemma, Llama, Mistral) on local hardware.
- Cloud AI platforms: Azure OpenAI or AWS Bedrock — at least one. Service configuration, model access, authentication, and token-based pricing.
- Python: Pipeline engineering — document parsing (DOCX, PDF, XLSX), API integrations, embedding generation, and database operations (SQLite, Postgres).
- Experience: 5+ years post-degree in software engineering, data engineering, or ML engineering.

Strong Preferences

- Microsoft ecosystem: Entra ID, Microsoft Graph API, and SharePoint REST API at the API level. GCC High experience is a bonus.
- MCP (Model Context Protocol): Experience building or consuming MCP servers — a significant plus for a fast-evolving protocol.
- Workflow orchestration: n8n, Temporal, Airflow, or similar. The pipeline is orchestrated, not scripted.
- Government cloud awareness: Understanding of what FedRAMP High, IL4/IL5, and ITAR mean for cloud architecture decisions.
- Embeddings & vector similarity: sentence-transformers, pgvector, Qdrant, or FAISS for duplicate detection.

Bonus (valued if present)

- Aerospace or defense experience: Familiarity with ECOs, BOMs, and AS9100 saves ramp time.
- Apple Silicon optimization: MLX, Metal acceleration, and Ollama tuning on M-series chips.
- Agentic frameworks: Bedrock AgentCore or Azure AI Foundry — the future direction involves agentic AI workflows on government cloud.

What This Role Is Not

- Model training or fine-tuning. This is deployment engineering, not research.
- Data science or statistical modeling. The AI here is document understanding and classification, not predictive analytics.
- Frontend development. The deliverable is an Excel catalog and a report, not a web app.
- Sales or client acquisition. Data-Sleek's leadership manages the client relationship; you focus on delivery.

Engagement & Compensation

- Remote, US-based. Occasional on-site travel to client facilities for hardware deployment and workshops may be needed. An average of 2–3 trips for the first engagement may be possible.
- Compensation: $40-$55/hour

Why Join Data-Sleek?

At Data-Sleek, you'll lead AI deployments in environments most engineers never touch — government cloud and on-premise systems where commercial tools simply aren't an option. Your work will directly shape how defense and aerospace clients adopt AI, and you'll build a reusable capability the company grows around. We focus on doing the right thing architecturally rather than selling the most expensive option, and we give our engineers the autonomy to deliver real solutions for real constraints.

How to Apply

If you've shipped real LLM deployments with real constraints, we'd like to hear from you. Please submit:

- Your resume
- A brief note describing one LLM deployment you've shipped — what model, what infrastructure, what data source, and what went wrong.

Data-Sleek® is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all contractors.
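Two of the pipeline steps this posting describes, duplicate detection via embeddings and naming-convention compliance, can be sketched in a few lines. Everything specific here is an assumption for illustration: the vectors are hand-written stand-ins for real embeddings, the 0.95 threshold is arbitrary, and the filename pattern is a made-up convention, not the client's.

```python
import math
import re
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def find_duplicates(embeddings, threshold=0.95):
    """Return (name_a, name_b, score) for every pair at or above the threshold.

    Pairwise comparison is O(n^2); a large repository would use an
    approximate nearest-neighbour index instead.
    """
    pairs = []
    for (n1, v1), (n2, v2) in combinations(sorted(embeddings.items()), 2):
        score = cosine(v1, v2)
        if score >= threshold:
            pairs.append((n1, n2, round(score, 3)))
    return pairs


def naming_compliance(filenames, pattern):
    """Fraction of filenames that fully match the given convention regex."""
    if not filenames:
        return 0.0
    rx = re.compile(pattern)
    return sum(1 for f in filenames if rx.fullmatch(f)) / len(filenames)


if __name__ == "__main__":
    # Hand-written vectors standing in for model-generated embeddings.
    vectors = {
        "ECO-0001_revA.docx": [1.0, 0.0, 0.2],
        "ECO-0001_revA copy.docx": [0.98, 0.01, 0.21],
        "BOM-0042_revB.xlsx": [0.0, 1.0, 0.0],
    }
    # Hypothetical convention: TYPE-NNNN_revX.ext
    convention = r"(ECO|BOM)-\d{4}_rev[A-Z]\.(docx|pdf|xlsx)"
    print(find_duplicates(vectors))
    print(naming_compliance(list(vectors), convention))
```

Both functions return plain values that could feed directly into the rows of a migration-readiness catalog.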

Posted 3 weeks ago

Need Java expert with AI integration

Fixed price
Expert
Est. budget: $500.00

Work description

We are looking for an experienced Java / Spring Boot developer who can help design and build backend applications with AI integration. The ideal candidate should have strong experience in Java-based microservices, REST APIs, cloud deployment, and integrating AI/LLM services into business applications. The work may include building APIs, connecting with AI models, integrating OpenAI/Azure OpenAI/LLM services, creating AI agents or automation workflows, and improving existing backend systems.

Responsibilities:

- Design and develop scalable backend services using Java and Spring Boot.
- Integrate AI/LLM APIs such as OpenAI, Azure OpenAI, LangChain4j, Semantic Kernel, or similar tools.
- Build RESTful APIs for AI-powered features.
- Work with databases, authentication, external APIs, and cloud services.
- Optimize performance, security, and reliability of backend services.
- Write clean, maintainable, and well-tested code.
- Troubleshoot production issues and improve existing applications.

Required Skills:

- Strong experience with Java 8/11/17+ and Spring Boot.
- Experience with REST APIs, microservices, and JSON.
- Hands-on experience integrating AI/LLM APIs.
- Knowledge of prompt engineering, embeddings, RAG, vector databases, or AI agents is a plus.
- Experience with SQL/NoSQL databases.
- Familiarity with AWS, Azure, or GCP.
- Good communication and ability to work independently.

Preferred Skills:

- Azure OpenAI or OpenAI API integration.
- LangChain4j, Spring AI, or Semantic Kernel experience.
- Vector DB experience such as Pinecone, Chroma, Weaviate, FAISS, or Azure AI Search.
- Docker, Kubernetes, CI/CD knowledge.
- Healthcare, finance, or enterprise application experience is a plus.

Project Goal:

We need a Java expert who can help us build AI-powered backend features and integrate them into our existing application in a scalable and secure way.

To Apply:

Please share examples of Java/Spring Boot projects you have worked on, especially any projects involving AI, LLMs, automation, or API integrations.

Posted 3 weeks ago

Full stack AI Agent Developer — AI Data Pipeline, Vector DB & Real-Time Push API. Web crawlers.

Hourly: $50.00 - $100.00
Expert
Est. time: 1 to 3 months, 30+ hrs/week

Title: Backend Developer — AI Data Pipeline, Vector DB & Real-Time Push API

Post: We are building an automated backend system that continuously crawls public web sources, processes and indexes content using AI, and delivers updates via webhooks. Looking for someone who has built this type of system before and can move fast. NDA required before project details are shared.

What you’ll build:

• Web crawler network —.
• AI processing pipeline — cleans, deduplicates, chunks, and embeds ingested content into a vector database using an LLM embedding model. Quality scoring and incremental updates required.
• Push API — monitors for significant content changes and delivers updates via webhook endpoints automatically. Includes configurable push schedules per subscriber, REST query endpoint, API key authentication, and token usage tracking per key.

Tech stack (flexible — use what you know best):

• Python (FastAPI) or Node.js
• Any vector DB — Pinecone, ChromaDB, Supabase
• Any LLM API — Anthropic or OpenAI
• Any scheduler — n8n, APScheduler, cron
• Redis for queue management
• Railway, Render, or AWS for deployment

Requirements:

• NDA signed before kickoff — non-negotiable
• Must have built RAG pipelines or vector DB systems in production — not tutorials
• Must have experience with web crawlers and scheduled job pipelines
• Must have experience with webhook delivery systems
• GitHub or portfolio showing relevant deployed work required
• 95%+ Job Success Score preferred
• Individual contractors only — no agencies

To apply include:

• Example of a similar system you’ve built — web crawler, RAG pipeline, or push notification API
• Your preferred stack for this type of build
• Brief technical approach in 3–5 sentences
• Hourly rate and availability to start

Budget: $50–$80/hr

Timeline: 3 weeks — focused sprint with daily check-ins
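The "chunks, deduplicates, incremental updates" part of this pipeline can be sketched with the standard library alone. This is one plausible approach under stated assumptions, not the client's design: chunking is by fixed character windows with overlap, duplicates are detected by hashing whitespace- and case-normalized text, and `chunk_text`, `content_hash`, and `diff_chunks` are hypothetical names. Only chunks whose hash was not seen in the previous crawl would go on to embedding and webhook delivery.

```python
import hashlib


def chunk_text(text, size=200, overlap=40):
    """Split text into fixed-size character windows that overlap."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    if not text:
        return []
    step = size - overlap
    # Stop before a window that would contain only already-covered overlap.
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]


def content_hash(chunk):
    """Hash of the chunk after collapsing whitespace and lowercasing."""
    normalized = " ".join(chunk.split()).lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def diff_chunks(previous_hashes, chunks):
    """Return (new_chunks, current_hashes) for one crawl.

    new_chunks keeps first occurrences whose hash is absent from
    previous_hashes; current_hashes is what to store for the next crawl.
    """
    current = set()
    fresh = []
    for chunk in chunks:
        h = content_hash(chunk)
        if h in current:
            continue  # duplicate within this crawl
        current.add(h)
        if h not in previous_hashes:
            fresh.append(chunk)
    return fresh, current


if __name__ == "__main__":
    first_crawl = ["Price is $10.", "Ships in 2 days.", "price is  $10."]
    new, stored = diff_chunks(set(), first_crawl)
    print(new)  # the near-identical third chunk is dropped
    second_crawl = ["Price is $12.", "Ships in 2 days."]
    changed, _ = diff_chunks(stored, second_crawl)
    print(changed)  # only the changed chunk would be embedded and pushed
```

Exact hashing only catches duplicates that normalize to identical text; near-duplicate pages would need a similarity-based check on top.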
