Search Freelance Jobs on Upwork

Posted 3 weeks ago

Software Engineer, AI Infrastructure

Hourly
Expert
Est. time: More than 6 months, 30+ hrs/week

The Role: As a Software Engineer on our AI Infrastructure team, you will help design the core systems that power Prism AI’s generative AI platform. You will help build infrastructure and tools that ensure the reliability, performance, quality, and availability of our AI system. Our mission is to make Prism AI the most reliable and user friendly generative AI platform in the world. You will partner closely with our cloud infrastructure team, product team, and performance team to deliver infrastructure that bridges the gap between our customer and the ultra-performant proprietary Prism inference engine. Key Responsibilities: Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines Build and maintain core backend services such as LLM CI/CD pipeline, control plane, and model serving systems Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers Building frameworks and safeguards to ensure Prism AI has the best model quality in the industry Collaborate with performance, training, and product teams to translate research and product needs into infrastructure solutions Participate in code reviews, technical discussions, and continuous integration and deployment processes Minimum Qualifications: Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience) 3 years of experience in software engineering, with a focus on infrastructure or machine learning systems Strong programming skills in Python, Go, or a similar language Proven experience in ML infrastructure and tooling (e.g., PyTorch, MLflow, Vertex AI, SageMaker, Kubernetes, etc.). Basic understanding of LLM knowledge (e.g., context length, disaggregated prefill, KV cache memory estimation, etc) Preferred Qualifications: 5+ years of experience in software engineering, with a focus on infrastructure or machine learning systems Experience with open source inference engine like vLLM, Sglang, or TRT-LLM Contributions to open-source infrastructure or ML projects Experience in building large scale ML/MLOps infrastructure

Posted 3 weeks ago

Machine Learning Engineer

Fixed price
Entry Level
Est. budget: $250.00

We are looking for an entry-level Software Engineer who is strong in computer science fundamentals and algorithms. You will work on real-world software problems. This role suits someone who enjoys bridging theory and practice: thinking carefully about problem formulation, writing clean and efficient code, and taking ownership of results end-to-end. ROLE OVERVIEW You will work within a small, cross-functional team to build software. You will be expected to think algorithmically, write quality code, and communicate your findings clearly to non-technical stakeholders. KEY RESPONSIBILITIES Analyse product and business requirements provided by the team. Select appropriate algorithms and architectures based on data characteristics, constraints, and performance requirements. Design and implement efficient data structures and algorithms. TOOLS & STACK Knowledge in these areas are preferred. Postgres and Mongo DB Machine learning and LLM frameworks Middleware and mobile concepts especially React-Native and Javascript/NodeJS Infrastructure: Basic familiarity with GCP or AWS Version control: Git QUALIFICATIONS Bachelor's degree in Computer Science is preferred Strong foundations in algorithms and data structures — able to reason about complexity and write efficient code. Good understanding of machine learning and LLM concepts Clear written and verbal communication; able to explain model behaviour and trade-offs to non-specialists.

Posted 2 weeks ago

GenAI Engineer for Client

Hourly
Intermediate
Est. time: More than 6 months, 30+ hrs/week

We are seeking a skilled GenAI engineer to work with our client in a remote or Chicago-based capacity. The ideal candidate will have experience in developing and implementing AI solutions, with a strong understanding of machine learning and data analysis. Responsibilities include designing AI models, integrating AI into existing systems, and collaborating with cross-functional teams to enhance AI capabilities. If you have a passion for AI and a proven track record in delivering innovative solutions, we would love to hear from you.

Posted 3 weeks ago

AI Agent Infrastructure Consultant — Parallel Agents Optimization (Python/Node)

Fixed price
Expert
Est. budget: $150.00

**Overview** We are a fast-growing SaaS company with a lean engineering team (~10 devs) utilizing a modern Python (FastAPI/Django) and Node.js backend, React frontend, and PostgreSQL stack. We have already deployed an initial multi-model agent stack—Claude Code, LiteLLM gateway, Git worktrees, and MCP integrations. We need an expert to run an intensive architecture review and optimization session for our current infrastructure. We are not looking for someone to build a full-time, weeks-long project from scratch. Instead, we need a seasoned engineer who has shipped this exact type of infrastructure end-to-end to audit our setup, identify architectural gaps, and guide our team on hardened implementation. This project must move fast. If your timeline is measured in weeks, please do not apply. We want someone who looks at this scope, jumps into a review session, and delivers actionable architectural guidance in days. This starts as a focused, urgent consultation. However, we expect ongoing advisory work—follow-ups, architecture adjustments, and enhancement reviews—as the AI tooling landscape shifts. For the right engineer, this will turn into a recurring relationship. We are completely open to a fixed price per milestone or an hourly structure. **What You Need to Have Actually Shipped and Can Review (Not Just Read About)** * **Full Agentic Coding Harnesses:** The entire loop: orchestrator → subagent → CI gate → merge loops. * **Isolation Layers:** Configured execution layers (such as E2B, Modal, or secure Docker runtimes) as isolated sandboxes for AI-generated code. * **Parallel Claude Code Sessions:** Managed multiple simultaneous subagents on scoped tasks via Git worktrees. * **Self-Hosted LiteLLM Gateways:** Routing to multiple models (Claude, GPT, Gemini, DeepSeek). * **MCP Server Infrastructure:** Connected file system, PostgreSQL, Atlassian, and Slack tool layers for active agents. * **Agent Framework Structures:** Used CLAUDE.md, COMMON\_MISTAKES.md, subagent role definitions, hook scripts, and settings.json. * **Human-in-the-Loop Orchestration:** Built Plan Mode or equivalent approval gates before agent execution. * **Multi-Agent Frameworks:** 7-agent feature factory patterns or frameworks like LangGraph, CrewAI, or Autogen. * **Durable Workflow Engines:** Applied Temporal, n8n, or similar tools for long-running agent workflow execution. * **Mechanical Quality Gates:** Treating CI green as the ultimate gate for agent output quality. \[[1](https://manveerc.substack.com/p/ai-agent-sandboxing-guide)\] **Our Current Stack (What you are reviewing)** * **Backend:** Python (FastAPI / Django) & Node.js (TypeScript) * **Frontend:** React (Next.js) * **Database & ORM:** PostgreSQL / Prisma / SQLAlchemy * **Infrastructure:** Docker Compose, AWS (ECS/EKS) * **CI/CD:** GitHub Actions / GitLab CI * **AI Layer:** Claude Code with shared `.claude/` directory, CLAUDE.md, and LiteLLM gateway in Docker * **MCP:** Atlassian (Jira/Confluence), custom PostgreSQL MCP server, Slack * **Workflow Automation:** Temporal / n8n * **QA Automation:** Playwright / Autonoma **Scope of Work (Review & Advisory Only)** 1. **Comprehensive Audit:** Audit our current agent harness and identify architectural gaps against a production-grade standard. 2. **Sandbox Strategy Consultation:** Review our environment strategy to ensure highly secure, isolated execution runtimes for agent code runs. 3. **Workflow Hardening Review:** Evaluate our parallel agent workflow setup (Git worktrees, subagent role configs, hook scripts, and settings lockdown). 4. **CI Pipeline Integration Strategy:** Advise on wiring our sandbox execution layer into the existing CI pipeline so agent-executed code runs in clean snapshots, not live infra. 5. **Architectural Runbook:** Deliver an optimization report / documented standard that our backend lead can easily own and execute going forward. **How to Apply** Skip the generic pitch. Show us something real to be considered: 1. A GitHub repo, architecture diagram, or Loom walkthrough of an agentic harness you have actually shipped. 2. Specific tools from our stack you have personally configured (E2B, LiteLLM, Claude Code, etc.). 3. One sentence explaining the hardest problem you solved to get full agent loops running reliably. 4. Your availability to conduct this high-impact architectural review session this week.

Posted 2 days ago

Forward Deployed AI Engineer

Hourly: $30.00 - $60.00
Intermediate
Est. time: More than 6 months, 30+ hrs/week

We are looking for a hands-on Forward Deployed AI Engineer to help build practical AI systems This is not a pure backend role and not a strategy-only consulting role. You will work close to end users, understand how their workflows actually operate, and then build AI-enabled tools that solve specific business problems. The ideal person is a strong software engineer who is comfortable with ambiguity, can communicate clearly with non-technical stakeholders, and can take an AI prototype from idea to something reliable and usable. What you will do - Learn the business workflows, systems, data, and constraints. - Build AI applications using Claude or similar large language models. - Use the right mix of prompting, retrieval, tool use, agents, and workflow automation. - Own delivery from scoping through prototype, testing, hardening, and handoff. - Create evaluations to determine whether the system is accurate, reliable, and safe enough to use. - Translate between domain experts and technical implementation. - Work carefully with sensitive or regulated data. - Document what you build so it can be maintained and reused. What we are looking for - Strong Python engineering skills. - Hands-on experience building with LLMs, preferably Claude or the Anthropic API. - Experience with RAG, structured prompting, tool use, evaluation, or agentic workflows. - Ability to operate independently in a messy, ambiguous environment. - Strong communication skills with both technical and non-technical stakeholders. - Track record of shipping working software, not just demos. - Comfort working with real-world data, integrations, and imperfect requirements. Helpful but not required - Prior forward deployed engineering, solutions engineering, or technical consulting experience. - Experience building AI tools for enterprise customers. - Experience in regulated or sensitive-data environments. - Familiarity with validation, auditability, traceability, or compliance-oriented workflows.

Posted 2 days ago

Multi-Agent AI, RAG, LLMOps & Cloud Strategy

Fixed price
Expert
Est. budget: $500.00

Seeking an experienced AI Solutions Architect to provide consultation and design an enterprise-grade Agentic AI platform capable of automating business workflows, retrieving knowledge from enterprise data sources, and integrating with existing business systems. This engagement focuses on solution architecture, technical design, technology selection, implementation planning, and best practices rather than full-scale application development. Scope of Consultation The consultant will: Assess current business processes and identify high-value AI automation opportunities. Design an enterprise Agentic AI architecture aligned with business and technical requirements. Define multi-agent workflows, agent responsibilities, and orchestration strategies. Design a scalable Retrieval-Augmented Generation (RAG) architecture for enterprise knowledge retrieval. Recommend the appropriate Large Language Models (OpenAI, Claude, Gemini, AWS Bedrock, Azure OpenAI, etc.) based on cost, performance, and use cases. Recommend vector database technologies and semantic search architecture. Design secure integrations with enterprise applications, APIs, and internal knowledge repositories. Define prompt engineering strategies, AI guardrails, evaluation methodology, and governance practices. Recommend cloud architecture and deployment strategies for AWS, Azure, or Google Cloud Platform. Provide guidance on LLMOps, monitoring, observability, security, model lifecycle management, and scalability. Develop an implementation roadmap, including phases, estimated effort, risks, and technical recommendations. Required Expertise Enterprise AI Solution Architecture Agentic AI Multi-Agent Systems Retrieval-Augmented Generation (RAG) Large Language Models (LLMs) LangChain LangGraph Prompt Engineering OpenAI API Anthropic Claude Google Gemini AWS Bedrock Python Vector Databases Enterprise System Integration AWS Microsoft Azure Google Cloud Platform AI Governance LLMOps MLOps Workflow Automation Deliverables Enterprise Agentic AI solution architecture document Multi-agent workflow design and orchestration diagrams RAG architecture and knowledge management design Vector database recommendation and data flow architecture Cloud deployment architecture Integration strategy for enterprise systems AI governance, security, and LLMOps recommendations Implementation roadmap with milestones and estimated effort Architecture review presentation and knowledge transfer session Project Outcome Delivered a comprehensive enterprise AI architecture and implementation strategy that provides a scalable foundation for deploying Agentic AI solutions. The consultation enabled stakeholders to make informed technology decisions, reduce implementation risks, accelerate development, and establish best practices for governance, security, and long-term operational success.

Posted 3 weeks ago

Build a Customer Churn Prediction Model with API Endpoint

Fixed price
Expert
Est. budget: $100.00

We're a small SaaS company and we have a CSV dataset (~10,000 rows) of customer activity data including features like subscription length, login frequency, support tickets filed, monthly spend, and whether the customer churned (binary label). We need someone to build a simple but effective churn prediction model and expose it via a lightweight API so our internal tools can call it. Scope of work: - Perform basic exploratory data analysis (EDA) and generate a few key visualizations (churn distribution, feature correlations, top predictive features) - Clean and preprocess the data (handle missing values, encode categoricals, scale features) - Train and evaluate at least 2-3 classification models (e.g., Logistic Regression, Random Forest, XGBoost) with appropriate metrics (accuracy, precision, recall, F1, AUC-ROC) - Select the best model and save it as a serialized file (pickle or joblib) - Build a simple FastAPI endpoint that accepts customer features as JSON input and returns a churn probability score - Provide a Jupyter notebook with the full EDA + modeling workflow, plus the FastAPI app code - Include a brief README with setup instructions Deliverables: GitHub repo or zip with notebook, model file, API code, requirements.txt, and README. We're looking for someone who can do this quickly and cleanly — no over-engineering needed, just solid ML fundamentals and a working API. Ideally completed within a couple of days.

Posted 2 months ago

AI Content Reviewer & Expert Contributor

Fixed price
Intermediate
Est. budget: $30.00

We’re looking for experienced AI professionals to provide short, original quotes, practical insights, and light content feedback for our educational articles and guides. Your real-world perspective will help make the content more accurate, useful, and trustworthy for readers. The initial project involves reviewing and contributing to one guide, with the possibility of ongoing work. Example guide: onlinemastersdegrees.org/best-programs/information-systems/ **What You’ll Do:** * Review AI education content for accuracy and clarity * Leave light feedback through Google Docs comments * Provide brief expert quotes, usually 2–5 sentences each * Offer practical insights based on real-world AI, machine learning, or data science experience * Help add context around AI careers, degree programs, certifications, skills, tools, and industry expectations **For the Initial Project:** We’re looking to add approximately 3–4 short expert quotes to one AI guide. Quotes should be original, practical, and based on your professional experience. **Details:** * $30 per page * Pages typically take 20–30 minutes * Clear guidelines and examples provided * Contract, flexible, and ongoing work **Relevant Experience May Include:** * Artificial intelligence * Machine learning * Data science * Generative AI * Natural language processing * Computer vision * AI product development * MLOps * AI governance, risk, or compliance * Responsible AI * AI education or workforce development **In your submission, please include:** 1. A few sentences about your AI background, professional experience, and areas of expertise 2. Any relevant degrees, certifications, credentials, or notable AI projects 3. Link to your LinkedIn profile To help us sort through automated submissions, please put the name of Shopify’s CEO at the top of your submission.

Posted 4 weeks ago

Tavus AI Agent Development

Hourly: $15.00 - $35.00
Intermediate
Est. time: 1 to 3 months, Less than 30 hrs/week

We are seeking a skilled freelancer to develop a Tavus AI agent capable of interviewing people and recording conversations directly to cloud storage. The ideal candidate will have experience in AI development and cloud integration, ensuring seamless and secure data storage.

Jobs Per Page: