Search Freelance Jobs on Upwork

Posted 4 weeks ago

Fix ML Model Inference Bug + Improve FastAPI Prediction Endpoint

Fixed price
Expert
Est. budget: $200.00

We have a small Python-based machine learning inference service built with FastAPI and scikit-learn. The model was trained on structured tabular data, but our prediction endpoint is currently failing because of feature mismatch errors between the training pipeline and incoming API payloads. We need an experienced ML/MLOps engineer to quickly debug the issue, clean up the preprocessing logic, and make the `/predict` endpoint work reliably again. The goal is not to retrain the full model or build a large system. We only need a focused fix: review the existing model artifact, inspect the expected feature columns, update the API preprocessing code, and provide a short explanation of what was wrong. Bonus if you can also add a simple test request example or basic validation for missing fields. This should be a quick one-time task for someone comfortable with Python, scikit-learn, Pandas, FastAPI, and ML deployment workflows.

Posted 2 weeks ago

Senior AI Developer — Fintech/Banking | LLM Agents, AWS, Python

Hourly: $20.00 - $60.00
Expert
Est. time: More than 6 months, 30+ hrs/week

We're hiring a senior AI developer to build and deploy AI solutions for a fintech/credit-union platform. The work spans autonomous banking agents, fraud detection, credit scoring, and bill-pay/invoice automation — at the intersection of LLMs, cloud infrastructure, and financial-domain expertise, with security and compliance built in from the start. This is a long-term, ongoing engagement. What you'll do: AI agents & orchestration - Design, build, and deploy multi-agent systems using Amazon Bedrock Agents, LangChain, and related frameworks - Architect agentic workflows for core banking use cases: credit scoring, fraud detection, bill-pay automation, invoice management - Define agent personas, memory strategies, tool-use patterns, and escalation paths for production banking agents LLM engineering - Fine-tune, prompt-engineer, and evaluate LLMs for financial-domain tasks - Build RAG pipelines over credit-union knowledge bases, policy docs, and member data - Implement guardrails, content filtering, and compliance checks for safe, regulated outputs - Monitor performance, hallucination rates, and latency against SLAs Cloud infrastructure (AWS & Azure) - Architect and manage AI/ML workloads on AWS (Bedrock, SageMaker, Lambda, S3, IAM, VPC) and Azure (OpenAI Service, Azure ML, AKS) - Design secure, cost-optimized environments compliant with NCUA, PCI-DSS, and SOC 2 - Implement infrastructure-as-code with Terraform or AWS CDK DevOps & MLOps - Build and maintain CI/CD pipelines (GitHub Actions, Jenkins, CodePipeline, Azure DevOps) - Containerize services with Docker, orchestrate with Kubernetes (EKS/AKS) - Apply MLOps best practices: model versioning, A/B testing, canary deployments, automated rollback - Stand up observability with logging, tracing, and alerting Python development - Write clean, well-tested Python for AI pipelines, REST APIs, and data workflows - Build FastAPI/Flask microservices exposing agent capabilities to frontend and core banking systems - Integrate with financial data sources, core banking APIs, and third-party fintech services Banking applications - Build credit-scoring models using alternative data and explainable AI (XAI) - Develop real-time fraud detection with behavioral analytics, anomaly detection, and auto-decisioning - Create conversational agents for bill pay, account management, and member self-service - Automate invoice workflows: extraction, classification, approval routing, reconciliation - Partner with compliance/risk to keep AI decisions auditable, fair, and regulatory-compliant What you should have: - 5+ years software engineering; 3+ years in AI/ML or LLM engineering - 2+ years building AI for banking, credit unions, or financial services - Hands-on experience with Amazon Bedrock, LangChain, Python, AWS, and infrastructure-as-code - Working knowledge of NCUA, PCI-DSS, SOC 2, GLBA, and Fair Lending requirements - Bachelor's or Master's in Computer Science, Software Engineering, Data Science, or related field Nice to have: - AWS or Azure AI/ML certifications - Open-source LLM experience (Llama, Mistral, Phi) and self-hosted inference (vLLM, Ollama) - Vector databases (Pinecone, OpenSearch, pgvector) - Graph-based fraud networks and graph ML - AI governance / responsible AI framework experience - Prior work at a credit union, community bank, or fintech lending platform To apply, please share: - Your resume highlighting AI and banking project experience - A brief note on your most impactful AI agent or LLM project in a financial-services context - Links to GitHub, portfolio, or published papers (optional but encouraged)

Posted 4 weeks ago

Software Engineer, AI Infrastructure

Hourly
Expert
Est. time: More than 6 months, 30+ hrs/week

The Role: As a Software Engineer on our AI Infrastructure team, you will help design the core systems that power Prism AI’s generative AI platform. You will help build infrastructure and tools that ensure the reliability, performance, quality, and availability of our AI system. Our mission is to make Prism AI the most reliable and user friendly generative AI platform in the world. You will partner closely with our cloud infrastructure team, product team, and performance team to deliver infrastructure that bridges the gap between our customer and the ultra-performant proprietary Prism inference engine. Key Responsibilities: Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines Build and maintain core backend services such as LLM CI/CD pipeline, control plane, and model serving systems Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers Building frameworks and safeguards to ensure Prism AI has the best model quality in the industry Collaborate with performance, training, and product teams to translate research and product needs into infrastructure solutions Participate in code reviews, technical discussions, and continuous integration and deployment processes Minimum Qualifications: Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience) 3 years of experience in software engineering, with a focus on infrastructure or machine learning systems Strong programming skills in Python, Go, or a similar language Proven experience in ML infrastructure and tooling (e.g., PyTorch, MLflow, Vertex AI, SageMaker, Kubernetes, etc.). Basic understanding of LLM knowledge (e.g., context length, disaggregated prefill, KV cache memory estimation, etc) Preferred Qualifications: 5+ years of experience in software engineering, with a focus on infrastructure or machine learning systems Experience with open source inference engine like vLLM, Sglang, or TRT-LLM Contributions to open-source infrastructure or ML projects Experience in building large scale ML/MLOps infrastructure

Posted 5 days ago

Multi-Agent AI, RAG, LLMOps & Cloud Strategy

Fixed price
Expert
Est. budget: $500.00

Seeking an experienced AI Solutions Architect to provide consultation and design an enterprise-grade Agentic AI platform capable of automating business workflows, retrieving knowledge from enterprise data sources, and integrating with existing business systems. This engagement focuses on solution architecture, technical design, technology selection, implementation planning, and best practices rather than full-scale application development. Scope of Consultation The consultant will: Assess current business processes and identify high-value AI automation opportunities. Design an enterprise Agentic AI architecture aligned with business and technical requirements. Define multi-agent workflows, agent responsibilities, and orchestration strategies. Design a scalable Retrieval-Augmented Generation (RAG) architecture for enterprise knowledge retrieval. Recommend the appropriate Large Language Models (OpenAI, Claude, Gemini, AWS Bedrock, Azure OpenAI, etc.) based on cost, performance, and use cases. Recommend vector database technologies and semantic search architecture. Design secure integrations with enterprise applications, APIs, and internal knowledge repositories. Define prompt engineering strategies, AI guardrails, evaluation methodology, and governance practices. Recommend cloud architecture and deployment strategies for AWS, Azure, or Google Cloud Platform. Provide guidance on LLMOps, monitoring, observability, security, model lifecycle management, and scalability. Develop an implementation roadmap, including phases, estimated effort, risks, and technical recommendations. Required Expertise Enterprise AI Solution Architecture Agentic AI Multi-Agent Systems Retrieval-Augmented Generation (RAG) Large Language Models (LLMs) LangChain LangGraph Prompt Engineering OpenAI API Anthropic Claude Google Gemini AWS Bedrock Python Vector Databases Enterprise System Integration AWS Microsoft Azure Google Cloud Platform AI Governance LLMOps MLOps Workflow Automation Deliverables Enterprise Agentic AI solution architecture document Multi-agent workflow design and orchestration diagrams RAG architecture and knowledge management design Vector database recommendation and data flow architecture Cloud deployment architecture Integration strategy for enterprise systems AI governance, security, and LLMOps recommendations Implementation roadmap with milestones and estimated effort Architecture review presentation and knowledge transfer session Project Outcome Delivered a comprehensive enterprise AI architecture and implementation strategy that provides a scalable foundation for deploying Agentic AI solutions. The consultation enabled stakeholders to make informed technology decisions, reduce implementation risks, accelerate development, and establish best practices for governance, security, and long-term operational success.

Posted 4 weeks ago

AI Agent Infrastructure Consultant — Parallel Agents Optimization (Python/Node)

Fixed price
Expert
Est. budget: $150.00

**Overview** We are a fast-growing SaaS company with a lean engineering team (~10 devs) utilizing a modern Python (FastAPI/Django) and Node.js backend, React frontend, and PostgreSQL stack. We have already deployed an initial multi-model agent stack—Claude Code, LiteLLM gateway, Git worktrees, and MCP integrations. We need an expert to run an intensive architecture review and optimization session for our current infrastructure. We are not looking for someone to build a full-time, weeks-long project from scratch. Instead, we need a seasoned engineer who has shipped this exact type of infrastructure end-to-end to audit our setup, identify architectural gaps, and guide our team on hardened implementation. This project must move fast. If your timeline is measured in weeks, please do not apply. We want someone who looks at this scope, jumps into a review session, and delivers actionable architectural guidance in days. This starts as a focused, urgent consultation. However, we expect ongoing advisory work—follow-ups, architecture adjustments, and enhancement reviews—as the AI tooling landscape shifts. For the right engineer, this will turn into a recurring relationship. We are completely open to a fixed price per milestone or an hourly structure. **What You Need to Have Actually Shipped and Can Review (Not Just Read About)** * **Full Agentic Coding Harnesses:** The entire loop: orchestrator → subagent → CI gate → merge loops. * **Isolation Layers:** Configured execution layers (such as E2B, Modal, or secure Docker runtimes) as isolated sandboxes for AI-generated code. * **Parallel Claude Code Sessions:** Managed multiple simultaneous subagents on scoped tasks via Git worktrees. * **Self-Hosted LiteLLM Gateways:** Routing to multiple models (Claude, GPT, Gemini, DeepSeek). * **MCP Server Infrastructure:** Connected file system, PostgreSQL, Atlassian, and Slack tool layers for active agents. * **Agent Framework Structures:** Used CLAUDE.md, COMMON\_MISTAKES.md, subagent role definitions, hook scripts, and settings.json. * **Human-in-the-Loop Orchestration:** Built Plan Mode or equivalent approval gates before agent execution. * **Multi-Agent Frameworks:** 7-agent feature factory patterns or frameworks like LangGraph, CrewAI, or Autogen. * **Durable Workflow Engines:** Applied Temporal, n8n, or similar tools for long-running agent workflow execution. * **Mechanical Quality Gates:** Treating CI green as the ultimate gate for agent output quality. \[[1](https://manveerc.substack.com/p/ai-agent-sandboxing-guide)\] **Our Current Stack (What you are reviewing)** * **Backend:** Python (FastAPI / Django) & Node.js (TypeScript) * **Frontend:** React (Next.js) * **Database & ORM:** PostgreSQL / Prisma / SQLAlchemy * **Infrastructure:** Docker Compose, AWS (ECS/EKS) * **CI/CD:** GitHub Actions / GitLab CI * **AI Layer:** Claude Code with shared `.claude/` directory, CLAUDE.md, and LiteLLM gateway in Docker * **MCP:** Atlassian (Jira/Confluence), custom PostgreSQL MCP server, Slack * **Workflow Automation:** Temporal / n8n * **QA Automation:** Playwright / Autonoma **Scope of Work (Review & Advisory Only)** 1. **Comprehensive Audit:** Audit our current agent harness and identify architectural gaps against a production-grade standard. 2. **Sandbox Strategy Consultation:** Review our environment strategy to ensure highly secure, isolated execution runtimes for agent code runs. 3. **Workflow Hardening Review:** Evaluate our parallel agent workflow setup (Git worktrees, subagent role configs, hook scripts, and settings lockdown). 4. **CI Pipeline Integration Strategy:** Advise on wiring our sandbox execution layer into the existing CI pipeline so agent-executed code runs in clean snapshots, not live infra. 5. **Architectural Runbook:** Deliver an optimization report / documented standard that our backend lead can easily own and execute going forward. **How to Apply** Skip the generic pitch. Show us something real to be considered: 1. A GitHub repo, architecture diagram, or Loom walkthrough of an agentic harness you have actually shipped. 2. Specific tools from our stack you have personally configured (E2B, LiteLLM, Claude Code, etc.). 3. One sentence explaining the hardest problem you solved to get full agent loops running reliably. 4. Your availability to conduct this high-impact architectural review session this week.

Posted last month

Senior Software engineer with deep knowledge of AI/LLM integrations across multiple cloud platforms

Hourly: $70.00 - $85.00
Expert
Est. time: More than 6 months, 30+ hrs/week

I need an expert senior software engineer that can provide consulting services around implementation best practices of LLM's and AI into existing application workflows. i.e. leveraging AI to extract data from a document as part of an ingestion pipeline.

Jobs Per Page: