AI Engineer

Posted 6 days ago

Only freelancers located in the U.S. may apply.U.S. located freelancers only

Summary

I’m looking for an AI Engineer to help build an automated red-teaming product based on open-source models. This is a short-term, hands-on project for around 2 months, with an expected commitment of about 20 hours per week. The goal is to build a specialized red-teaming engine that can generate adversarial prompts across different risk domains, severity levels, and attack strategies — then automatically run those prompts against target AI models to identify bad cases, failure patterns, and safety gaps. 🔍 What you’ll work on Build red-teaming systems on top of open-source LLMs, including fine-tuning, prompt optimization, evaluation pipelines, and model orchestration. Design automated prompt generation workflows across risk domains such as self-harm, hate, violence, sexual safety, misinformation, fraud, cyber, and other high-risk areas. Generate prompts across different harm levels, from benign edge cases to policy-borderline and clearly unsafe scenarios, while maintaining structured taxonomies and evaluation criteria. Run automated tests against target models such as Gemma, Llama, Qwen, or other open-source / closed-source models to surface jailbreak patterns, over-refusal, under-refusal, and policy inconsistencies. Build feedback loops that turn model failures into stronger red-team prompts, improved eval sets, remediation recommendations, and continuous safety testing. 🧠 What I’m looking for Hands-on experience with open-source LLMs, fine-tuning, LoRA / QLoRA, RAG, model evaluation, and LLM inference pipelines. Familiarity with AI safety, red teaming, adversarial prompting, jailbreaks, safety evals, or trust & safety systems. Ability to build end-to-end systems, including data pipelines, model serving, eval harnesses, scoring, dashboards, and automation workflows. Bonus if you’ve worked on model safety, content moderation, policy evaluation, agentic testing, or automated eval infrastructure. ⏳ Project setup Duration: around 2 months Time commitment: about 20 hours per week Format: flexible / remote-friendly Stage: early-stage build, from 0 to 1 🚀 Why this is interesting This is not about manually writing red-team prompts one by one. The goal is to build a scalable system that can continuously generate, test, categorize, and learn from model failures — helping teams understand where AI models break, why they break, and how to improve them. If you enjoy working with open-source models, AI safety, red teaming, and fast 0-to-1 product building, I’d love to chat. Feel free to DM me if this sounds like you, or if you know someone who might be a good fit.

  • Less than 30 hrs/week
    Hourly
  • 1-3 months
    Duration
  • Intermediate
    Experience Level
  • $5.00

    -

    $10.00

    Hourly
  • Remote Job
  • Ongoing project
    Project Type
Skills and Expertise
Mandatory skills
LLM Prompt Engineering
Activity on this job
  • Proposals:5 to 10
  • Interviewing:
    0
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Apr 2, 2026
  • USA
    Austin12:21 PM

Explore similar jobs on Upwork

Test Automation Framework
Automated Testing
JavaScript
Python
Auto-GPT
Desktop Application Testing
Web Testing
Bug Reports
Software Testing
Functional Testing
Product Stability
Manual Testing
Automated Testing
C#

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo