AI Engineer
Only freelancers located in the U.S. may apply.
I’m looking for an AI Engineer to help build an automated red-teaming product based on open-source models. This is a short-term, hands-on project lasting around 2 months, with an expected commitment of about 20 hours per week.

The goal is to build a specialized red-teaming engine that can generate adversarial prompts across different risk domains, severity levels, and attack strategies, and then automatically run those prompts against target AI models to identify bad cases, failure patterns, and safety gaps.

🔍 What you’ll work on
- Build red-teaming systems on top of open-source LLMs, including fine-tuning, prompt optimization, evaluation pipelines, and model orchestration.
- Design automated prompt generation workflows across risk domains such as self-harm, hate, violence, sexual safety, misinformation, fraud, cyber, and other high-risk areas.
- Generate prompts across different harm levels, from benign edge cases to policy-borderline and clearly unsafe scenarios, while maintaining structured taxonomies and evaluation criteria.
- Run automated tests against target models such as Gemma, Llama, Qwen, or other open-source / closed-source models to surface jailbreak patterns, over-refusal, under-refusal, and policy inconsistencies.
- Build feedback loops that turn model failures into stronger red-team prompts, improved eval sets, remediation recommendations, and continuous safety testing.

🧠 What I’m looking for
- Hands-on experience with open-source LLMs, fine-tuning, LoRA / QLoRA, RAG, model evaluation, and LLM inference pipelines.
- Familiarity with AI safety, red teaming, adversarial prompting, jailbreaks, safety evals, or trust & safety systems.
- Ability to build end-to-end systems, including data pipelines, model serving, eval harnesses, scoring, dashboards, and automation workflows.
- Bonus if you’ve worked on model safety, content moderation, policy evaluation, agentic testing, or automated eval infrastructure.

⏳ Project setup
- Duration: around 2 months
- Time commitment: about 20 hours per week
- Format: flexible / remote-friendly
- Stage: early-stage build, from 0 to 1

🚀 Why this is interesting
This is not about manually writing red-team prompts one by one. The goal is to build a scalable system that can continuously generate, test, categorize, and learn from model failures, helping teams understand where AI models break, why they break, and how to improve them.

If you enjoy working with open-source models, AI safety, red teaming, and fast 0-to-1 product building, I’d love to chat. Feel free to DM me if this sounds like you, or if you know someone who might be a good fit.
- Hourly: Less than 30 hrs/week
- Duration: 1-3 months
- Experience Level: Intermediate
- Hourly rate: $5.00 - $10.00
- Remote Job
- Project Type: Ongoing project
Activity on this job
- Proposals: 5 to 10
- Interviewing: 0
- Invites sent: 0
- Unanswered invites: 0
About the client
- USA, Austin, 12:21 PM