Python developer for AI Model Response Evaluator
Worldwide
Looking for an experienced developer to evaluate AI-generated coding responses. Python background is strongly preferred. This requires genuine engineering judgment, you need to understand code well enough to critically assess whether a solution is correct, complete, and well-reasoned. Not a coding job, but real coding experience is a must. Who I'm looking for: - Python-heavy background - Comfortable reading and reviewing code, not just writing it - Clear, direct communicator with evidence-based reasoning - Detail-oriented and consistent Prior experience with AI labeling is a plus but not required. To apply, tell me: - Your engineering background - languages, domains, years of experience - One example where you reviewed someone else's code and caught a real issue - Your availability and preferred payment structure - Github link
- More than 30 hrs/weekHourly
- 3-6 monthsDuration
- ExpertExperience Level
$15.00
-
$25.00
Hourly- Remote Job
- Ongoing projectProject Type
Skills and Expertise
Activity on this job
- Proposals:20 to 50
- Last viewed by client:3 weeks ago
- Interviewing:4
- Invites sent:6
- Unanswered invites:1
About the client
- USATyler12:56 AM
- $1.3K total spent14 hires, 4 active
- 255 hours
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by