Speech AI Engineer – Fine-tune Open-Source Speech-to-Text Model
Worldwide
We’re looking for an experienced ML engineer to fine-tune an open-source speech-to-text model for improved transcription accuracy on English and Hindi-English (Hinglish) conversational audio.

Responsibilities:
* Prepare and clean speech datasets
* Fine-tune the ASR model
* Evaluate transcription quality (WER/CER)
* Optimize inference and document the training pipeline

Requirements:
* Experience with speech recognition (Whisper, Cohere Transcribe, NeMo, wav2vec2, etc.)
* Strong PyTorch/Hugging Face experience
* Experience with multilingual or code-switched speech is preferred
* Familiarity with WER evaluation and dataset preparation

Please include:
* Relevant ASR projects you’ve worked on
* Models you’ve fine-tuned
* Links to GitHub or publications (if available)
* Your expected timeline for an initial fine-tuned model
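For applicants unsure how the WER/CER metrics named under Responsibilities are usually computed, here is a minimal sketch in plain Python. It is an illustration only, not the client's evaluation setup: the function names, the lowercasing, and the whitespace normalization are all assumptions, and the posting does not say how Hinglish text (for example, Latin versus Devanagari script) should be normalized before scoring.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences
    (minimum substitutions + insertions + deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def _normalize(text):
    # Assumed normalization: lowercase and collapse whitespace.
    return " ".join(text.lower().split())


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / reference word count."""
    ref_words = _normalize(reference).split()
    hyp_words = _normalize(hypothesis).split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length.
    Spaces are counted as characters here (an assumption)."""
    ref_chars = list(_normalize(reference))
    hyp_chars = list(_normalize(hypothesis))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


if __name__ == "__main__":
    # One deleted word out of four reference words.
    print(wer("please call me tomorrow", "please call tomorrow"))  # 0.25
```

In practice, a maintained library would likely replace this hand-rolled version; the sketch only shows what the two numbers measure.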
- Budget: $8,000.00 (fixed-price)
- Experience level: Expert
- Remote job
- Project type: Complex project
Activity on this job
- Proposals: 50+
- Last viewed by client: 4 days ago
- Hires: 2
- Interviewing: 3
- Invites sent: 1
- Unanswered invites: 0
About the client
- United States, San Francisco (7:22 AM local time)
- $290K total spent; 74 hires, 37 active
- 218 hours