AI Robotics Developer (World Models, VLA, RL)
Worldwide
Summary We are looking for a highly skilled AI Robotics Developer to build scalable simulation pipelines and train state-of-the-art robotic policies. If you excel at turning physics simulations into rich training grounds and leveraging that data for advanced Reinforcement Learning, World Models, and Vision-Language-Action (VLA) architectures, we want you on our team. What You'll Be Doing - Simulation & Environment Design: Build and maintain highly diverse, physics-accurate simulation environments using MuJoCo, NVIDIA Isaac Sim, or ManiSkill to ensure robust policy generalization. - Synthetic Data Generation: Architect pipelines to generate large-scale, high-quality synthetic "human-like" demonstration and visual data entirely within simulation. - Reinforcement Learning: Utilize generated simulation data to train robust control policies using PPO, GRPO, or advanced model-based RL approaches. - World Models & VLA Development: Train Vision-Language-Action (VLA) models for generalized robotic control. Design and implement predictive world models to enable efficient planning, reasoning, and representation learning in complex environments. - Fine-Tuning & Alignment: Apply Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) to align robotic behaviors, refine VLA outputs, and improve policy reliability. What We’re Looking For - Deep Simulation Expertise: Hands-on experience with MuJoCo, Isaac Sim, or similar physics engines. You know how to randomize domains and build environments that prevent policies from overfitting. - Policy & Model Training Experience: Strong background in applying RL, training World Models (e.g., Dreamer or JEPA architectures), and developing VLA policies for robotic control tasks. - Modern Architecture Knowledge: Deep familiarity with Transformer-based architectures, large-scale representation learning, and multi-modal integration for robotics. - PyTorch & GPU Proficiency: Comfort building complex training loops and scaling them across GPU clusters. Nice to Have - Experience bridging the "Sim-to-Real" gap and deploying policies trained in simulation onto physical hardware. - Contributions to open-source RL or robotics simulation frameworks (e.g., MolmoAct, ManiSkill). Screening Question (Required) Please answer the following question in your application. Proposals without this answer will not be considered: "What are the top 3 most difficult technical problems you have solved related to simulation or policy training? Describe each in 5 sentences (15 total)." Engagement Details - Type: Contract (with potential for long-term engagement) - Availability: Immediate start preferred If you are passionate about pushing the boundaries of embodied AI through simulation and advanced training methodologies, apply below! Strong candidates will be shortlisted for a personal interview.
$700.00
Fixed-price- ExpertExperience Level
- Remote Job
- Ongoing projectProject Type
Skills and Expertise
Activity on this job
- Proposals:5 to 10
- Last viewed by client:yesterday
- Interviewing:6
- Invites sent:20
- Unanswered invites:10
About the client
- South KoreaSeoul4:04 AM
- $22K total spent12 hires, 8 active
- Tech & ITMid-sized company (10-99 people)
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by