AI Engineer to Build a 100% AI UGC Video Generation Platform

Posted 1 hour ago

Worldwide

Summary

We're looking for an AI / generative video specialist to build a self-hosted platform (running on our own server) that generates fully AI UGC videos, starting from an avatar we create ourselves and producing spoken clips in Italian with correct, properly synchronized lip sync.

We operate in the supplements and beauty sector, so our videos often feature physical products on camera. It is critical that the product label stays intact, legible, and undistorted throughout AI generation: no warped text, garbled logos, or mangled packaging. This is a hard requirement.

We already have active, working accounts on the main services (Seedance, fal.ai, ElevenLabs, and others). The platform must connect to these via API, not rebuild everything from scratch.

Important (what already works): the Italian voice generated by ElevenLabs works very well for us. That part of the pipeline is solid, and we want to keep using it. Each avatar will have its own ElevenLabs voice code, which the platform should use as the starting point for generation.

What the platform must do

  • Maintain an internal library of avatars (5–6, no more) that we create ourselves using Nano Banana (reference images of the characters). Generation should always start from one of these avatars, paired with its corresponding ElevenLabs voice code.
  • Generate multiple videos in parallel, where each subsequent scene starts from the last frame of the previous scene, to maintain visual consistency and character continuity.
  • When a product appears in the scene, the label/packaging must remain accurate and readable (correct text, logo, and design) across every generated frame, including scenes generated from the last frame of the previous one, where drift tends to accumulate.
  • Automatically stitch the individual scenes (roughly 5–6 seconds each) into a single final video.
  • Desired output: full AI videos lasting between 30 and 60 seconds (60 max).
  • Maintain correct Italian dialogue, perfectly synchronized with the lip sync.

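To make the scene arithmetic and the stitching step concrete, here is a minimal sketch, not a required implementation. It splits a 30–60 second target into equal scenes of at most 6 seconds and builds an ffmpeg concat-demuxer command. The function names and file names are hypothetical; it assumes ffmpeg is installed on the server, that all clips share the same codec and resolution (otherwise a re-encode is needed instead of `-c copy`), and that clip paths contain no single quotes.

```python
import math
from pathlib import Path

MAX_SCENE_S = 6.0   # upper bound of the "roughly 5-6 seconds" scene length
MIN_VIDEO_S = 30.0  # desired output range from the brief
MAX_VIDEO_S = 60.0


def plan_scenes(target_s: float) -> list[float]:
    """Split a target duration into equal scenes of at most MAX_SCENE_S each."""
    if not MIN_VIDEO_S <= target_s <= MAX_VIDEO_S:
        raise ValueError("target duration must be between 30 and 60 seconds")
    count = math.ceil(target_s / MAX_SCENE_S)
    return [target_s / count] * count


def concat_list_text(clips: list[Path]) -> str:
    """Build the contents of an ffmpeg concat-demuxer list file (one clip per line)."""
    return "".join(f"file '{clip.as_posix()}'\n" for clip in clips)


def ffmpeg_concat_cmd(list_file: Path, output: Path) -> list[str]:
    """Return the ffmpeg arguments that join the listed clips without re-encoding."""
    return [
        "ffmpeg", "-y",
        "-f", "concat", "-safe", "0",
        "-i", list_file.as_posix(),
        "-c", "copy",
        output.as_posix(),
    ]
```

For example, `plan_scenes(45)` yields 8 scenes of 5.625 seconds each, and the command list can be passed to `subprocess.run` once the list file has been written to disk.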
Critical point: ITALIAN LIP SYNC

We'll say it twice: the lip sync must work in ITALIAN. This is the core problem. So far we have not been able to get a decent result. Working Italian lip sync is the requirement that determines whether this project succeeds. If you don't have concrete experience with Italian lip sync (or with solving model limitations on non-English languages), this project is not for you.

Architecture / integrations

  • Platform hosted on our own server (no closed third-party SaaS).
  • API integration with the services we already use: Seedance, fal.ai, ElevenLabs, and our other active accounts.
  • Orchestrated pipeline: avatar (from internal library) + ElevenLabs Italian voice → parallel scene generation with frame-to-frame continuity → lip sync → automatic editing → export.

We're open to suggestions

We're not locked into a specific technical approach. We're open to any suggestions on tools, models, or architecture that will make the platform work reliably, especially anything that improves Italian lip sync quality and keeps product labels stable. If you know a better way to achieve our goal, tell us.

Deliverables

  • A working platform, installed on our server.
  • Technical documentation (setup, APIs, configuration).
  • A reproducible pipeline: from avatar to final edited video.
  • At least 2–3 test videos demonstrating working Italian lip sync at 30–60 seconds.
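One possible reading of the orchestrated pipeline is sketched below, purely as an illustration. It assumes that "parallel" means several videos are generated concurrently, while the scenes inside one video run in order because each scene needs the previous scene's last frame. Every name here is hypothetical, and `generate_scene` is a placeholder that makes no real calls to Seedance, fal.ai, or ElevenLabs; a real version would do voice generation, video generation, and lip sync at that point.

```python
import asyncio
from dataclasses import dataclass


@dataclass(frozen=True)
class Avatar:
    name: str
    reference_image: str  # path to the avatar's reference image
    voice_id: str         # the avatar's ElevenLabs voice code


@dataclass(frozen=True)
class Scene:
    index: int
    start_frame: str
    last_frame: str


async def generate_scene(index: int, start_frame: str, line: str, voice_id: str) -> Scene:
    """Placeholder for the real voice, video, and lip-sync API calls."""
    await asyncio.sleep(0)  # stands in for network latency
    return Scene(index, start_frame, last_frame=f"{start_frame}>s{index}")


async def generate_video(avatar: Avatar, script: list[str]) -> list[Scene]:
    """Generate scenes in order; each one starts from the previous last frame."""
    scenes: list[Scene] = []
    frame = avatar.reference_image
    for index, line in enumerate(script):
        scene = await generate_scene(index, frame, line, avatar.voice_id)
        scenes.append(scene)
        frame = scene.last_frame
    return scenes


async def generate_batch(
    jobs: list[tuple[Avatar, list[str]]], max_parallel: int = 3
) -> list[list[Scene]]:
    """Run several videos concurrently, capped at max_parallel at a time."""
    limit = asyncio.Semaphore(max_parallel)

    async def run(avatar: Avatar, script: list[str]) -> list[Scene]:
        async with limit:
            return await generate_video(avatar, script)

    return await asyncio.gather(*(run(avatar, script) for avatar, script in jobs))
```

The semaphore is a design choice to stay within provider rate limits; the cap of 3 is an arbitrary illustrative value. Results come back in the same order as the submitted jobs, so each scene list can be handed to the stitching step afterwards.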

  • $1,000.00 (fixed-price)
  • Experience level: Expert
  • Remote job
  • Project type: Ongoing project

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Skills and Expertise
Mandatory skills
AI-Generated Video
Artificial Intelligence
Activity on this job
  • Proposals: 20 to 50
  • Interviewing: 0
  • Invites sent: 0
  • Unanswered invites: 0
About the client
Member since Apr 22, 2020
  • Italy
    Torino, 11:26 AM
  • $8.3K total spent
    16 hires, 5 active
  • 261 hours
  • Health & Fitness
    Small company (2-9 people)
