Garrett Goodman’s Post

View profile for Garrett Goodman, graphic

Leaping language barriers with GenAI 🦾 Unlocking global audiences 🌐 Localizing TV and Film

LLM-powered dubbing. It's a mouthful. And a jaw-dropper 🤯 Here's just one example 👇 Context Aware Scene Translation (CAST) So before even starting a translation, we'll use LLMs to fully analyse a script. Things like: What's the vibe, what are the key topics, and who is the intended audience for the translation? Then we'll pass specific instructions to the translation engine. I.e. Jane is passionately explaining to John about how trees can talk. The mood is hopeful. Jane is off-screen, so translate for emotional intensity. John is on-screen, so translate for timing and lipsync. (NB: our prompts don't actually read like that. That bit is our secret sauce 🤫) The result? We compared a set of videos with CAST to the best-in-breed machine translation engines from Big Tech. CAST delivered 38% improvements in translation accuracy and 56% boosts in audio-visual alignment. What that means... We used to say that our professional translators and dubbing directors were there to fine tune the last mile. Now it's just the last 100 meters 🏃♀️💨

View organization page for Papercup, graphic

9,127 followers

Tackling challenges with LLM-powered dubbing at Papercup. Our LLM-powered dubbing is a game-changer, but every tech has its hurdles. Here’s how we’re tackling them: 1️⃣ Hallucinations: LLMs sometimes make up information. We counter this with prompt engineering and human review to ensure translations stay true to the source. 2️⃣ Consistency: Keeping terminology and character voices consistent in long-form content is tough. We use memory mechanisms, glossaries, and human oversight to maintain continuity. 3️⃣ Specialized Content: For niche industries, we fine-tune models on domain-specific data and consult with experts to get it right. 4️⃣ Emotional Nuance: AI struggles with capturing emotional shifts. We’re improving this with sentiment analysis and human touch for the final polish. 5️⃣ Ethical Considerations: We’re transparent with data use, address biases, and see our AI as a tool to enhance, not replace, human creativity. By acknowledging these challenges, we’re constantly refining our system to push the boundaries of AI dubbing while ensuring top-notch quality! https://lnkd.in/epbHCgix #AI #LLMs #Dubbing #LargeLanguageModels

Large language models: revolutionizing AI dubbing and beyond

Large language models: revolutionizing AI dubbing and beyond

papercup.com

Jim Louderback

Creator Economy Sherpa | Award Winning Curator, Moderator & Speaker | "Inside the Creator Economy" Newsletter | Board of Director | Geek

2mo

Wow, very cool. Glad to see you adding this to your repertoire, and interested in hearing what you learn and how you adapt!

Brett Snelgrove

Global Head of Content at Copyright Capital | Social & Video Content Strategic Leader

2mo

Those five points of the what AI dubbing can sometimes strugggle with are on point 👏 . The most impressive thing I am seeing at the moment is not the technology itself but how people are using it to build new innovative solutions. #HumanInTheLoop

See more comments

To view or add a comment, sign in

Explore topics