2 translation models are available:
base
: Fast, cover most use casesenhanced
: Slower, but higher quality and with context awareness
Quickstart
To enable translation, settranslation
to true
on your request, and add a translation_config
object :
Translation configuration fields
Target language codes for translation output. See the list of supported
language codes in >Supported
Languages.
Specifies the translation model to be used.
Keep translated segments aligned with source segmentation. Use
true
for subtitles/dubbing; set false
for a more natural flow in the target language.- When true, the system attempts to match the translated segments (utterances, sentences) to the timing and structure of the original detected speech segments.
- When false, the translation might be more fluid or natural-sounding in the target language but could deviate from the original utterance segmentation.
Controls alignment with visual cues, specifically lip movements. When enabled
(default), uses an advanced lip synchronization algorithm that aligns
translated output with speaker’s lip movements using timestamps from lip
activity. \ This enhances viewing experience for dubbed content but may
occasionally merge distinct words into single objects to achieve better visual
sync. Set to
false
if strict word-for-word mapping is required over visual
timing synchronization.Enable context-aware translation. When
true
, the model leverages extra
context and style preferences for better accuracy. Turn off for purely literal
translations.Additional context to improve terminology, proper nouns, or disambiguation.
Effective with
Example:
context_adaptation: true
. Example:
"Medical consultation between doctor and patient discussing cardiology"
Prefer informal register when available; useful for chatty UX or youth
audiences. Especially relevant for languages with formal/informal distinctions
(e.g., French “tu/vous”, German “du/Sie”, Spanish “tú/usted”, Dutch “U/jij”).
Result
The transcription result will contain a"translation"
key with the output of the model:
If you enabled the
subtitles
generation, those will also benefits from the
translation model.Best practices
- Set
target_languages
to only the languages you need. - Use
enhanced
withcontext_adaptation
for high-accuracy, domain-heavy content. - Provide a meaningful
context
to improve terminology and named entities. - Keep
match_original_utterances: true
for subtitles; set tofalse
for a more natural flow. - Pair with language detection and code switching when source language may vary.