POST
/
v2
/
transcription

Authorizations

x-gladia-key
string
headerrequired

Your personal Gladia API key

Body

application/json
audio_url
string
required

URL to a Gladia file or to an external audio or video file

context_prompt
string

[Alpha] Context to feed the transcription model with for possible better accuracy

custom_vocabulary
string[]

[Alpha] Specific vocabulary list to feed the transcription model with

detect_language
boolean
default: true

Detect the language from the given audio

enable_code_switching
boolean
default: false

Detect multiple languages in the given audio

code_switching_config
object

Specify the configuration for code switching

language
enum<string>

Set the spoken language for the given audio (ISO 639 standard)

Available options:
af,
sq,
am,
ar,
hy,
as,
az,
ba,
eu,
be,
bn,
bs,
br,
bg,
ca,
zh,
hr,
cs,
da,
nl,
en,
et,
fo,
fi,
fr,
gl,
ka,
de,
el,
gu,
ht,
ha,
haw,
he,
hi,
hu,
is,
id,
it,
ja,
jv,
kn,
kk,
km,
ko,
lo,
la,
lv,
ln,
lt,
lb,
mk,
mg,
ms,
ml,
mt,
mi,
mr,
mn,
mymr,
ne,
no,
nn,
oc,
ps,
fa,
pl,
pt,
pa,
ro,
ru,
sa,
sr,
sn,
sd,
si,
sk,
sl,
so,
es,
su,
sw,
sv,
tl,
tg,
ta,
tt,
te,
th,
bo,
tr,
tk,
uk,
ur,
uz,
vi,
cy,
yi,
yo,
jp
callback_url
string

Callback URL we will do a POST request to with the result of the transcription

subtitles
boolean
default: false

Enable subtitles generation for this transcription

subtitles_config
object

Configuration for subtitles generation if subtitles is enabled

diarization
boolean
default: false

Enable speaker recognition (diarization) for this audio

diarization_config
object

Speaker recognition configuration, if diarization is enabled

translation
boolean
default: false

[Beta] Enable translation for this audio

translation_config
object

[Beta] Translation configuration, if translation is enabled

summarization
boolean
default: false

[Beta] Enable summarization for this audio

summarization_config
object

[Beta] Summarization configuration, if summarization is enabled

moderation
boolean
default: false

[Alpha] Enable moderation for this audio

named_entity_recognition
boolean
default: false

[Alpha] Enable named entity recognition for this audio

chapterization
boolean
default: false

[Alpha] Enable chapterization for this audio

name_consistency
boolean
default: false

[Alpha] Enable names consistency for this audio

custom_spelling
boolean
default: false

[Alpha] Enable custom spelling for this audio

custom_spelling_config
object

[Alpha] Custom spelling configuration, if custom_spelling is enabled

structured_data_extraction
boolean
default: false

[Alpha] Enable structured data extraction for this audio

structured_data_extraction_config
object

[Alpha] Structured data extraction configuration, if structured_data_extraction is enabled

sentiment_analysis
boolean
default: false

[Alpha] Enable sentiment analysis for this audio

audio_to_llm
boolean
default: false

[Alpha] Enable audio to llm processing for this audio

audio_to_llm_config
object

[Alpha] Audio to llm configuration, if audio_to_llm is enabled

custom_metadata
object

Custom metadata you can attach to this transcription

sentences
boolean
default: false

Enable sentences for this audio

display_mode
boolean
default: false

[Alpha] Allows to change the output display_mode for this audio. The output will be reordered, creating new utterances when speakers overlapped

diarization_enhanced
boolean
default: false

[Alpha] Use enhanced diarization for this audio

Response

201 - application/json
id
string
required

Id of the job

result_url
string
required

Prebuilt URL with your transcription id to fetch the result