API Documentation
Upload
Pre-recorded endpoints
Live endpoints
Live WS actions
Live WS messages
- Realtime messages
- Post-processing messages
- Acknowledgment messages
- Lifecycle messages
Live callbacks
- Realtime messages
- Post-processing messages
- Acknowledgment messages
- Lifecycle messages
Live webhooks
- Lifecycle messages
Final transcript
Payload definition for the message post_final_transcript
.
Id of the live session
Date of creation of the message. The date is formatted as an ISO 8601 string
post_final_transcript
The message data
Metadata for the given transcription & audio file
Duration of the transcribed audio file
Number of distinct channels in the transcribed audio file
x > 1
Billed duration in seconds (audio_duration * number_of_distinct_channels)
Duration of the transcription in seconds
Transcription of the audio speech
All transcription on text format without any other information
af
, sq
, am
, ar
, hy
, as
, ast
, az
, ba
, eu
, be
, bn
, bs
, br
, bg
, my
, ca
, ceb
, zh
, hr
, cs
, da
, nl
, en
, et
, fo
, fi
, fr
, fy
, ff
, gd
, gl
, lg
, ka
, de
, el
, gu
, ht
, ha
, haw
, he
, hi
, hu
, is
, ig
, ilo
, id
, ga
, it
, ja
, jv
, kn
, kk
, km
, ko
, lo
, la
, lv
, ln
, lt
, lb
, mk
, mg
, ms
, ml
, mt
, mi
, mr
, mo
, mn
, mymr
, ne
, no
, nn
, oc
, or
, pa
, ps
, fa
, pl
, pt
, pa
, ro
, ru
, sa
, sr
, sn
, sd
, si
, sk
, sl
, so
, es
, su
, sw
, ss
, sv
, tl
, tg
, ta
, tt
, te
, th
, bo
, tn
, tr
, tk
, uk
, ur
, uz
, vi
, cy
, wo
, xh
, yi
, yo
, zu
Transcribed speech utterances present in the audio
All the detected languages in the audio sorted from the most detected to the less detected
af
, sq
, am
, ar
, hy
, as
, ast
, az
, ba
, eu
, be
, bn
, bs
, br
, bg
, my
, ca
, ceb
, zh
, hr
, cs
, da
, nl
, en
, et
, fo
, fi
, fr
, fy
, ff
, gd
, gl
, lg
, ka
, de
, el
, gu
, ht
, ha
, haw
, he
, hi
, hu
, is
, ig
, ilo
, id
, ga
, it
, ja
, jv
, kn
, kk
, km
, ko
, lo
, la
, lv
, ln
, lt
, lb
, mk
, mg
, ms
, ml
, mt
, mi
, mr
, mo
, mn
, mymr
, ne
, no
, nn
, oc
, or
, pa
, ps
, fa
, pl
, pt
, pa
, ro
, ru
, sa
, sr
, sn
, sd
, si
, sk
, sl
, so
, es
, su
, sw
, ss
, sv
, tl
, tg
, ta
, tt
, te
, th
, bo
, tn
, tr
, tk
, uk
, ur
, uz
, vi
, cy
, wo
, xh
, yi
, yo
, zu
Start timestamp in seconds of this utterance
End timestamp in seconds of this utterance
Confidence on the transcribed utterance (1 = 100% confident)
Audio channel of where this utterance has been transcribed from
x > 0
If diarization
enabled, speaker identification number
x > 0
List of words of the utterance, split by timestamp
Spoken word
Start timestamps in seconds of the spoken word
End timestamps in seconds of the spoken word
Confidence on the transcribed word (1 = 100% confident)
Transcription for this utterance
If sentences
has been enabled, sentences results
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If sentences
has been enabled, transcription as sentences.
If subtitles
has been enabled, subtitles results
If translation
has been enabled, translation of the audio speech transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
List of translated transcriptions, one for each target_languages
Contains the error details of the failed addon
All transcription on text format without any other information
af
, sq
, am
, ar
, hy
, as
, ast
, az
, ba
, eu
, be
, bn
, bs
, br
, bg
, my
, ca
, ceb
, zh
, hr
, cs
, da
, nl
, en
, et
, fo
, fi
, fr
, fy
, ff
, gd
, gl
, lg
, ka
, de
, el
, gu
, ht
, ha
, haw
, he
, hi
, hu
, is
, ig
, ilo
, id
, ga
, it
, ja
, jv
, kn
, kk
, km
, ko
, lo
, la
, lv
, ln
, lt
, lb
, mk
, mg
, ms
, ml
, mt
, mi
, mr
, mo
, mn
, mymr
, ne
, no
, nn
, oc
, or
, pa
, ps
, fa
, pl
, pt
, pa
, ro
, ru
, sa
, sr
, sn
, sd
, si
, sk
, sl
, so
, es
, su
, sw
, ss
, sv
, tl
, tg
, ta
, tt
, te
, th
, bo
, tn
, tr
, tk
, uk
, ur
, uz
, vi
, cy
, wo
, xh
, yi
, yo
, zu
If sentences
has been enabled, sentences results for this translation
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If sentences
has been enabled, transcription as sentences.
If subtitles
has been enabled, subtitles results for this translation
Transcribed speech utterances present in the audio
All the detected languages in the audio sorted from the most detected to the less detected
af
, sq
, am
, ar
, hy
, as
, ast
, az
, ba
, eu
, be
, bn
, bs
, br
, bg
, my
, ca
, ceb
, zh
, hr
, cs
, da
, nl
, en
, et
, fo
, fi
, fr
, fy
, ff
, gd
, gl
, lg
, ka
, de
, el
, gu
, ht
, ha
, haw
, he
, hi
, hu
, is
, ig
, ilo
, id
, ga
, it
, ja
, jv
, kn
, kk
, km
, ko
, lo
, la
, lv
, ln
, lt
, lb
, mk
, mg
, ms
, ml
, mt
, mi
, mr
, mo
, mn
, mymr
, ne
, no
, nn
, oc
, or
, pa
, ps
, fa
, pl
, pt
, pa
, ro
, ru
, sa
, sr
, sn
, sd
, si
, sk
, sl
, so
, es
, su
, sw
, ss
, sv
, tl
, tg
, ta
, tt
, te
, th
, bo
, tn
, tr
, tk
, uk
, ur
, uz
, vi
, cy
, wo
, xh
, yi
, yo
, zu
Start timestamp in seconds of this utterance
End timestamp in seconds of this utterance
Confidence on the transcribed utterance (1 = 100% confident)
Audio channel of where this utterance has been transcribed from
x > 0
If diarization
enabled, speaker identification number
x > 0
List of words of the utterance, split by timestamp
Transcription for this utterance
If summarization
has been enabled, summarization of the audio speech transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If summarization
has been enabled, summary of the transcription
If moderation
has been enabled, moderation of the audio speech transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If moderation
has been enabled, moderated transcription
If named_entity_recognition
has been enabled, the detected entities
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If named_entity_recognition
has been enabled, the detected entities.
If name_consistency
has been enabled, Gladia will improve consistency of the names accross the transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If name_consistency
has been enabled, Gladia will improve the consistency of the names across the transcription
If custom_spelling
has been enabled, Gladia will correct the spelling of the transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If custom_spelling
has been enabled, Gladia will correct the spelling of the transcription
If structured_data_extraction
has been enabled, structured data extraction results
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If structured_data_extraction
has been enabled, results of the AI structured data extraction for the defined classes.
If sentiment_analysis
has been enabled, sentiment analysis of the audio speech transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If sentiment_analysis
has been enabled, Gladia will analyze the sentiments and emotions of the audio
If audio_to_llm
has been enabled, audio to llm results of the audio speech transcription
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If audio_to_llm
has been enabled, results of the AI custom analysis
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If sentences
has been enabled, sentences of the audio speech transcription. Deprecated: content will move to the transcription
object.
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
If sentences
has been enabled, transcription as sentences.
If display_mode
has been enabled, the output will be reordered, creating new utterances when speakers overlapped
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null
if success
is true
. Contains the error details of the failed model
If display_mode
has been enabled, proposes an alternative display output.
If chapterization
has been enabled, will generate chapters name for different parts of the given audio.
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
If chapterization
has been enabled, will generate chapters name for different parts of the given audio.
Was this page helpful?