Final transcript

session_id

string

required

Id of the live session

Example:

"4a39145c-2844-4557-8f34-34883f7be7d9"

created_at

string

required

Date of creation of the message. The date is formatted as an ISO 8601 string

Example:

"2021-09-01T12:00:00.123Z"

type

enum<string>

default:post_final_transcript

required

Available options:

post_final_transcript

Example:

"post_final_transcript"

data

object

required

The message data

Show child attributes

data.metadata

object

required

Metadata for the given transcription & audio file

Show child attributes

data.metadata.audio_duration

number

required

Duration of the transcribed audio file

Example:

3600

data.metadata.number_of_distinct_channels

integer

required

Number of distinct channels in the transcribed audio file

Required range: x >= 1

Example:

1

data.metadata.billing_time

number

required

Billed duration in seconds (audio_duration * number_of_distinct_channels)

Example:

3600

data.metadata.transcription_time

number

required

Duration of the transcription in seconds

Example:

20

data.transcription

object

Transcription of the audio speech

Show child attributes

data.transcription.full_transcript

string

required

All transcription on text format without any other information

data.transcription.languages

enum<string>[]

required

All the detected languages in the audio sorted from the most detected to the less detected

If one language is set, it will be used for the transcription. Otherwise, language will be auto-detected by the model.

Available options:

af,

am,

ar,

as,

az,

ba,

be,

bg,

bn,

bo,

br,

bs,

ca,

cs,

cy,

da,

de,

el,

en,

es,

et,

eu,

fa,

fi,

fo,

fr,

gl,

gu,

ha,

haw,

he,

hi,

hr,

ht,

hu,

hy,

id,

is,

it,

ja,

jw,

ka,

kk,

km,

kn,

ko,

la,

lb,

ln,

lo,

lt,

lv,

mg,

mi,

mk,

ml,

mn,

mr,

ms,

mt,

my,

ne,

nl,

nn,

no,

oc,

pa,

pl,

ps,

pt,

ro,

ru,

sa,

sd,

si,

sk,

sl,

sn,

so,

sq,

sr,

su,

sv,

sw,

ta,

te,

tg,

th,

tk,

tl,

tr,

tt,

uk,

ur,

uz,

vi,

yi,

yo,

zh

Example:

["en"]

data.transcription.utterances

object[]

required

Transcribed speech utterances present in the audio

Show child attributes

data.transcription.utterances.start

number

required

Start timestamp in seconds of this utterance

data.transcription.utterances.end

number

required

End timestamp in seconds of this utterance

data.transcription.utterances.confidence

number

required

Confidence on the transcribed utterance (1 = 100% confident)

data.transcription.utterances.channel

integer

required

Audio channel of where this utterance has been transcribed from

Required range: x >= 0

data.transcription.utterances.words

object[]

required

List of words of the utterance, split by timestamp

Show child attributes

data.transcription.utterances.words.word

string

required

Spoken word

data.transcription.utterances.words.start

number

required

Start timestamps in seconds of the spoken word

data.transcription.utterances.words.end

number

required

End timestamps in seconds of the spoken word

data.transcription.utterances.words.confidence

number

required

Confidence on the transcribed word (1 = 100% confident)

data.transcription.utterances.text

string

required

Transcription for this utterance

data.transcription.utterances.language

enum<string>

required

Spoken language in this utterance

Available options:

af,

am,

ar,

as,

az,

ba,

be,

bg,

bn,

bo,

br,

bs,

ca,

cs,

cy,

da,

de,

el,

en,

es,

et,

eu,

fa,

fi,

fo,

fr,

gl,

gu,

ha,

haw,

he,

hi,

hr,

ht,

hu,

hy,

id,

is,

it,

ja,

jw,

ka,

kk,

km,

kn,

ko,

la,

lb,

ln,

lo,

lt,

lv,

mg,

mi,

mk,

ml,

mn,

mr,

ms,

mt,

my,

ne,

nl,

nn,

no,

oc,

pa,

pl,

ps,

pt,

ro,

ru,

sa,

sd,

si,

sk,

sl,

sn,

so,

sq,

sr,

su,

sv,

sw,

ta,

te,

tg,

th,

tk,

tl,

tr,

tt,

uk,

ur,

uz,

vi,

yi,

yo,

zh

Example:

"en"

data.transcription.utterances.speaker

integer

If diarization enabled, speaker identification number

Required range: x >= 0

data.transcription.sentences

object[]

If sentences has been enabled, sentences results

Show child attributes

data.transcription.sentences.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.transcription.sentences.is_empty

boolean

required

The audio intelligence model returned an empty value

data.transcription.sentences.exec_time

number

required

Time audio intelligence model took to complete the task

data.transcription.sentences.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.transcription.sentences.error.status_code

integer

required

Status code of the addon error

Example:

500

data.transcription.sentences.error.exception

string

required

Reason of the addon error

data.transcription.sentences.error.message

string

required

Detailed message of the addon error

data.transcription.sentences.results

string[] | null

required

If sentences has been enabled, transcription as sentences.

data.transcription.subtitles

object[]

If subtitles has been enabled, subtitles results

Show child attributes

data.transcription.subtitles.format

enum<string>

required

Format of the current subtitle

Available options:

srt,

vtt

Example:

"srt"

data.transcription.subtitles.subtitles

string

required

Transcription on the asked subtitle format

data.translation

object

If translation has been enabled, translation of the audio speech transcription

Show child attributes

data.translation.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.translation.is_empty

boolean

required

The audio intelligence model returned an empty value

data.translation.exec_time

number

required

Time audio intelligence model took to complete the task

data.translation.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.translation.error.status_code

integer

required

Status code of the addon error

Example:

500

data.translation.error.exception

string

required

Reason of the addon error

data.translation.error.message

string

required

Detailed message of the addon error

data.translation.results

object[] | null

required

List of translated transcriptions, one for each target_languages

Show child attributes

data.translation.results.error

object

required

Contains the error details of the failed addon

Show child attributes

data.translation.results.error.status_code

integer

required

Status code of the addon error

Example:

500

data.translation.results.error.exception

string

required

Reason of the addon error

data.translation.results.error.message

string

required

Detailed message of the addon error

data.translation.results.full_transcript

string

required

All transcription on text format without any other information

data.translation.results.languages

enum<string>[]

required

All the detected languages in the audio sorted from the most detected to the less detected

Target language in iso639-1 format you want the transcription translated to

Available options:

af,

am,

ar,

as,

az,

ba,

be,

bg,

bn,

bo,

br,

bs,

ca,

cs,

cy,

da,

de,

el,

en,

es,

et,

eu,

fa,

fi,

fo,

fr,

gl,

gu,

ha,

haw,

he,

hi,

hr,

ht,

hu,

hy,

id,

is,

it,

ja,

jw,

ka,

kk,

km,

kn,

ko,

la,

lb,

ln,

lo,

lt,

lv,

mg,

mi,

mk,

ml,

mn,

mr,

ms,

mt,

my,

ne,

nl,

nn,

no,

oc,

pa,

pl,

ps,

pt,

ro,

ru,

sa,

sd,

si,

sk,

sl,

sn,

so,

sq,

sr,

su,

sv,

sw,

ta,

te,

tg,

th,

tk,

tl,

tr,

tt,

uk,

ur,

uz,

vi,

wo,

yi,

yo,

zh

Example:

["en"]

data.translation.results.utterances

object[]

required

Transcribed speech utterances present in the audio

Show child attributes

data.translation.results.utterances.start

number

required

Start timestamp in seconds of this utterance

data.translation.results.utterances.end

number

required

End timestamp in seconds of this utterance

data.translation.results.utterances.confidence

number

required

Confidence on the transcribed utterance (1 = 100% confident)

data.translation.results.utterances.channel

integer

required

Audio channel of where this utterance has been transcribed from

Required range: x >= 0

data.translation.results.utterances.words

object[]

required

List of words of the utterance, split by timestamp

Show child attributes

data.translation.results.utterances.words.word

string

required

Spoken word

data.translation.results.utterances.words.start

number

required

Start timestamps in seconds of the spoken word

data.translation.results.utterances.words.end

number

required

End timestamps in seconds of the spoken word

data.translation.results.utterances.words.confidence

number

required

Confidence on the transcribed word (1 = 100% confident)

data.translation.results.utterances.text

string

required

Transcription for this utterance

data.translation.results.utterances.language

enum<string>

required

Spoken language in this utterance

Available options:

af,

am,

ar,

as,

az,

ba,

be,

bg,

bn,

bo,

br,

bs,

ca,

cs,

cy,

da,

de,

el,

en,

es,

et,

eu,

fa,

fi,

fo,

fr,

gl,

gu,

ha,

haw,

he,

hi,

hr,

ht,

hu,

hy,

id,

is,

it,

ja,

jw,

ka,

kk,

km,

kn,

ko,

la,

lb,

ln,

lo,

lt,

lv,

mg,

mi,

mk,

ml,

mn,

mr,

ms,

mt,

my,

ne,

nl,

nn,

no,

oc,

pa,

pl,

ps,

pt,

ro,

ru,

sa,

sd,

si,

sk,

sl,

sn,

so,

sq,

sr,

su,

sv,

sw,

ta,

te,

tg,

th,

tk,

tl,

tr,

tt,

uk,

ur,

uz,

vi,

yi,

yo,

zh

Example:

"en"

data.translation.results.utterances.speaker

integer

If diarization enabled, speaker identification number

Required range: x >= 0

data.translation.results.sentences

object[]

If sentences has been enabled, sentences results for this translation

Show child attributes

data.translation.results.sentences.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.translation.results.sentences.is_empty

boolean

required

The audio intelligence model returned an empty value

data.translation.results.sentences.exec_time

number

required

Time audio intelligence model took to complete the task

data.translation.results.sentences.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.translation.results.sentences.error.status_code

integer

required

Status code of the addon error

Example:

500

data.translation.results.sentences.error.exception

string

required

Reason of the addon error

data.translation.results.sentences.error.message

string

required

Detailed message of the addon error

data.translation.results.sentences.results

string[] | null

required

If sentences has been enabled, transcription as sentences.

data.translation.results.subtitles

object[]

If subtitles has been enabled, subtitles results for this translation

Show child attributes

data.translation.results.subtitles.format

enum<string>

required

Format of the current subtitle

Available options:

srt,

vtt

Example:

"srt"

data.translation.results.subtitles.subtitles

string

required

Transcription on the asked subtitle format

data.summarization

object

If summarization has been enabled, summarization of the audio speech transcription

Show child attributes

data.summarization.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.summarization.is_empty

boolean

required

The audio intelligence model returned an empty value

data.summarization.exec_time

number

required

Time audio intelligence model took to complete the task

data.summarization.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.summarization.error.status_code

integer

required

Status code of the addon error

Example:

500

data.summarization.error.exception

string

required

Reason of the addon error

data.summarization.error.message

string

required

Detailed message of the addon error

data.summarization.results

string | null

required

If summarization has been enabled, summary of the transcription

data.named_entity_recognition

object

If named_entity_recognition has been enabled, the detected entities

Show child attributes

data.named_entity_recognition.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.named_entity_recognition.is_empty

boolean

required

The audio intelligence model returned an empty value

data.named_entity_recognition.exec_time

number

required

Time audio intelligence model took to complete the task

data.named_entity_recognition.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.named_entity_recognition.error.status_code

integer

required

Status code of the addon error

Example:

500

data.named_entity_recognition.error.exception

string

required

Reason of the addon error

data.named_entity_recognition.error.message

string

required

Detailed message of the addon error

data.named_entity_recognition.entity

string

required

If named_entity_recognition has been enabled, the detected entities.

data.sentiment_analysis

object

If sentiment_analysis has been enabled, sentiment analysis of the audio speech transcription

Show child attributes

data.sentiment_analysis.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.sentiment_analysis.is_empty

boolean

required

The audio intelligence model returned an empty value

data.sentiment_analysis.exec_time

number

required

Time audio intelligence model took to complete the task

data.sentiment_analysis.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.sentiment_analysis.error.status_code

integer

required

Status code of the addon error

Example:

500

data.sentiment_analysis.error.exception

string

required

Reason of the addon error

data.sentiment_analysis.error.message

string

required

Detailed message of the addon error

data.sentiment_analysis.results

string

required

If sentiment_analysis has been enabled, Gladia will analyze the sentiments and emotions of the audio

data.chapterization

object

If chapterization has been enabled, will generate chapters name for different parts of the given audio.

Show child attributes

data.chapterization.success

boolean

required

The audio intelligence model succeeded to get a valid output

data.chapterization.is_empty

boolean

required

The audio intelligence model returned an empty value

data.chapterization.exec_time

number

required

Time audio intelligence model took to complete the task

data.chapterization.error

object

required

null if success is true. Contains the error details of the failed model

Show child attributes

data.chapterization.error.status_code

integer

required

Status code of the addon error

Example:

500

data.chapterization.error.exception

string

required

Reason of the addon error

data.chapterization.error.message

string

required

Detailed message of the addon error

data.chapterization.results

object

required

If chapterization has been enabled, will generate chapters name for different parts of the given audio.

API Documentation

Live endpoints

Pre-recorded endpoints

Transcription (deprecated)

Final transcript