{
"id": "45463597-20b7-4af7-b3b3-f5fb778203ab",
"event": "transcription.success",
"payload": {
"metadata": {
"audio_duration": 3600,
"number_of_distinct_channels": 1,
"billing_time": 3600,
"transcription_time": 20
},
"transcription": {
"full_transcript": "<string>",
"languages": [
"en"
],
"utterances": [
{
"start": 123,
"end": 123,
"confidence": 123,
"channel": 1,
"words": [
{
"word": "<string>",
"start": 123,
"end": 123,
"confidence": 123
}
],
"text": "<string>",
"language": "en",
"speaker": 1
}
],
"sentences": [
{
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
"<string>"
]
}
],
"subtitles": [
{
"format": "srt",
"subtitles": "<string>"
}
]
},
"translation": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
{
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"full_transcript": "<string>",
"languages": [
"en"
],
"utterances": [
{
"start": 123,
"end": 123,
"confidence": 123,
"channel": 1,
"words": [
{
"word": "<string>",
"start": 123,
"end": 123,
"confidence": 123
}
],
"text": "<string>",
"language": "en",
"speaker": 1
}
],
"sentences": [
{
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
"<string>"
]
}
],
"subtitles": [
{
"format": "srt",
"subtitles": "<string>"
}
]
}
]
},
"summarization": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"moderation": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"named_entity_recognition": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"entity": "<string>"
},
"name_consistency": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"speaker_reidentification": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"structured_data_extraction": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"sentiment_analysis": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": "<string>"
},
"audio_to_llm": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
{
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": {
"prompt": "<string>",
"response": "<string>"
}
}
]
},
"sentences": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
"<string>"
]
},
"display_mode": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
"<string>"
]
},
"chapterization": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": {}
},
"diarization": {
"success": true,
"is_empty": true,
"exec_time": 123,
"error": {
"status_code": 500,
"exception": "<string>",
"message": "<string>"
},
"results": [
{
"start": 123,
"end": 123,
"confidence": 123,
"channel": 1,
"words": [
{
"word": "<string>",
"start": 123,
"end": 123,
"confidence": 123
}
],
"text": "<string>",
"language": "en",
"speaker": 1
}
]
}
},
"custom_metadata": {
"user": "John Doe"
}
}Payload definition for the callback event transcription.success.
Id of the job
"45463597-20b7-4af7-b3b3-f5fb778203ab"
Type of event
transcription.success "transcription.success"
Result of the transcription
Show child attributes
Metadata for the given transcription & audio file
Show child attributes
Duration of the transcribed audio file
3600
Number of distinct channels in the transcribed audio file
x >= 11
Billed duration in seconds (audio_duration * number_of_distinct_channels)
3600
Duration of the transcription in seconds
20
Transcription of the audio speech
Show child attributes
All transcription on text format without any other information
All the detected languages in the audio sorted from the most detected to the less detected
If one language is set, it will be used for the transcription. Otherwise, language will be auto-detected by the model.
af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh ["en"]Transcribed speech utterances present in the audio
Show child attributes
Start timestamp in seconds of this utterance
End timestamp in seconds of this utterance
Confidence on the transcribed utterance (1 = 100% confident)
Audio channel of where this utterance has been transcribed from
x >= 0List of words of the utterance, split by timestamp
Show child attributes
Spoken word
Start timestamps in seconds of the spoken word
End timestamps in seconds of the spoken word
Confidence on the transcribed word (1 = 100% confident)
Transcription for this utterance
Spoken language in this utterance
af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh "en"
If diarization enabled, speaker identification number
x >= 0If sentences has been enabled, sentences results
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
If sentences has been enabled, transcription as sentences.
If translation has been enabled, translation of the audio speech transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
List of translated transcriptions, one for each target_languages
Show child attributes
Contains the error details of the failed addon
All transcription on text format without any other information
All the detected languages in the audio sorted from the most detected to the less detected
Target language in iso639-1 format you want the transcription translated to
af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, wo, yi, yo, zh ["en"]Transcribed speech utterances present in the audio
Show child attributes
Start timestamp in seconds of this utterance
End timestamp in seconds of this utterance
Confidence on the transcribed utterance (1 = 100% confident)
Audio channel of where this utterance has been transcribed from
x >= 0List of words of the utterance, split by timestamp
Show child attributes
Spoken word
Start timestamps in seconds of the spoken word
End timestamps in seconds of the spoken word
Confidence on the transcribed word (1 = 100% confident)
Transcription for this utterance
Spoken language in this utterance
af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh "en"
If diarization enabled, speaker identification number
x >= 0If sentences has been enabled, sentences results for this translation
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
Status code of the addon error
500
Reason of the addon error
Detailed message of the addon error
If sentences has been enabled, transcription as sentences.
If subtitles has been enabled, subtitles results for this translation
If summarization has been enabled, summarization of the audio speech transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If summarization has been enabled, summary of the transcription
If moderation has been enabled, moderation of the audio speech transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If moderation has been enabled, moderated transcription
If named_entity_recognition has been enabled, the detected entities
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
If named_entity_recognition has been enabled, the detected entities.
If name_consistency has been enabled, Gladia will improve consistency of the names accross the transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If name_consistency has been enabled, Gladia will improve the consistency of the names across the transcription
If speaker_reidentification has been enabled, results of the AI speaker reidentification.
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
If speaker_reidentification has been enabled, results of the AI speaker reidentification.
If structured_data_extraction has been enabled, structured data extraction results
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
If structured_data_extraction has been enabled, results of the AI structured data extraction for the defined classes.
If sentiment_analysis has been enabled, sentiment analysis of the audio speech transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If sentiment_analysis has been enabled, Gladia will analyze the sentiments and emotions of the audio
If audio_to_llm has been enabled, audio to llm results of the audio speech transcription
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If audio_to_llm has been enabled, results of the AI custom analysis
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
Show child attributes
If sentences has been enabled, sentences of the audio speech transcription. Deprecated: content will move to the transcription object.
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If sentences has been enabled, transcription as sentences.
If display_mode has been enabled, the output will be reordered, creating new utterances when speakers overlapped
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If display_mode has been enabled, proposes an alternative display output.
If chapterization has been enabled, will generate chapters name for different parts of the given audio.
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
If chapterization has been enabled, will generate chapters name for different parts of the given audio.
If diarization has been requested and an error has occurred, the result will appear here
Show child attributes
The audio intelligence model succeeded to get a valid output
The audio intelligence model returned an empty value
Time audio intelligence model took to complete the task
null if success is true. Contains the error details of the failed model
[Deprecated] If diarization has been enabled, the diarization result will appear here
Show child attributes
Start timestamp in seconds of this utterance
End timestamp in seconds of this utterance
Confidence on the transcribed utterance (1 = 100% confident)
Audio channel of where this utterance has been transcribed from
x >= 0List of words of the utterance, split by timestamp
Show child attributes
Spoken word
Start timestamps in seconds of the spoken word
End timestamps in seconds of the spoken word
Confidence on the transcribed word (1 = 100% confident)
Transcription for this utterance
Spoken language in this utterance
af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh "en"
If diarization enabled, speaker identification number
x >= 0Custom metadata given in the initial request
{ "user": "John Doe" }