What is Audio Summarization
Audio summarization API is a tool that allows users to summarize audio content automatically using artificial intelligence and machine learning algorithms.
The process involves analyzing the audio input to identify key topics and themes and extracting relevant audio segments that best represent these topics. The output is a condensed version of the audio content that captures the most critical information while reducing the overall length of the audio file.
Audio summarization APIs can be used in various applications, including podcasting, audio news briefs, and audio transcription services. They can also be used in education, healthcare, and marketing industries to quickly and efficiently summarize audio content for research, analysis, and decision-making.
Some audio summarization APIs also offer additional features such as language detection, speaker identification, and keyword extraction, which can further enhance the accuracy and usefulness of the summary.
Activate the Audio Summarization
In the app
In the app Toggle the summarization using the checkbox

Using the API
curl -X 'POST' \
'https://api.gladia.io/audio/text/audio-transcription/' \
-H 'accept: application/json' \
-H 'x-gladia-key: XXXXXXXXXXXXXXX' \
-H 'Content-Type: multipart/form-data' \
-F "audio_url=http://files.gladia.io/example/audio-transcription/split_infinity.wav" \
-F "output_format=json" \
-F "toggle_summarization=true"
Output Format | Expected behavior | Example |
---|---|---|
json (default) | transcript is in "prediction" summary is in "prediction_raw.summarization" | { "prediction": [{...}], "prediction_raw": { "metadata": {...}, "transcription": [{...}], "summarization": "This message of hope encourages us to look to the past for guidance as we create a better future for ourselves despite difficult times." } } |
plain | Only summary the summary is returned in plain text | "Our present circumstances are unpredictable, but we can gain insight into the future by reflecting on our past experiences. Our actions will ultimately determine the outcome of the present." |
txt | transcript is in "prediction" summary is in "prediction_raw.summarization" | { "prediction": ..., "prediction_raw": { "metadata": {...}, "transcription": [{...}], "summarization": "This message of hope encourages us to look to the past for guidance as we create a better future for ourselves despite difficult times." } } |
vtt | transcript in VTT format is in "prediction" transcript in original format is in "prediction_raw.transcription" summary is in "prediction_raw.summarization" | { "prediction": "VTT transcript"..., "prediction_raw": { "metadata": {...}, "transcription": [{...}], "summarization": "This message of hope encourages us to look to the past for guidance as we create a better future for ourselves despite difficult times." } } |
srt | transcript in. SRT is in "prediction" transcript in original format is in "prediction_raw.transcription" summary is in "prediction_raw.summarization" | { "prediction": "SRT transcript", "prediction_raw": { "metadata": {...}, "transcription": [{...}], "summarization": "This message of hope encourages us to look to the past for guidance as we create a better future for ourselves despite difficult times." } } |
Full Examples
Plain output
"Hope remains despite the current minimalist atmosphere, as the past can guide us towards the future. Ultimately, it is up to us to shape the present."
JSON output
{
"prediction": [
{
"time_begin": 0.09,
"time_end": 2.07,
"transcription": "Split infinity",
"language": "en",
"confidence": 0.49,
"words": [
{
"word": " Split",
"begin": 0.09,
"end": 0.75,
"confidence": 0.7
},
{
"word": " infinity.",
"begin": 0.75,
"end": 1.47,
"confidence": 0.4
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 2.13,
"time_end": 5.19,
"transcription": "in a time when less is more",
"language": "en",
"confidence": 0.65,
"words": [
{
"word": " in",
"begin": 2.13,
"end": 2.65,
"confidence": 0.59
},
{
"word": " a",
"begin": 2.65,
"end": 2.87,
"confidence": 0.99
},
{
"word": " time",
"begin": 2.87,
"end": 3.33,
"confidence": 0.82
},
{
"word": " when",
"begin": 3.33,
"end": 3.79,
"confidence": 0.86
},
{
"word": " less",
"begin": 3.79,
"end": 4.07,
"confidence": 0.87
},
{
"word": " is",
"begin": 4.07,
"end": 4.41,
"confidence": 0.91
},
{
"word": " more.",
"begin": 4.41,
"end": 4.6899999999999995,
"confidence": 0.88
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 5.52,
"time_end": 20.4,
"transcription": "Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"language": "en",
"confidence": 0.75,
"words": [
{
"word": " Where",
"begin": 5.52,
"end": 5.76,
"confidence": 0.51
},
{
"word": " too",
"begin": 5.76,
"end": 6.1,
"confidence": 0.8
},
{
"word": " much",
"begin": 6.1,
"end": 6.4799999999999995,
"confidence": 0.81
},
{
"word": " is",
"begin": 6.4799999999999995,
"end": 6.92,
"confidence": 0.9
},
{
"word": " never",
"begin": 6.92,
"end": 7.26,
"confidence": 0.88
},
{
"word": " enough,",
"begin": 7.26,
"end": 7.819999999999999,
"confidence": 0.77
},
{
"word": " there",
"begin": 8.62,
"end": 8.7,
"confidence": 0.81
},
{
"word": " is",
"begin": 8.7,
"end": 8.959999999999999,
"confidence": 0.84
},
{
"word": " always",
"begin": 8.959999999999999,
"end": 9.459999999999999,
"confidence": 0.74
},
{
"word": " hope",
"begin": 9.459999999999999,
"end": 9.8,
"confidence": 0.83
},
{
"word": " for",
"begin": 9.8,
"end": 10.12,
"confidence": 0.9
},
{
"word": " the",
"begin": 10.12,
"end": 10.32,
"confidence": 0.82
},
{
"word": " future.",
"begin": 10.32,
"end": 10.76,
"confidence": 0.93
},
{
"word": " The",
"begin": 11.8,
"end": 11.899999999999999,
"confidence": 0.82
},
{
"word": " future",
"begin": 11.899999999999999,
"end": 12.219999999999999,
"confidence": 0.94
},
{
"word": " can",
"begin": 12.219999999999999,
"end": 12.6,
"confidence": 0.9
},
{
"word": " be",
"begin": 12.6,
"end": 12.86,
"confidence": 0.91
},
{
"word": " read",
"begin": 12.86,
"end": 13.059999999999999,
"confidence": 0.9
},
{
"word": " from",
"begin": 13.059999999999999,
"end": 13.34,
"confidence": 0.82
},
{
"word": " the",
"begin": 13.34,
"end": 13.559999999999999,
"confidence": 0.82
},
{
"word": " past.",
"begin": 13.559999999999999,
"end": 14.139999999999999,
"confidence": 0.81
},
{
"word": " The",
"begin": 14.68,
"end": 14.78,
"confidence": 0.81
},
{
"word": " past",
"begin": 14.78,
"end": 15.34,
"confidence": 0.82
},
{
"word": " foreshadows",
"begin": 15.34,
"end": 16.119999999999997,
"confidence": 0.89
},
{
"word": " the",
"begin": 16.119999999999997,
"end": 16.46,
"confidence": 0.81
},
{
"word": " present,",
"begin": 16.46,
"end": 17.02,
"confidence": 0.8
},
{
"word": " and",
"begin": 17.439999999999998,
"end": 17.72,
"confidence": 0.89
},
{
"word": " the",
"begin": 17.72,
"end": 17.939999999999998,
"confidence": 0.82
},
{
"word": " present",
"begin": 17.939999999999998,
"end": 18.38,
"confidence": 0.8
},
{
"word": " hasn't",
"begin": 18.38,
"end": 18.939999999999998,
"confidence": 0.93
},
{
"word": " been",
"begin": 18.939999999999998,
"end": 19.240000000000002,
"confidence": 0.82
},
{
"word": " written",
"begin": 19.240000000000002,
"end": 19.46,
"confidence": 0.86
},
{
"word": " yet.",
"begin": 19.46,
"end": 19.96,
"confidence": 0.91
}
],
"speaker": "not_activated",
"channel": "channel_0"
}
],
"prediction_raw": {
"metadata": {
"total_speech_duration": 19.919999999999998,
"total_speech_duration_channel_0": 19.919999999999998,
"summarizationTime": 4.220367431640625,
"audioConversionTime": 0.41851377487182617,
"vadTime": 0.012445449829101562,
"inferenceTime": 1.9278953075408936,
"diarizationTime": 0.00001239776611328125,
"totalTranscriptionTime": 2.3588669300079346,
"nbSilentChannels": 0,
"nbSimilarChannels": 0,
"providedFileMetadata": {
"nb channels": 1,
"sample rate": 44100,
"sample width": 16,
"original file type": "audio"
}
},
"transcription": [
{
"time_begin": 0.09,
"time_end": 2.07,
"transcription": "Split infinity",
"language": "en",
"confidence": 0.49,
"words": [
{
"word": " Split",
"begin": 0.09,
"end": 0.75,
"confidence": 0.7
},
{
"word": " infinity.",
"begin": 0.75,
"end": 1.47,
"confidence": 0.4
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 2.13,
"time_end": 5.19,
"transcription": "in a time when less is more",
"language": "en",
"confidence": 0.65,
"words": [
{
"word": " in",
"begin": 2.13,
"end": 2.65,
"confidence": 0.59
},
{
"word": " a",
"begin": 2.65,
"end": 2.87,
"confidence": 0.99
},
{
"word": " time",
"begin": 2.87,
"end": 3.33,
"confidence": 0.82
},
{
"word": " when",
"begin": 3.33,
"end": 3.79,
"confidence": 0.86
},
{
"word": " less",
"begin": 3.79,
"end": 4.07,
"confidence": 0.87
},
{
"word": " is",
"begin": 4.07,
"end": 4.41,
"confidence": 0.91
},
{
"word": " more.",
"begin": 4.41,
"end": 4.6899999999999995,
"confidence": 0.88
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 5.52,
"time_end": 20.4,
"transcription": "Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"language": "en",
"confidence": 0.75,
"words": [
{
"word": " Where",
"begin": 5.52,
"end": 5.76,
"confidence": 0.51
},
{
"word": " too",
"begin": 5.76,
"end": 6.1,
"confidence": 0.8
},
{
"word": " much",
"begin": 6.1,
"end": 6.4799999999999995,
"confidence": 0.81
},
{
"word": " is",
"begin": 6.4799999999999995,
"end": 6.92,
"confidence": 0.9
},
{
"word": " never",
"begin": 6.92,
"end": 7.26,
"confidence": 0.88
},
{
"word": " enough,",
"begin": 7.26,
"end": 7.819999999999999,
"confidence": 0.77
},
{
"word": " there",
"begin": 8.62,
"end": 8.7,
"confidence": 0.81
},
{
"word": " is",
"begin": 8.7,
"end": 8.959999999999999,
"confidence": 0.84
},
{
"word": " always",
"begin": 8.959999999999999,
"end": 9.459999999999999,
"confidence": 0.74
},
{
"word": " hope",
"begin": 9.459999999999999,
"end": 9.8,
"confidence": 0.83
},
{
"word": " for",
"begin": 9.8,
"end": 10.12,
"confidence": 0.9
},
{
"word": " the",
"begin": 10.12,
"end": 10.32,
"confidence": 0.82
},
{
"word": " future.",
"begin": 10.32,
"end": 10.76,
"confidence": 0.93
},
{
"word": " The",
"begin": 11.8,
"end": 11.899999999999999,
"confidence": 0.82
},
{
"word": " future",
"begin": 11.899999999999999,
"end": 12.219999999999999,
"confidence": 0.94
},
{
"word": " can",
"begin": 12.219999999999999,
"end": 12.6,
"confidence": 0.9
},
{
"word": " be",
"begin": 12.6,
"end": 12.86,
"confidence": 0.91
},
{
"word": " read",
"begin": 12.86,
"end": 13.059999999999999,
"confidence": 0.9
},
{
"word": " from",
"begin": 13.059999999999999,
"end": 13.34,
"confidence": 0.82
},
{
"word": " the",
"begin": 13.34,
"end": 13.559999999999999,
"confidence": 0.82
},
{
"word": " past.",
"begin": 13.559999999999999,
"end": 14.139999999999999,
"confidence": 0.81
},
{
"word": " The",
"begin": 14.68,
"end": 14.78,
"confidence": 0.81
},
{
"word": " past",
"begin": 14.78,
"end": 15.34,
"confidence": 0.82
},
{
"word": " foreshadows",
"begin": 15.34,
"end": 16.119999999999997,
"confidence": 0.89
},
{
"word": " the",
"begin": 16.119999999999997,
"end": 16.46,
"confidence": 0.81
},
{
"word": " present,",
"begin": 16.46,
"end": 17.02,
"confidence": 0.8
},
{
"word": " and",
"begin": 17.439999999999998,
"end": 17.72,
"confidence": 0.89
},
{
"word": " the",
"begin": 17.72,
"end": 17.939999999999998,
"confidence": 0.82
},
{
"word": " present",
"begin": 17.939999999999998,
"end": 18.38,
"confidence": 0.8
},
{
"word": " hasn't",
"begin": 18.38,
"end": 18.939999999999998,
"confidence": 0.93
},
{
"word": " been",
"begin": 18.939999999999998,
"end": 19.240000000000002,
"confidence": 0.82
},
{
"word": " written",
"begin": 19.240000000000002,
"end": 19.46,
"confidence": 0.86
},
{
"word": " yet.",
"begin": 19.46,
"end": 19.96,
"confidence": 0.91
}
],
"speaker": "not_activated",
"channel": "channel_0"
}
],
"chapterization": "not_activated",
"summarization": "Though we are living in uncertain times, we can find hope for the future by looking at the past. We have the power to shape our present, and the concept of \"less is more\" is an important part of this journey."
}
}
Txt output
{
"prediction": "Split infinity in a time when less is more Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"prediction_raw": {
"metadata": {
"total_speech_duration": 19.919999999999998,
"total_speech_duration_channel_0": 19.919999999999998,
"summarizationTime": 2.164443254470825,
"audioConversionTime": 0.3706510066986084,
"vadTime": 0.010127544403076172,
"inferenceTime": 1.8316893577575684,
"diarizationTime": 0.0000016689300537109375,
"totalTranscriptionTime": 2.2124695777893066,
"nbSilentChannels": 0,
"nbSimilarChannels": 0,
"providedFileMetadata": {
"nb channels": 1,
"sample rate": 44100,
"sample width": 16,
"original file type": "audio"
}
},
"transcription": "transcription": [
{
"time_begin": 0.09,
"time_end": 2.07,
"transcription": "Split infinity",
"language": "en",
"confidence": 0.49,
"words": [
{
"word": " Split",
"begin": 0.09,
"end": 0.75,
"confidence": 0.7
},
{
"word": " infinity.",
"begin": 0.75,
"end": 1.47,
"confidence": 0.4
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 2.13,
"time_end": 5.19,
"transcription": "in a time when less is more",
"language": "en",
"confidence": 0.65,
"words": [
{
"word": " in",
"begin": 2.13,
"end": 2.65,
"confidence": 0.59
},
{
"word": " a",
"begin": 2.65,
"end": 2.87,
"confidence": 0.99
},
{
"word": " time",
"begin": 2.87,
"end": 3.33,
"confidence": 0.82
},
{
"word": " when",
"begin": 3.33,
"end": 3.79,
"confidence": 0.86
},
{
"word": " less",
"begin": 3.79,
"end": 4.07,
"confidence": 0.87
},
{
"word": " is",
"begin": 4.07,
"end": 4.41,
"confidence": 0.91
},
{
"word": " more.",
"begin": 4.41,
"end": 4.6899999999999995,
"confidence": 0.88
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 5.52,
"time_end": 20.4,
"transcription": "Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"language": "en",
"confidence": 0.75,
"words": [
{
"word": " Where",
"begin": 5.52,
"end": 5.76,
"confidence": 0.51
},
{
"word": " too",
"begin": 5.76,
"end": 6.1,
"confidence": 0.8
},
{
"word": " much",
"begin": 6.1,
"end": 6.4799999999999995,
"confidence": 0.81
},
{
"word": " is",
"begin": 6.4799999999999995,
"end": 6.92,
"confidence": 0.9
},
{
"word": " never",
"begin": 6.92,
"end": 7.26,
"confidence": 0.88
},
{
"word": " enough,",
"begin": 7.26,
"end": 7.819999999999999,
"confidence": 0.77
},
{
"word": " there",
"begin": 8.62,
"end": 8.7,
"confidence": 0.81
},
{
"word": " is",
"begin": 8.7,
"end": 8.959999999999999,
"confidence": 0.84
},
{
"word": " always",
"begin": 8.959999999999999,
"end": 9.459999999999999,
"confidence": 0.74
},
{
"word": " hope",
"begin": 9.459999999999999,
"end": 9.8,
"confidence": 0.83
},
{
"word": " for",
"begin": 9.8,
"end": 10.12,
"confidence": 0.9
},
{
"word": " the",
"begin": 10.12,
"end": 10.32,
"confidence": 0.82
},
{
"word": " future.",
"begin": 10.32,
"end": 10.76,
"confidence": 0.93
},
{
"word": " The",
"begin": 11.8,
"end": 11.899999999999999,
"confidence": 0.82
},
{
"word": " future",
"begin": 11.899999999999999,
"end": 12.219999999999999,
"confidence": 0.94
},
{
"word": " can",
"begin": 12.219999999999999,
"end": 12.6,
"confidence": 0.9
},
{
"word": " be",
"begin": 12.6,
"end": 12.86,
"confidence": 0.91
},
{
"word": " read",
"begin": 12.86,
"end": 13.059999999999999,
"confidence": 0.9
},
{
"word": " from",
"begin": 13.059999999999999,
"end": 13.34,
"confidence": 0.82
},
{
"word": " the",
"begin": 13.34,
"end": 13.559999999999999,
"confidence": 0.82
},
{
"word": " past.",
"begin": 13.559999999999999,
"end": 14.139999999999999,
"confidence": 0.81
},
{
"word": " The",
"begin": 14.68,
"end": 14.78,
"confidence": 0.81
},
{
"word": " past",
"begin": 14.78,
"end": 15.34,
"confidence": 0.82
},
{
"word": " foreshadows",
"begin": 15.34,
"end": 16.119999999999997,
"confidence": 0.89
},
{
"word": " the",
"begin": 16.119999999999997,
"end": 16.46,
"confidence": 0.81
},
{
"word": " present,",
"begin": 16.46,
"end": 17.02,
"confidence": 0.8
},
{
"word": " and",
"begin": 17.439999999999998,
"end": 17.72,
"confidence": 0.89
},
{
"word": " the",
"begin": 17.72,
"end": 17.939999999999998,
"confidence": 0.82
},
{
"word": " present",
"begin": 17.939999999999998,
"end": 18.38,
"confidence": 0.8
},
{
"word": " hasn't",
"begin": 18.38,
"end": 18.939999999999998,
"confidence": 0.93
},
{
"word": " been",
"begin": 18.939999999999998,
"end": 19.240000000000002,
"confidence": 0.82
},
{
"word": " written",
"begin": 19.240000000000002,
"end": 19.46,
"confidence": 0.86
},
{
"word": " yet.",
"begin": 19.46,
"end": 19.96,
"confidence": 0.91
}
],
"chapterization": "not_activated",
"summarization": "Looking to the past can provide hope for the future, and the present is yet to be determined."
}
}
SRT output
{
"prediction": "1\n00:00:00,900 --> 00:00:02,600\nSplit infinity\n\n2\n00:00:02,120 --> 00:00:05,190\nin a time when less is more\n\n3\n00:00:05,510 --> 00:00:20,390\nWhere too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet\n",
"prediction_raw": {
"metadata": {
"total_speech_duration": 19.919999999999998,
"total_speech_duration_channel_0": 19.919999999999998,
"summarizationTime": 2.164443254470825,
"audioConversionTime": 0.3706510066986084,
"vadTime": 0.010127544403076172,
"inferenceTime": 1.8316893577575684,
"diarizationTime": 0.0000016689300537109375,
"totalTranscriptionTime": 2.2124695777893066,
"nbSilentChannels": 0,
"nbSimilarChannels": 0,
"providedFileMetadata": {
"nb channels": 1,
"sample rate": 44100,
"sample width": 16,
"original file type": "audio"
}
},
"transcription": "transcription": [
{
"time_begin": 0.09,
"time_end": 2.07,
"transcription": "Split infinity",
"language": "en",
"confidence": 0.49,
"words": [
{
"word": " Split",
"begin": 0.09,
"end": 0.75,
"confidence": 0.7
},
{
"word": " infinity.",
"begin": 0.75,
"end": 1.47,
"confidence": 0.4
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 2.13,
"time_end": 5.19,
"transcription": "in a time when less is more",
"language": "en",
"confidence": 0.65,
"words": [
{
"word": " in",
"begin": 2.13,
"end": 2.65,
"confidence": 0.59
},
{
"word": " a",
"begin": 2.65,
"end": 2.87,
"confidence": 0.99
},
{
"word": " time",
"begin": 2.87,
"end": 3.33,
"confidence": 0.82
},
{
"word": " when",
"begin": 3.33,
"end": 3.79,
"confidence": 0.86
},
{
"word": " less",
"begin": 3.79,
"end": 4.07,
"confidence": 0.87
},
{
"word": " is",
"begin": 4.07,
"end": 4.41,
"confidence": 0.91
},
{
"word": " more.",
"begin": 4.41,
"end": 4.6899999999999995,
"confidence": 0.88
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 5.52,
"time_end": 20.4,
"transcription": "Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"language": "en",
"confidence": 0.75,
"words": [
{
"word": " Where",
"begin": 5.52,
"end": 5.76,
"confidence": 0.51
},
{
"word": " too",
"begin": 5.76,
"end": 6.1,
"confidence": 0.8
},
{
"word": " much",
"begin": 6.1,
"end": 6.4799999999999995,
"confidence": 0.81
},
{
"word": " is",
"begin": 6.4799999999999995,
"end": 6.92,
"confidence": 0.9
},
{
"word": " never",
"begin": 6.92,
"end": 7.26,
"confidence": 0.88
},
{
"word": " enough,",
"begin": 7.26,
"end": 7.819999999999999,
"confidence": 0.77
},
{
"word": " there",
"begin": 8.62,
"end": 8.7,
"confidence": 0.81
},
{
"word": " is",
"begin": 8.7,
"end": 8.959999999999999,
"confidence": 0.84
},
{
"word": " always",
"begin": 8.959999999999999,
"end": 9.459999999999999,
"confidence": 0.74
},
{
"word": " hope",
"begin": 9.459999999999999,
"end": 9.8,
"confidence": 0.83
},
{
"word": " for",
"begin": 9.8,
"end": 10.12,
"confidence": 0.9
},
{
"word": " the",
"begin": 10.12,
"end": 10.32,
"confidence": 0.82
},
{
"word": " future.",
"begin": 10.32,
"end": 10.76,
"confidence": 0.93
},
{
"word": " The",
"begin": 11.8,
"end": 11.899999999999999,
"confidence": 0.82
},
{
"word": " future",
"begin": 11.899999999999999,
"end": 12.219999999999999,
"confidence": 0.94
},
{
"word": " can",
"begin": 12.219999999999999,
"end": 12.6,
"confidence": 0.9
},
{
"word": " be",
"begin": 12.6,
"end": 12.86,
"confidence": 0.91
},
{
"word": " read",
"begin": 12.86,
"end": 13.059999999999999,
"confidence": 0.9
},
{
"word": " from",
"begin": 13.059999999999999,
"end": 13.34,
"confidence": 0.82
},
{
"word": " the",
"begin": 13.34,
"end": 13.559999999999999,
"confidence": 0.82
},
{
"word": " past.",
"begin": 13.559999999999999,
"end": 14.139999999999999,
"confidence": 0.81
},
{
"word": " The",
"begin": 14.68,
"end": 14.78,
"confidence": 0.81
},
{
"word": " past",
"begin": 14.78,
"end": 15.34,
"confidence": 0.82
},
{
"word": " foreshadows",
"begin": 15.34,
"end": 16.119999999999997,
"confidence": 0.89
},
{
"word": " the",
"begin": 16.119999999999997,
"end": 16.46,
"confidence": 0.81
},
{
"word": " present,",
"begin": 16.46,
"end": 17.02,
"confidence": 0.8
},
{
"word": " and",
"begin": 17.439999999999998,
"end": 17.72,
"confidence": 0.89
},
{
"word": " the",
"begin": 17.72,
"end": 17.939999999999998,
"confidence": 0.82
},
{
"word": " present",
"begin": 17.939999999999998,
"end": 18.38,
"confidence": 0.8
},
{
"word": " hasn't",
"begin": 18.38,
"end": 18.939999999999998,
"confidence": 0.93
},
{
"word": " been",
"begin": 18.939999999999998,
"end": 19.240000000000002,
"confidence": 0.82
},
{
"word": " written",
"begin": 19.240000000000002,
"end": 19.46,
"confidence": 0.86
},
{
"word": " yet.",
"begin": 19.46,
"end": 19.96,
"confidence": 0.91
}
],
"chapterization": "not_activated",
"summarization": "Looking to the past can provide hope for the future, and the present is yet to be determined."
}
}
VTT output
{
"prediction": "WEBVTT\n\n1\n00:00:00.090 --> 00:00:02.069\nSplit infinity\n\n2\n00:00:02.129 --> 00:00:05.190\nin a time when less is more\n\n3\n00:00:05.519 --> 00:00:20.399\nWhere too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet\n",
"prediction_raw": {
"metadata": {
"total_speech_duration": 19.919999999999998,
"total_speech_duration_channel_0": 19.919999999999998,
"summarizationTime": 2.164443254470825,
"audioConversionTime": 0.3706510066986084,
"vadTime": 0.010127544403076172,
"inferenceTime": 1.8316893577575684,
"diarizationTime": 0.0000016689300537109375,
"totalTranscriptionTime": 2.2124695777893066,
"nbSilentChannels": 0,
"nbSimilarChannels": 0,
"providedFileMetadata": {
"nb channels": 1,
"sample rate": 44100,
"sample width": 16,
"original file type": "audio"
}
},
"transcription": "transcription": [
{
"time_begin": 0.09,
"time_end": 2.07,
"transcription": "Split infinity",
"language": "en",
"confidence": 0.49,
"words": [
{
"word": " Split",
"begin": 0.09,
"end": 0.75,
"confidence": 0.7
},
{
"word": " infinity.",
"begin": 0.75,
"end": 1.47,
"confidence": 0.4
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 2.13,
"time_end": 5.19,
"transcription": "in a time when less is more",
"language": "en",
"confidence": 0.65,
"words": [
{
"word": " in",
"begin": 2.13,
"end": 2.65,
"confidence": 0.59
},
{
"word": " a",
"begin": 2.65,
"end": 2.87,
"confidence": 0.99
},
{
"word": " time",
"begin": 2.87,
"end": 3.33,
"confidence": 0.82
},
{
"word": " when",
"begin": 3.33,
"end": 3.79,
"confidence": 0.86
},
{
"word": " less",
"begin": 3.79,
"end": 4.07,
"confidence": 0.87
},
{
"word": " is",
"begin": 4.07,
"end": 4.41,
"confidence": 0.91
},
{
"word": " more.",
"begin": 4.41,
"end": 4.6899999999999995,
"confidence": 0.88
}
],
"speaker": "not_activated",
"channel": "channel_0"
},
{
"time_begin": 5.52,
"time_end": 20.4,
"transcription": "Where too much is never enough, there is always hope for the future. The future can be read from the past. The past foreshadows the present, and the present hasn't been written yet",
"language": "en",
"confidence": 0.75,
"words": [
{
"word": " Where",
"begin": 5.52,
"end": 5.76,
"confidence": 0.51
},
{
"word": " too",
"begin": 5.76,
"end": 6.1,
"confidence": 0.8
},
{
"word": " much",
"begin": 6.1,
"end": 6.4799999999999995,
"confidence": 0.81
},
{
"word": " is",
"begin": 6.4799999999999995,
"end": 6.92,
"confidence": 0.9
},
{
"word": " never",
"begin": 6.92,
"end": 7.26,
"confidence": 0.88
},
{
"word": " enough,",
"begin": 7.26,
"end": 7.819999999999999,
"confidence": 0.77
},
{
"word": " there",
"begin": 8.62,
"end": 8.7,
"confidence": 0.81
},
{
"word": " is",
"begin": 8.7,
"end": 8.959999999999999,
"confidence": 0.84
},
{
"word": " always",
"begin": 8.959999999999999,
"end": 9.459999999999999,
"confidence": 0.74
},
{
"word": " hope",
"begin": 9.459999999999999,
"end": 9.8,
"confidence": 0.83
},
{
"word": " for",
"begin": 9.8,
"end": 10.12,
"confidence": 0.9
},
{
"word": " the",
"begin": 10.12,
"end": 10.32,
"confidence": 0.82
},
{
"word": " future.",
"begin": 10.32,
"end": 10.76,
"confidence": 0.93
},
{
"word": " The",
"begin": 11.8,
"end": 11.899999999999999,
"confidence": 0.82
},
{
"word": " future",
"begin": 11.899999999999999,
"end": 12.219999999999999,
"confidence": 0.94
},
{
"word": " can",
"begin": 12.219999999999999,
"end": 12.6,
"confidence": 0.9
},
{
"word": " be",
"begin": 12.6,
"end": 12.86,
"confidence": 0.91
},
{
"word": " read",
"begin": 12.86,
"end": 13.059999999999999,
"confidence": 0.9
},
{
"word": " from",
"begin": 13.059999999999999,
"end": 13.34,
"confidence": 0.82
},
{
"word": " the",
"begin": 13.34,
"end": 13.559999999999999,
"confidence": 0.82
},
{
"word": " past.",
"begin": 13.559999999999999,
"end": 14.139999999999999,
"confidence": 0.81
},
{
"word": " The",
"begin": 14.68,
"end": 14.78,
"confidence": 0.81
},
{
"word": " past",
"begin": 14.78,
"end": 15.34,
"confidence": 0.82
},
{
"word": " foreshadows",
"begin": 15.34,
"end": 16.119999999999997,
"confidence": 0.89
},
{
"word": " the",
"begin": 16.119999999999997,
"end": 16.46,
"confidence": 0.81
},
{
"word": " present,",
"begin": 16.46,
"end": 17.02,
"confidence": 0.8
},
{
"word": " and",
"begin": 17.439999999999998,
"end": 17.72,
"confidence": 0.89
},
{
"word": " the",
"begin": 17.72,
"end": 17.939999999999998,
"confidence": 0.82
},
{
"word": " present",
"begin": 17.939999999999998,
"end": 18.38,
"confidence": 0.8
},
{
"word": " hasn't",
"begin": 18.38,
"end": 18.939999999999998,
"confidence": 0.93
},
{
"word": " been",
"begin": 18.939999999999998,
"end": 19.240000000000002,
"confidence": 0.82
},
{
"word": " written",
"begin": 19.240000000000002,
"end": 19.46,
"confidence": 0.86
},
{
"word": " yet.",
"begin": 19.46,
"end": 19.96,
"confidence": 0.91
}
],
"chapterization": "not_activated",
"summarization": "Looking to the past can provide hope for the future, and the present is yet to be determined."
}
}