Migrate to the latest version of Gladia’s API
V1
V2
encoding
, bit_depth
and sample_rate
are still present in V2, but with less options for now.
As wav
is the same encoding
as wav/pcm
, V2 has dropped support for wav
and defaults to wav/pcm
.
amb
, mp3
, flac
, ogg/vorbis
, opus
, sphere
and amr-nb
are no longer supported.
bit_depth
option 64
is no longer supported.
If you’re using an unsupported encoding
or bit_depth
, please contact us with your use case. In the mean time, keep using V1.
model
.
endpointing
is now declared in seconds instead of milliseconds.
maximum_audio_duration
has been renamed to maximum_duration_without_endpointing
.
V1
V2
V1
V2
V1
V2
V1
V2
frames_format
is no longer present.
audio_enhancer
has been moved into the pre_processing
object.
V1
V2
word_timestamps
has been renamed to words_accurate_timestamps
and moved into the realtime_processing
object.
V1
V2
prosody
, reinject_context
and transcription_hint
are not supported for now.
They may return in another form in the future.
V1
V2
V1
V2
type
field is "final"
and/or the transcription
field is not empty. type
field is transcript
and that data.is_final
is true
.
Below are examples of transcript messages in V1 and V2, so you can see the differences.
See the API reference for the full format.
V1
V2
V1
V2