Authorizations
Your personal Gladia API key
Query Parameters
The region used to process the audio.
us-west
, eu-west
Body
The encoding format of the audio stream. Supported formats:
- PCM: 8, 16, 24, and 32 bits
- A-law: 8 bits
- μ-law: 8 bits
Note: No need to add WAV headers to raw audio as the API supports both formats.
wav/pcm
, wav/alaw
, wav/ulaw
The bit depth of the audio stream
8
, 16
, 24
, 32
The sample rate of the audio stream
8000
, 16000
, 32000
, 44100
, 48000
The number of channels of the audio stream
1 <= x <= 8
Custom metadata you can attach to this live transcription
{ "user": "John Doe" }
The model used to process the audio. "solaria-1" is used by default.
solaria-1
The endpointing duration in seconds. Endpointing is the duration of silence which will cause an utterance to be considered as finished
0.01 <= x <= 10
The maximum duration in seconds without endpointing. If endpointing is not detected after this duration, current utterance will be considered as finished
5 <= x <= 60
Specify the language configuration
Specify the pre-processing configuration
Specify the realtime processing configuration
Specify the post-processing configuration
Specify the websocket messages configuration
If true, messages will be sent to configured url.
Specify the callback configuration
Response
The live job has been initiated
Id of the job
"45463597-20b7-4af7-b3b3-f5fb778203ab"
The websocket url to connect to for sending audio data. The url will contain the temporary token to authenticate the session.
"wss://api.gladia.io/v2/live?token=4a39145c-2844-4557-8f34-34883f7be7d9"