Initiate a session
Initiate a live transcription job. Use the returned ws url
to connect to the websocket and start sending audio chunks. Use the returned id
and the GET /v2/live/:id endpoint to obtain the status and results.
Authorizations
Your personal Gladia API key
Body
The encoding format of the audio stream. Supported formats:
- PCM: 8, 16, 24, and 32 bits
- A-law: 8 bits
- μ-law: 8 bits
Note: No need to add WAV headers to raw audio as the API supports both formats.
wav/pcm
, wav/alaw
, wav/ulaw
The bit depth of the audio stream
8
, 16
, 24
, 32
The sample rate of the audio stream
8000
, 16000
, 32000
, 44100
, 48000
The number of channels of the audio stream
1 <= x <= 8
Custom metadata you can attach to this live transcription
{ "user": "John Doe" }
The model used to process the audio. "accurate" is used by default.
fast
, accurate
The endpointing duration in seconds. Endpointing is the duration of silence which will cause an utterance to be considered as finished
0.01 <= x <= 10
The maximum duration in seconds without endpointing. If endpointing is not detected after this duration, current utterance will be considered as finished
5 <= x <= 60
Specify the language configuration
Specify the pre-processing configuration
Specify the realtime processing configuration
Specify the post-processing configuration
Specify the websocket messages configuration
If true, messages will be sent to configured url.
Specify the callback configuration
Response
Id of the job
"45463597-20b7-4af7-b3b3-f5fb778203ab"
The websocket url to connect to for sending audio data. The url will contain the temporary token to authenticate the session.
"wss://api.gladia.io/v2/live?token=4a39145c-2844-4557-8f34-34883f7be7d9"
Was this page helpful?