Initiate a session
Initiate a live transcription job. Use the returned WebSocket URL
to connect and start sending audio chunks. Use the returned id
with the GET /v2/live/:id endpoint to obtain the job's status and results.
Authorizations
Your personal Gladia API key
Body
The encoding format of the audio stream. Supported formats:
- PCM: 8, 16, 24, and 32 bits
- A-law: 8 bits
- μ-law: 8 bits
Note: Raw audio does not need WAV headers; the API accepts audio both with and without them.
wav/pcm, wav/alaw, wav/ulaw
The bit depth of the audio stream
8, 16, 24, 32
The sample rate of the audio stream
8000, 16000, 32000, 44100, 48000
The number of channels of the audio stream
1 < x < 8
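The bit depth, sample rate, and channel count together determine how many bytes of raw audio correspond to one second of sound, which is useful when deciding how large each audio chunk should be. The sketch below is only illustrative arithmetic: the function names and the 100 ms chunk duration are assumptions made for the example, not values prescribed by the API.

```python
def bytes_per_second(sample_rate: int, bit_depth: int, channels: int) -> int:
    """Data rate of raw (headerless) audio for the given stream parameters."""
    bytes_per_sample = bit_depth // 8
    return sample_rate * bytes_per_sample * channels


def chunk_size(sample_rate: int, bit_depth: int, channels: int, chunk_ms: int) -> int:
    """Number of bytes covering `chunk_ms` milliseconds of audio.

    `chunk_ms` is a hypothetical parameter chosen by the client.
    """
    return bytes_per_second(sample_rate, bit_depth, channels) * chunk_ms // 1000


# 16 kHz, 16-bit, mono PCM
print(bytes_per_second(16000, 16, 1))  # 32000
print(chunk_size(16000, 16, 1, 100))   # 3200
```

With 16-bit mono audio at 16000 Hz, each second is 32000 bytes, so a 100 ms chunk is 3200 bytes.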
Custom metadata you can attach to this live transcription
The endpointing duration in seconds. Endpointing is the duration of silence that causes an utterance to be considered finished
0.01 < x < 10
The maximum duration in seconds without endpointing. If endpointing is not detected within this duration, the current utterance will be considered finished
5 < x < 60
Specify the language configuration
Specify the pre-processing configuration
Specify the realtime processing configuration
Specify the post-processing configuration
Specify the websocket messages configuration
If true, messages will be sent to the configured URL.
Specify the callback configuration
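As a rough illustration of the audio-format fields described above, the following sketch assembles a request body and rejects values outside the listed options before anything is sent. The JSON field names (`encoding`, `bit_depth`, `sample_rate`, `channels`) and the helper name are assumptions, since this page does not show the parameter names; check them against the actual schema before use.

```python
import json

# Allowed values as listed in this reference.
SUPPORTED_ENCODINGS = {"wav/pcm", "wav/alaw", "wav/ulaw"}
SUPPORTED_BIT_DEPTHS = {8, 16, 24, 32}
SUPPORTED_SAMPLE_RATES = {8000, 16000, 32000, 44100, 48000}


def build_session_body(encoding="wav/pcm", bit_depth=16, sample_rate=16000,
                       channels=1, **extra):
    """Build a request body dict; field names are assumed, not confirmed."""
    if encoding not in SUPPORTED_ENCODINGS:
        raise ValueError(f"unsupported encoding: {encoding!r}")
    if bit_depth not in SUPPORTED_BIT_DEPTHS:
        raise ValueError(f"unsupported bit depth: {bit_depth!r}")
    if sample_rate not in SUPPORTED_SAMPLE_RATES:
        raise ValueError(f"unsupported sample rate: {sample_rate!r}")
    # A-law and mu-law are listed as 8-bit only.
    if encoding in ("wav/alaw", "wav/ulaw") and bit_depth != 8:
        raise ValueError(f"{encoding} requires a bit depth of 8")
    body = {
        "encoding": encoding,
        "bit_depth": bit_depth,
        "sample_rate": sample_rate,
        "channels": channels,
    }
    body.update(extra)  # any further options, passed through unchecked
    return body


print(json.dumps(build_session_body(sample_rate=48000), indent=2))
```

Validating on the client side is a design choice rather than a requirement: it surfaces a misconfigured stream before a session is created.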
Response
Id of the job
The WebSocket URL to connect to for sending audio data. The URL contains a temporary token that authenticates the session.
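A minimal sketch of handling the response, assuming it is a JSON object whose fields are named `id` and `url` (the names are an assumption; this page gives only their descriptions). The sample payload is made up for illustration and does not reflect a real session or a real URL format.

```python
import json
from urllib.parse import urlparse


def parse_session_response(raw: str) -> dict:
    """Extract the job id and WebSocket URL from a response body.

    Assumes JSON fields named "id" and "url".
    """
    data = json.loads(raw)
    session_id = data["id"]
    ws_url = data["url"]
    if urlparse(ws_url).scheme not in ("ws", "wss"):
        raise ValueError("expected a ws:// or wss:// URL")
    return {
        "id": session_id,
        "ws_url": ws_url,
        # Path of the status/results endpoint named on this page.
        "status_path": f"/v2/live/{session_id}",
    }


# Made-up example payload, for illustration only.
example = '{"id": "example-id", "url": "wss://example.invalid/live?token=placeholder"}'
session = parse_session_response(example)
print(session["status_path"])  # /v2/live/example-id
```

Because the URL carries the temporary token, it is sensible to treat it like a credential and keep it out of logs.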