We support almost all audio and video file types. When choosing a format, weigh two costs against each other: the transfer time of formats that produce large files, and the time needed to convert the original format to the target one (WAV PCM, 16 kHz, little-endian).

Gladia API current limitations

These limits are in place to ensure the stability and performance of the service for everyone, and they will be lifted gradually.

  • Audio length: The maximum length of audio that can be transcribed in a single request is currently 135 minutes. Attempts to transcribe longer audio files will result in an error. Direct YouTube links are limited to 120 minutes instead of 135 minutes.

  • File size: Audio files must not exceed 500 MB in size. Larger files will not be accepted by the API.
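
The limits above can be checked on the client before uploading. The following is a minimal sketch, not part of the Gladia API: the function name `check_limits` and the constants are hypothetical, and it assumes that "500 MB" means 500 × 1,000,000 bytes (the more conservative reading) and that you already know the duration of your media in seconds.

```python
# Limits taken from the list above; names are illustrative only.
MAX_AUDIO_MINUTES = 135      # uploaded files
MAX_YOUTUBE_MINUTES = 120    # direct YouTube links
MAX_FILE_BYTES = 500 * 1000 * 1000  # assumption: decimal megabytes


def check_limits(duration_seconds, size_bytes=None, is_youtube=False):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []

    max_minutes = MAX_YOUTUBE_MINUTES if is_youtube else MAX_AUDIO_MINUTES
    if duration_seconds > max_minutes * 60:
        problems.append(
            f"duration {duration_seconds / 60:.1f} min exceeds {max_minutes} min"
        )

    # File size only applies to uploads, so it is optional here.
    if size_bytes is not None and size_bytes > MAX_FILE_BYTES:
        problems.append(
            f"size {size_bytes} bytes exceeds {MAX_FILE_BYTES} bytes"
        )

    return problems
```

For a local file, `os.path.getsize(path)` gives the size in bytes. An empty result only means the request is within the documented limits, not that the API will accept it.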

Splitting oversize audio files

For audio files that approach or exceed the length and size limits, we recommend splitting them into smaller chunks of about 60 minutes each. This keeps each request within the API constraints and also generally yields better transcription results.

Tools for splitting audio files:

  • FFmpeg: a versatile command-line tool for manipulating audio and video files, and a popular choice for splitting long audio files.
  • ffmpeg-python: for Python users, a wrapper around FFmpeg that provides a more Pythonic interface.
  • prism-media for Node.js: Node.js users can use prism-media to manipulate media files, including splitting audio files.
  • fluent-ffmpeg for Node.js: another option for Node.js users, offering a simpler, more fluent API for handling media files.
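
As an illustration of the splitting approach, here is a minimal Python sketch that calls the FFmpeg command-line tool through `subprocess`, using FFmpeg's `segment` muxer. The helper names `build_split_command` and `split_audio` and the `_partNNN` output naming are hypothetical, and the sketch assumes `ffmpeg` is installed and on your `PATH`. It copies the streams without re-encoding (`-c copy`), so cut points fall on packet or keyframe boundaries and chunk lengths are approximate.

```python
import subprocess
from pathlib import Path


def build_split_command(input_path, chunk_minutes=60):
    """Build an ffmpeg command that splits a file into fixed-length chunks."""
    src = Path(input_path)
    # Output files are written next to the input: name_part000.ext, ...
    pattern = str(src.with_name(f"{src.stem}_part%03d{src.suffix}"))
    return [
        "ffmpeg",
        "-i", str(src),
        "-f", "segment",                           # use the segment muxer
        "-segment_time", str(chunk_minutes * 60),  # chunk length in seconds
        "-reset_timestamps", "1",                  # each chunk starts at 0
        "-c", "copy",                              # no re-encoding
        pattern,
    ]


def split_audio(input_path, chunk_minutes=60):
    """Run ffmpeg; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(build_split_command(input_path, chunk_minutes), check=True)
```

Calling `split_audio("interview.mp3")` would be expected to write `interview_part000.mp3`, `interview_part001.mp3`, and so on. Each chunk can then be submitted as its own transcription request.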

Following these best practices will help you avoid issues due to limitations and maximize the quality of the transcriptions you obtain from the Audio Transcription API.

Supported audio formats

Source Format | MIME Type       | Audio/Video
aac           | audio/aac       | Audio
ac3           | audio/ac3       | Audio
eac3          | audio/eac3      | Audio
flac          | audio/flac      | Audio
m4a           | audio/mp4       | Audio
mp2           | audio/mpeg      | Audio
mp3           | audio/mpeg      | Audio
ogg           | application/ogg | Audio
opus          | audio/opus      | Audio
wav           | audio/wav       | Audio

Supported video formats

Source Format | MIME Type        | Audio/Video
3g2           | video/3gpp2      | Video
3gp           | video/3gpp       | Video
avi           | video/x-msvideo  | Video
flv           | video/x-flv      | Video
m4v           | video/x-m4v      | Video
matroska      | video/x-matroska | Audio/Video
mov           | video/quicktime  | Video
mp4           | video/mp4        | Audio/Video
wmv           | video/x-ms-wmv   | Video