The Gladia CLI is a command-line interface tool that allows developers to transcribe audio files or URLs using the Gladia API. It supports several features such as noise reduction, speaker diarization, direct translation, text emotion recognition, and summarization.
The Gladia CLI can be installed directly from the GitHub repository or built from source using pipenv.
To install the Gladia CLI directly from the GitHub repository, use the following commands:
wget <https://github.com/gladiaio/gladia-cli/raw/main/dist/linux_x64_gladia> && mv linux_x64_gladia gladia && chmod +x gladia
wget <https://github.com/gladiaio/gladia-cli/raw/main/dist/macos_arm64_gladia> && mv macos_arm64_gladia gladia && chmod +x gladia
To build the Gladia CLI from the source, use the following commands:
pipenv shell pip install -r requirements.txt ./build.sh Usage The Gladia CLI can be used by running the gladia command followed by the necessary options. Options The following are the available options for the Gladia CLI: \--audio-url: The URL of the audio file to be transcribed. --audio-file: The path to the audio file to be transcribed. --language-behaviour: Determines how to handle multi-language audio. --language: The language spoken in the audio file. --transcription-hint: A hint to the transcription model. You can pass names, topics, custom vocabulary, etc. --noise-reduction: Applies noise reduction to the audio. --diarization: Performs speaker diarization. --diarization-max-speakers: Determines the maximum number of speakers to be detected. --direct-translate: Activates direct translation to the specified language. --direct-translate-language: The language to which to translate the transcription. Requires the direct translation option to be activated using --direct-translate. --text-emotion: Activates text emotion recognition. --summarization: Activates summarization. --output-format: The format in which to return the transcription results. Possible values are table, json, text, srt, vtt, plain. --gladia-key: The API key for Gladia. Get it at <https://app.gladia.io/account> --save-gladia-key: Saves the API key to a configuration file. --help: Shows the available options.
To use the Gladia API, developers must obtain an API key from https://app.gladia.io/account.
The API key can be saved to a configuration file using the --save-gladia-key option.
The following is an example of using the Gladia CLI to transcribe an audio file:
gladia --audio-url http://files.gladia.io/example/audio-transcription/split_infinity.wav Transcribing audio file... Transcript time_begin time_end probability language speaker transcription 0.09 2.07 0.49 en not_activated Split infinity 2.13 5.19 0.65 en not_activated in a time when less is more