Pre-recorded Live As Speech-to-text models are trained on general vocabulary, under-represented words such as brand names, proper nouns, or domain-specific terms are often transcribed incorrectly. Custom Spelling is a post-processing operation that applies literal matching between the correct word and the pronunciations entries. When there is a literal match, the transcribed text is replaced with your term.Documentation Index
Fetch the complete documentation index at: https://docs.gladia.io/llms.txt
Use this file to discover all available pages before exploring further.
If the word comes out garbled or replaced by something phonetically similar (e.g.
“le vin” instead of “Levain”), use Custom
vocabulary instead. Custom
vocabulary matches on phonemes, not literal text.
How it works
Gladia runs custom spelling on the transcript text after transcription:- Gladia scans the output for strings listed in your dictionary values.
- When a variant is found, it is replaced with the corresponding key.
- Each entry supplies:
- Key — the spelling to write (case-sensitive).
- Values — variant strings to find (case-insensitive; can be multiple words).
Example: name “Gorish”
If the model outputs “gaurish” or “ghorish”, Gladia replaces them with “Gorish” when they appear in your dictionary:When to use custom vocabulary vs. custom spelling
Use Custom spelling when the model outputs a recognizable but wrong form. It applies literal string matching on variants you list (e.g. “data-science” → “Data Science”). List every close variant the model might output. Use Custom vocabulary when the model outputs garbled or sound-alike text. It applies phoneme-based matching on entries you define (e.g. “le vin” / “levine” → “Levain”). Add pronunciations for each spelling the model might produce.| Custom spelling | Custom vocabulary | |
|---|---|---|
| Matches on | Exact text in the transcript | How words sound |
| Best for | Wrong spelling, punctuation, formatting | Phonetically similar mis-transcriptions |
| You provide | All the words that the model outputs wrongly | value, pronunciations, intensity |
Example configuration
Parameter reference
Tuning tips
- Collect variants from real transcripts — run without custom spelling first, then add keys and values from what the model actually outputs.
- Match key capitalization to how the word should appear in the final transcript.
- List phonetically different strings separately — custom spelling will not group them the way custom vocabulary does.
- Move garbled or sound-alike output to custom vocabulary when listing every variant becomes impractical.
Recommended workflow
- Transcribe without custom spelling and note misspelled terms.
- Route each term: recognizable but wrong spelling → custom spelling; garbled or phonetically wrong → custom vocabulary.
- Build the dictionary — correct form as the key, every variant you have seen as values.
- Transcribe again — confirm replacements and check that nothing else was changed unexpectedly.
- Refine: add new variants as they appear in production audio.