What is Pipecat?

Pipecat is an open‑source framework for building realtime, voice‑first AI agents. It orchestrates audio input/output, speech‑to‑text (STT), large language models (LLMs), and text‑to‑speech (TTS) into one low‑latency pipeline you can run on the server and connect to from the browser or native apps. With Pipecat’s Gladia STT service, you can plug Gladia’s accurate, low‑latency transcription directly into your agent.
Looking for the full API and options? See the official Pipecat Gladia STT docs: docs.pipecat.ai/server/services/stt/gladia.
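
To make the STT → LLM → TTS flow concrete, here is a minimal conceptual sketch in plain Python. It does not use Pipecat's or Gladia's actual API. The names `fake_stt`, `fake_llm`, `fake_tts`, and `run_pipeline` are hypothetical stand-ins invented for illustration, and a real agent would stream audio frames rather than process whole chunks one at a time. Refer to the official docs linked above for the real classes and options.

```python
import asyncio
from typing import Any, Awaitable, Callable, List

# NOTE: every function here is a hypothetical stand-in, not part of Pipecat or Gladia.

async def fake_stt(audio_chunk: bytes) -> str:
    """Stand-in for speech-to-text: decodes the bytes as UTF-8 text."""
    return audio_chunk.decode("utf-8")

async def fake_llm(transcript: str) -> str:
    """Stand-in for an LLM: echoes the transcript back in a reply."""
    return f"You said: {transcript}"

async def fake_tts(reply: str) -> bytes:
    """Stand-in for text-to-speech: encodes the reply text as bytes."""
    return reply.encode("utf-8")

Stage = Callable[[Any], Awaitable[Any]]

async def run_pipeline(chunks: List[bytes], stages: List[Stage]) -> List[Any]:
    """Pass each input chunk through every stage in order and collect the results."""
    outputs = []
    for chunk in chunks:
        data: Any = chunk
        for stage in stages:
            data = await stage(data)
        outputs.append(data)
    return outputs

def demo() -> List[Any]:
    """Run two fake audio chunks through the STT -> LLM -> TTS chain."""
    chunks = [b"hello", b"what time is it"]
    return asyncio.run(run_pipeline(chunks, [fake_stt, fake_llm, fake_tts]))

if __name__ == "__main__":
    print(demo())
```

The point of the sketch is the shape: each stage consumes the previous stage's output, so swapping one STT provider for another (for example, Gladia) only changes the first stage and leaves the rest of the chain untouched.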

What you can build

  • Voice assistants that understand users in real time
  • Live captions and meeting companions
  • Contact‑center bots and IVRs
  • In‑app help widgets with speech interfaces

Quickstart using the sample repo

Use our end‑to‑end sample, which wires Pipecat to Gladia STT: gladia-samples/integrations-examples/pipecat-bot

Next steps