Skip to main content

Real-Time Speech to Text

Agora's Real-Time Speech to Text (STT) transcribes live voice streams to deliver closed captions and transcription for enhanced accessibility. With advanced features like silent audio removal, it optimizes performance and reduces costs.

Transcribed text can be translated into multiple languages in real-time or used as input for AI models like GPT, seamlessly connecting real-time engagement with AI-powered applications.

Start building with

API reference

Samples