Get Started
Examples
Concepts
Resources
Integrations
copy markdown
Diarize multiple speakers on long and short audio files with multilingual support.
OpenAI SDK
Vercel AI SDK
LangChain SDK
JSON output
OpenAI SDK
Vercel AI SDK
LangChain SDK
JSON output
To get the best performance with long audio file is to use run task with the <task>speech_to_text</task> in the system prompt, this only activates a part of the model used for audio.
OpenAI SDK
Vercel AI SDK
LangChain SDK
This took 1m10s to transcribe and diarize a 1hr and 35min audio file.
JSON output
The output is truncated for this example.