Interfaze

logo

Beta

pricing

docs

blog

sign in

Get Started

Introduction

Examples

Vision

Concepts

Resources

Integrations

STT Speaker Diarization

copy markdown

Diarize multiple speakers on long and short audio files with multilingual support.

  • Over 100+ languages with mixed language support. View all supported languages
  • Speaker diarization for up to 50 speakers
  • Speaker based intent, sentiment, and other audio analysis

Basic speaker diarization


OpenAI SDK

Vercel AI SDK

LangChain SDK

...

JSON output

...

Sentiment analysis by speaker


OpenAI SDK

Vercel AI SDK

LangChain SDK

...

JSON output

...

Blazing fast diarization on long audio files

To get the best performance with long audio file is to use run task with the <task>speech_to_text</task> in the system prompt, this only activates a part of the model used for audio.


OpenAI SDK

Vercel AI SDK

LangChain SDK

...

This took 1m10s to transcribe and diarize a 1hr and 35min audio file.

JSON output

...

The output is truncated for this example.