← Back to context

Comment by Morizero

10 days ago

You don't happen to know a whisper solution that combines diarization with live audio transcription, do you?

WhipserX's diarization is great imo:

    whisperx input.mp3 --language en --diarize --output_format vtt --model large-v2

Works a treat for Zoom interviews. Diarization is sometimes a bit off, but generally its correct.