Comment by vicchenai

13 hours ago

the built-in diarization is the one thing that actually caught my attention here. running whisper + pyannote separately is a pain for long recordings and the speaker continuity breaks at chunk boundaries. if this handles it in a single pass that's a real workflow improvement, regardless of how the raw accuracy benchmarks compare
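
for anyone who hasn't hit the chunk-boundary problem: when each chunk is diarized on its own, the speaker labels are local to that chunk, so "SPEAKER_00" in one chunk can be a different person in the next. a minimal sketch of the usual workaround, assuming chunks are cut with some time overlap and segments are already in absolute seconds. the function names, the tuple layout and the `spkN` ids are made up for illustration and are not the API of whisper, pyannote or the tool discussed here:

```python
from collections import defaultdict


def overlap_seconds(a_start, a_end, b_start, b_end):
    """Length of the time intersection of two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def stitch_chunks(chunks):
    """Relabel per-chunk speaker labels into one global label space.

    chunks: list of chunks, each a list of (start, end, local_label)
    with absolute timestamps; consecutive chunks overlap in time.
    Greedy sketch: each local label takes the global label it shares
    the most time with; two local labels may end up merged.
    """
    stitched = []
    next_id = 0
    for segments in chunks:
        # votes[local][global] = seconds shared with already-stitched audio
        votes = defaultdict(lambda: defaultdict(float))
        for start, end, local in segments:
            for g_start, g_end, g_label in stitched:
                shared = overlap_seconds(start, end, g_start, g_end)
                if shared > 0:
                    votes[local][g_label] += shared

        mapping = {}
        for _, _, local in segments:
            if local in mapping:
                continue
            if votes[local]:
                mapping[local] = max(votes[local], key=votes[local].get)
            else:
                # never heard in the overlap window: gets a fresh id
                mapping[local] = f"spk{next_id}"
                next_id += 1

        # overlap-region segments are kept twice; a real pipeline would dedupe
        stitched.extend((s, e, mapping[lab]) for s, e, lab in segments)
    return stitched


# two chunks covering 0-20s and 14-34s; local labels are flipped in the second
chunk_a = [(0, 8, "SPEAKER_00"), (8, 16, "SPEAKER_01"), (16, 20, "SPEAKER_00")]
chunk_b = [(14, 16, "SPEAKER_00"), (16, 25, "SPEAKER_01"), (25, 34, "SPEAKER_00")]
result = stitch_chunks([chunk_a, chunk_b])
```

the weak spot is the `else` branch: a speaker who is silent during the overlap window gets a brand new id even if they spoke earlier, which is exactly the continuity break on long recordings. fixing that properly means comparing speaker embeddings across chunks, and that is the glue code a single-pass approach would make unnecessary.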
