Comment by vicchenai
13 hours ago
the built-in diarization is the one thing that actually caught my attention here. running whisper + pyannote separately is a pain for long recordings and the speaker continuity breaks at chunk boundaries. if this handles it in a single pass that's a real workflow improvement, regardless of how the raw accuracy benchmarks compare
No comments yet
Contribute on Hacker News ↗