Comment by throwawaymaths
3 days ago
do you know if any current locally hostable public transcribers are good at diarization? for some tasks having even crude diarization would improve QOL by a huge factor. i was looking at a whisper diarization python package for a bit but it was a bitch to deploy.
WhisperX! https://github.com/basetenlabs/truss-examples/tree/main/whis...
yeah as i said, i couldn't figure out how to deploy whisper-diarization.
so you need python - a full install, and git. Doesn't matter OS. python venv (virtual environment) ensures that this folder, once it works, is locked to all the versions inside it, including the python version. this works for any software that uses pip to set up, or any python stuff in general.
your prompt should change to say
(whisper-diarization) <your OS prompt>$
now you can type
next time you want to use it, you can just do like
[0] To activate a Python virtual environment created with venv, use the command
on Linux or macOS, or
on Windows. This will change your terminal prompt to indicate that the virtual environment is active.
(the [0] note was 'AI generated' by DDG, but whatever, linux puts it in ./bin/activate and windows puts it in ./Scripts/activate.ps1 (ideally))
Deepgram does it.
sorry i meant locally hostable public. ill edit parent.