← Back to context

Comment by throwawaymaths

3 days ago

do you know if any current locally hostable public transcribers are good at diarization? for some tasks having even crude diarization would improve QOL by a huge factor. i was looking at a whisper diarization python package for a bit but it was a bitch to deploy.

WhisperX! https://github.com/basetenlabs/truss-examples/tree/main/whis...

  • yeah as i said, i couldn't figure out how to deploy whisper-diarization.

    • so you need python - a full install, and git. Doesn't matter OS. python venv (virtual environment) ensures that this folder, once it works, is locked to all the versions inside it, including the python version. this works for any software that uses pip to set up, or any python stuff in general.

        git clone <whisper-diarization.git URL>
        cd whisper-diarization
        python -m venv .
        cd scripts
        # and then depending on your OS it's activate.sh, activate.ps1, activate.bat, etc. so on linux [0] 
      

      your prompt should change to say

      (whisper-diarization) <your OS prompt>$

      now you can type

        cd ..
        pip install -c constraints.txt -r requirements.txt
        python ./diarize.py --no-stem --suppress_numerals --whisper-model large-v3-turbo --device cuda -a <FILE>
      

      next time you want to use it, you can just do like

        cd ~/whisper-diarization
        scripts/activate.sh (or whatever) [0]
        python ./diarize.py [...]
      
      

      [0] To activate a Python virtual environment created with venv, use the command

        source venv/bin/activate 
      

      on Linux or macOS, or

        venv\Scripts\activate 
      

      on Windows. This will change your terminal prompt to indicate that the virtual environment is active.

      (the [0] note was 'AI generated' by DDG, but whatever, linux puts it in ./bin/activate and windows puts it in ./Scripts/activate.ps1 (ideally))