Comment by Tepix

9 months ago

Why would you give up your privacy by sending what interests you to OpenAI when whisper doesn't need that much computer in the first place?

With faster-whisper (int8, batch=8) you can transcripe 13 minutes of audio in 51 seconds on CPU.

I am a blue collar electrician. Not a coder (but definitely geeky).

Whisper works quite well on Apple Silicon with simple drag/drop install (i.e. no terminal commands). Program is free; you can get an M4 mini for ~$550; don't see how an online platform can even compete with this, except for one-off customers (i.e. not great repeat customers).

We used it to transcribe ddaayyss of audio microcassettes which my mother had made during her lifetime. Whisper.app even transcribed a few hours that are difficult to comprehend as a human listener. It is VERY fast.

I've used the text to search for timestamps worth listening to, skipping most dead-space (e.g. she made most while driving, in a stream of not-always-focused consciousness).

I came here to ask the same question. This is a well-solved problem, red queen racing it seems utterly pointless, a symptom of reflexive adversarialism.

  • > symptom of reflexive adversarialism

    Is there a definition for this expression? I don't catch you.

    > ... using corporate technology for the solved problem is a symptom of self-directed skepticism by the user against the corporate institutions ...

    Eh?

    • Reflexively adopting an adversarial approach (trying to outwit Open AI by using sped-up audio) instead of being content with the perfectly reliable, free, and quite efficient solution that already exists.

      >> ... using corporate technology for the solved problem is a symptom of self-directed skepticism by the user against the corporate institutions ...

      > Eh?

      I don't know who wrote that or why you pasted in response to me.