Comment by johnfn
2 days ago
That's a fun idea. How do you get the transcript into Claude Code (or whatever you use)? What transcription service do you use?
2 days ago
That's a fun idea. How do you get the transcript into Claude Code (or whatever you use)? What transcription service do you use?
I use the Raycast + Whisper Dictation. I don't think there is anything novel about it, but it integrates nicely into my workflow.
My main gripe is when the recording window loses focus, I haven't found a way to bring it back and continue the recorded session. So occasionally I have to start from scratch, which is particularly annoying if it happens during a long-winded brain dump.
I'm not the person you're replying to, but I use Whispering connected to the whisper-large-v3-turbo model on Groq.
It's incredibly cheap and works reliably for me.
I have got it to paste my voice transcriptions into Chrome (Gemini, Claude, ChatGPT) as well as Cursor.
https://github.com/EpicenterHQ/epicenter
I use Handy with Claude code. Nice to just have a key combo to transcribe into whatever has focus.
https://github.com/cjpais/Handy
Love handy. I use it too when dealing with LLMs. The other day I asked chatgpt to generate interview questions based on job description and then I answered using handy. So cool!
I use Spokenly with local Parakeet 0.6B v3 model + Cerebras gpt-oss-120b for post-processing (cleaning up transcription errors and fixing technical mondegreens, e.g., `no JS` → `Node.js`). Almost imperceptible transcription and processing delay. Trigger transcription with right ⌥ key.
According to Google this is the first time the phrase "technical mondegreens" was ever used. I really like it.
I built my own open-source tool to do exactly this so that I can run something like `claude $(hns)` in my terminal and then I can start speaking, and after I'm done, claude receives the transcript and start working. See this workflow here: https://hns-cli.dev/docs/drive-coding-agents/
your OS might have a built in dictation thing. Google for that and try it before online services.
There are a few apps nowadays for voice transcription. I've used Wispr Flow and Superwhisper, and both seem good. You can map some hotkey (e.g., ctrl + windows) to start recording, then when you press it again to stop, it'll get pasted into whatever text box you have open
Superwhisper offers some AI post-processing of the text (e.g., making nice bullets or grammar), but this doesn't seem necessary and just makes things a bit slower
+1 for Superwhisper. It has an offline model for transcription. And it transcribes with very high accuracy for me and great speed.
I do the same. On Mac I use macwhisper. The transcription does not have to be correct. Lots of times it writes the wrong word when talking about technical stuff but Claude understands which word I mean from context
made this tool to press double control to start and another ctrl to stop which copies to the cliboard
https://github.com/elv1n/para-speak/
So cool man! Had to add couple fixes to be able to use it on mac. Love it!
I use VoiceInk (needed some patches to get it to compile but Claude figured it out) and the Parakeet V3 model. It’s really good!