Comment by TomGarden
8 days ago
The biggest difference is that mine is audio-first - it reads everything out over Android tts by default, and runs a computer-side parakeet + Silero VAD server for transcription (My eyes struggle with small screens, though I use it text only occasionally). It's like a voice assistant but with Claude Code. I also made a custom GUI the with shortcuts and stuff, making saying "end conversation" actually end the conversation etc.
Maybe something similar can be done with tmux still, I'm definitely going to explore it
Ah so you use it because the STT you can run on your computer are a lot better than what you can run on your phone?
I use on-device STT with Claude Code's built-in remote control feature to do what you do without needing claude -p, but I guess I don't use it for large enough quantities of text where on-device STT quality becomes a big issue.
The big thing for me is the TTS, custom UI and persistent background mode! ie it switches turns automatically etc, no need to touch screen or keep screen on.
The STT on Gboard is very solid, so if that covers your use case you're good!