← Back to context

Comment by magicalhippo

1 year ago

This is the tricky bit yes. I bought some Pi HATs[1] earlier, and they seem to work fine when testing, but haven't tried to use them properly yet. Alternatively I have some condenser microphones and an audio card HAT for the Pi.

So I hope to be able to use that with Pi Zero W. Played around with using PulseAudio so I could add compression plugins and such, which seems to work.

Haven't actually attempted wake-work yet, but first port of call there will be to try the ones Rhasspy uses[2].

Since I want to integrate with HA I might end up expanding on Rhasspy, or I might just use parts of Rhasspy, ie once I determine it's a HA command and not a question about dinosaurs say, inject the cleaned transcription into the Rhasspy pipeline.

[1]: https://www.seeedstudio.com/ReSpeaker-2-Mics-Pi-HAT.html

[2]: https://rhasspy.readthedocs.io/en/latest/wake-word/

I was just looking what it would take to have a local LLM assistant and ended up on ReSpeaker too.

I am just wrapping up another Pi Zero W audio recording project (https://github.com/pseudosavant/ps-audio-recorder). I used the first-gen Zero W since I had one already lying around. I'd strongly recommend using the newer Pi Zero 2 W though.

The Zero W CPU is just so slow. It can barely (e.g. 1.05x) do real-time MP3 encoding, and the single core makes it prone to being interrupted by background tasks. It couldn't even do SMB/Windows file sharing faster than 0.5MB/sec without disabling encryption in Samba. Encryption kills this CPU.

I have a couple of Zero 2s coming in the mail.