Comment by Sean-Der
4 days ago
I think you would need the ESP32 to connect to another host. Doing Speech-to-Text, LLM, Text-to-speech is pretty intensive. Even if you connect to a Raspberry Pi.
But totally possible! It's a great idea and would love too help you build it :)
Wire some Open Source together and just start with a small collection of ogg files.
There was one a startup called Snips https://snips.ai/ which made an open-source voice recognition engine running on an RPi.