Comment by rukuu001
1 year ago
Working on this for an internal tool - detecting no speech has been a PITA so far. Interested to see how you go with this.
1 year ago
Working on this for an internal tool - detecting no speech has been a PITA so far. Interested to see how you go with this.
Use the voice activity detector we wrote for Home Assistant. It works very well: https://github.com/rhasspy/pymicro-vad
What if I'm watching TV and use the AI to control it ? It should only react to my voice (a problem I had that forced me to use a wake word).
currently we are using silero VAD to detect speech: https://github.com/livekit/agents/blob/main/livekit-plugins/...
it works well for voice activity; though it doesn't always detect end-of-turn correctly (humans often pause mid-sentence to think). we are working on improving this behavior.