Comment by zahlman

13 hours ago

What exactly do you want the pipeline to do that cares about the input being "speech", or indeed that's different from just sending mic -> speaker directly? (I can imagine a few different things, but I want to figure out if your use case sounds like mine, or what suggestions are appropriate for what tasks.)