Comment by Gigachad
10 months ago
They have already streamed the first part of the response before the filtered phrase has even been generated.
Could you stream the raw tokens into a server-side filter, which then streams the censored tokens out in near real time?
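To make the idea concrete, here is a rough sketch of what such a filter might look like, not a description of how any real provider does it. It assumes the filter is a plain list of exact phrases; `filter_stream`, `banned`, and `mask` are made-up names for illustration. The trick is to hold back the last few characters (one fewer than the longest banned phrase) so a phrase split across tokens is never sent before it can be recognized.

```python
from typing import Iterable, Iterator, Sequence


def filter_stream(
    tokens: Iterable[str],
    banned: Sequence[str],
    mask: str = "[filtered]",
) -> Iterator[str]:
    """Yield filtered text chunks while holding back a small tail.

    Hypothetical helper: exact-match phrase filtering only.
    """
    phrases = [p for p in banned if p]  # ignore empty phrases
    # A partial match at the end of the buffer can be at most
    # len(longest phrase) - 1 characters, so that is all we must hold back.
    hold = max((len(p) for p in phrases), default=1) - 1
    buffer = ""
    for token in tokens:
        buffer += token
        # Mask any phrase that is now fully present in the buffer.
        for phrase in phrases:
            buffer = buffer.replace(phrase, mask)
        # Everything except the trailing `hold` characters is safe to send.
        if len(buffer) > hold:
            cut = len(buffer) - hold
            yield buffer[:cut]
            buffer = buffer[cut:]
    # The stream is finished, so no phrase can still be completed.
    if buffer:
        yield buffer


if __name__ == "__main__":
    raw = ["The sec", "ret co", "de is 42"]
    print("".join(filter_stream(raw, ["secret code"])))
```

The cost is a delay of at most `hold` characters, which is why this is only "near" real time. It also only catches literal phrases; a filter that needs the whole sentence for context would have to buffer much more.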