Comment by Gigachad
1 year ago
They have already streamed the first part of the response before the filtered phrase has even been generated.
1 year ago
They have already streamed the first part of the response before the filtered phrase has even been generated.
Could you stream the raw tokens into a server side filter which then streams censored tokens at near real time?