Comment by plufz
3 hours ago
What would happen if they mass-distilled one of the really large local models, like GLM 700B or DeepSeek 1.6T?
At that point you might as well just host them yourself.
That's not how innovation works.
Innovation is training your model on stolen data from literally everywhere but other models.