← Back to context Comment by byzantinegene 7 hours ago we're already doing that, it's called distillation and how models like deepseek are trained. 0 comments byzantinegene Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗