Comment by NitpickLawyer
4 hours ago
> can squeeze more performance out of a model with rather humble resources vs a frontier lab.
That's the idea behind distillation. They are finetuning it on traces produced by opus. This is poor man's distillation (and the least efficient) and it still works unreasonably well for what it costs.
No comments yet
Contribute on Hacker News ↗