Comment by orbital-decay 3 days ago It really isn't, you can improve by distilling a weaker model 1 comment orbital-decay Reply anon373839 3 days ago Self-distillation is also a technique.
Self-distillation is also a technique.