Comment by Aurornis

1 day ago

> 2. The improvement would come from merging the weights PLUS on-policy distillation. The confusion is that the uploaded model didn't have the distillation at all.

They merged the base model with another lab’s fine-tuned model. The improvements could have come from picking up some of the other model’s fine-tuned weights.

If they really had a better performing model that they “accidentally” forgot to upload, they could have uploaded the correct file by now.

2 comments

croes 18 hours ago

Seems they did

https://news.ycombinator.com/item?id=48529544

ipieter 18 hours ago

I only see an edit to the readme (13h ago) and removal of the weights, so the repo is now empty.
I am willing to give them the benefit of the doubt, but we've seen this before: a model gets released that is supposedly state-of-the-art, yet seems to be another repackaged model without any training. Reflection 70B was the most similar example; all they need now is an API that rewrites "Claude" to "Rio".