Comment by marcosdumay
9 hours ago
> Most research converges to the idea that RL on synthetic data makes models worse, not better.
You are missing a mountain of nuance by generalizing the existence of a hole there.
9 hours ago
> Most research converges to the idea that RL on synthetic data makes models worse, not better.
You are missing a mountain of nuance by generalizing the existence of a hole there.
No comments yet
Contribute on Hacker News ↗