Comment by janalsncm
3 days ago
Their R1 paper was really well done, but I think it leaves out a few details necessary for stable training.