Comment by minimaxir
13 hours ago
Due to the increasing difficulty of scaling up training, it appears the gains are instead being achieved through better model training which appears to be working well for everyone.
13 hours ago
Due to the increasing difficulty of scaling up training, it appears the gains are instead being achieved through better model training which appears to be working well for everyone.
No comments yet
Contribute on Hacker News ↗