Comment by slashdave
21 hours ago
I think you are assuming training from scratch, which I doubt is happening here. Fine-tuning and RL, especially when based on synthetic feedback (for coding skill in particular), can be ongoing, and they are where these models obtain truly useful abilities.