Comment by jackie293746
5 hours ago
Well since you work at a lab you should know that most capabilities arise in pretraining, not posttraining or mid training, and the latter two mostly function to bring out the hidden intelligence in these models more than anything else.
Source: also work at a lab.
No comments yet
Contribute on Hacker News ↗