Comment by mxwsn
1 day ago
No, there are more training tokens than parameters in LLMs. They are in the classical first descent setting.
1 day ago
No, there are more training tokens than parameters in LLMs. They are in the classical first descent setting.
No comments yet
Contribute on Hacker News ↗