Comment by mxwsn

1 day ago

No. LLMs are trained on more tokens than they have parameters, so they are in the classical first-descent regime.
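
A rough way to sanity-check the tokens-versus-parameters comparison is to divide training tokens by parameter count. The sketch below uses approximate, publicly reported figures for GPT-3 and Chinchilla; the numbers, the `MODELS` table, and the helper names are illustrative assumptions, and treating one token as one training sample is a simplification.

```python
# Approximate, publicly reported figures; treat them as illustrative assumptions.
MODELS = {
    "GPT-3": {"params": 175e9, "tokens": 300e9},
    "Chinchilla": {"params": 70e9, "tokens": 1.4e12},
}


def tokens_per_param(params: float, tokens: float) -> float:
    """Return the number of training tokens per model parameter."""
    return tokens / params


def regime(params: float, tokens: float) -> str:
    """Label the setting by a naive count of tokens against parameters."""
    return "under-parameterized" if tokens > params else "over-parameterized"


if __name__ == "__main__":
    for name, m in MODELS.items():
        ratio = tokens_per_param(**m)
        print(f"{name}: {ratio:.1f} tokens/param -> {regime(**m)}")
```

By this naive count both models see more tokens than they have parameters, which is the situation described above. Whether raw token count is the right notion of sample size for double descent is a separate question this sketch does not settle.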
