Comment by losvedir
11 hours ago
Er, then what is the "already trained" model? I thought pre-training was the gradient descent through the internet part of building foundational models.
11 hours ago
Er, then what is the "already trained" model? I thought pre-training was the gradient descent through the internet part of building foundational models.
No comments yet
Contribute on Hacker News ↗