Comment by anandnair
5 months ago
We don't even need that example. The example is in front of us. Take a model with fewer parameters and ask it to do the same complex task that a larger model handled. It will struggle.
Btw, I'm not saying it's just the number of parameters that matters.