Comment by nayroclade
5 days ago
Is the approach fundamentally limited to smaller models? Or could you theoretically train a model as powerful as the largest models, but much faster?
5 days ago
Is the approach fundamentally limited to smaller models? Or could you theoretically train a model as powerful as the largest models, but much faster?
No comments yet
Contribute on Hacker News ↗