Comment by adastra22
1 year ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
1 year ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
No comments yet
Contribute on Hacker News ↗