Comment by adastra22
6 months ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
6 months ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
No comments yet
Contribute on Hacker News ↗