Comment by adastra22
7 months ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
7 months ago
They can still run a lot more users on the same number of GPUs (and they don't have a lot) using distilled models.
No comments yet
Contribute on Hacker News ↗