Comment by rfoo
13 days ago
I don't. I have no problem not running open-weight models myself, because there's an efficiency gap of two orders of magnitude between a "pretend-I-can" solution and running them on hundreds of H100s for high thousands of users.