Comment by LTL_FTC
1 month ago
I linked results where the user ran Kimi k2 across his 8-node cluster. Inference results are listed for 1,10,100 concurrent requests.
Edit to add:
Yeah, those stations with the GB300 look more along the lines of what I would want as well but I agree, they’re probably way beyond my reach.
No comments yet
Contribute on Hacker News ↗