Comment by am17an, 3 days ago

Use llama.cpp? I get 250 tokens/sec on gpt-oss using a 4090; not sure about Mac speeds.