← Back to context Comment by ranger_danger 5 days ago with regular llama.cpp on a 3070ti I get 60tok/s TG with the 9B model, it's quite impressive. 0 comments ranger_danger Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗