Comment by metalliqaz
2 hours ago
I run models in the ~120B class on my old server (96GB DDR4) and it manages about 3-3.5 tok/sec. It is indeed painfully slow to watch, but I find that if I walk away or bury the window and do something else, it always seems to be done when I check back.