Comment by fennecfoxy
9 days ago
Depends on quantization etc. But there are good calculators that will calculate for your KV cache etc as well: https://apxml.com/tools/vram-calculator.
9 days ago
Depends on quantization etc. But there are good calculators that will calculate for your KV cache etc as well: https://apxml.com/tools/vram-calculator.
No comments yet
Contribute on Hacker News ↗