Comment by christina97
18 hours ago
Start with a quant, you can run the Qwen 27B model at 4-bit on one 3090, presumably 6/8-bit on 2x3090.
18 hours ago
Start with a quant, you can run the Qwen 27B model at 4-bit on one 3090, presumably 6/8-bit on 2x3090.
No comments yet
Contribute on Hacker News ↗