Comment by jwr
14 days ago
For those unfamiliar with the "active parameters" terminology, what would be the RAM requirements?
E.g.can I run the smallest one on my Macbook Pro (M4 Max, 64GB) like I can run gemma3?
14 days ago
For those unfamiliar with the "active parameters" terminology, what would be the RAM requirements?
E.g.can I run the smallest one on my Macbook Pro (M4 Max, 64GB) like I can run gemma3?
The RAM requirements for storing the parameters are set by the total, not active, parameters. Llama4 Scout is 109B model, so, at Int4 quantization, it will require ~55GB for the model. With 64GB, you could probably run it, but I would imagine not with a very large context size.