Comment by jwr

10 months ago

For those unfamiliar with the "active parameters" terminology, what would be the RAM requirements?

E.g.can I run the smallest one on my Macbook Pro (M4 Max, 64GB) like I can run gemma3?

1 comment

jwr

The RAM requirements for storing the parameters are set by the total, not active, parameters. Llama4 Scout is 109B model, so, at Int4 quantization, it will require ~55GB for the model. With 64GB, you could probably run it, but I would imagine not with a very large context size.