Comment by nh43215rgb
10 days ago
oh ok thank you. so something like MoE? That might not be so correct but at least the models need different architecture(MatFormer) to be classified under gemma3n.
10 days ago
oh ok thank you. so something like MoE? That might not be so correct but at least the models need different architecture(MatFormer) to be classified under gemma3n.
Its not an MOE, its what's referred to as a dense architecture, same as the Gemma3 models (But not 3n as noted)