← Back to context

Comment by gs17

18 hours ago

> 1T total / 32B active MoE model

Is this the largest open-weight model?

I believe so.

Grok-1 is 341B, DeepSeek-v3 is 671B, and recent new open weights models are around 70B~300B.