Comment by vessenes
3 days ago
That 128k is a reference to the context window — how many tokens you put in to the start. Presumably Grok 4 with 128k context window is running on less hardware (it needs much less RAM than 256k) and they route it accordingly internally.
No comments yet
Contribute on Hacker News ↗