Comment by someone3876
17 days ago
Have you actually used Gemini? I use it a lot for translation, and its effective context window is more like 150k tokens, not the 2M they advertise.
My apologies to all the D&D sessions which take more than 150k tokens, then.
Be that as it may, good long-context models are not a mirage. By, say, late 2027, once the LLM providers figure out that they're using the wrong samplers, they will figure out how to get you 2 million coherent output tokens per LLM call.
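For anyone unsure what "sampler" means here: it's the rule that picks the next token from the model's output distribution at each step. Below is a minimal sketch of one common scheme, temperature scaling plus nucleus (top-p) truncation, over a toy logits vector; the function name and parameter values are hypothetical illustrations, not any provider's actual defaults or API.

```python
import numpy as np

def sample_token(logits, temperature=0.8, top_p=0.95, rng=None):
    """Temperature + nucleus (top-p) sampling for a single decoding step.

    Illustrative sketch only: the comment's claim is that the choice of
    rules like these affects how coherent very long generations stay.
    """
    rng = rng or np.random.default_rng()
    # Temperature scaling: lower values sharpen the distribution.
    scaled = logits / temperature
    probs = np.exp(scaled - np.max(scaled))  # stable softmax
    probs /= probs.sum()
    # Nucleus truncation: keep the smallest set of tokens whose
    # cumulative probability reaches top_p, zero out the rest.
    order = np.argsort(probs)[::-1]
    cum = np.cumsum(probs[order])
    keep = order[: np.searchsorted(cum, top_p) + 1]
    truncated = np.zeros_like(probs)
    truncated[keep] = probs[keep]
    truncated /= truncated.sum()
    return rng.choice(len(probs), p=truncated)

# Example: sample from a toy 5-token vocabulary.
print(sample_token(np.array([2.0, 1.0, 0.5, 0.1, -1.0])))
```

Whether current defaults are "wrong" in the way the comment predicts is an open empirical question; the sketch is just to make the object of that prediction concrete.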