Comment by icelancer

15 days ago

Their compaction endpoint is far and away the best in the industry. Claude's has to be dead last.

13 comments

icelancer

Help me understand, how is a compaction endpoint not just a Prompt + json_dump of the message history? I would understand if the prompt was the secret sauce, but you make it sound like there is more to a compaction system than just a clever prompt?

FuckButtons 15 days ago
They could be operating in latent space entirely maybe? It seems plausible to me that you can just operate on the embedding of the conversation and treat it as an optimization / compression problem.
- e1g 14 days ago
  
  Yes, Codex compaction is in the latent space (as confirmed in the article):
  > the Responses API has evolved to support a special /responses/compact endpoint [...] it returns an opaque encrypted_content item that preserves the model’s latent understanding of the original conversation
  
  7 replies →
Art9681 14 days ago

Their models are specifically trained for their tools. For example the `apply_patch` tool. You would think it's just another file editing tool, but its unique diff format is trained into their models. It also works better than the generic file editing tools implemented in other clients. I can also confirm their compaction is best in class. I've imlemented my own client using their API and gpt-5.2 can work for hours and process millions of input tokens very effectively.
EnPissant 15 days ago

Maybe it's a model fine tuned for compaction?

kordlessagain 15 days ago

Yes, agree completely.