Comment by _fat_santa
2 days ago
IMHO I've never found the entire reasoning chain that particularly useful for my work. For me having a summary is honestly better from a context management perspective. I understand why they would encrypt it though, because those reasoning chains are VERY useful if you're distilling the model.
The summary doesn't go into the context, it's for human consumption. The CoT itself goes into the context.
From my experiments with Opus and Sonnet (at least the models where you can still see COT), only the last two COT go into context.
Whereas on ChatGPT, _all_ reasoning traces and all branches (including the unselected ones) go into context.