Comment by comova
2 months ago
I believe this is to improve performance by shortening the context window for long thinking processes. I don't think this is referring to real-time summarizing for the users' sake.
When you do a chat, are the reasoning traces for prior model outputs included in the LLM context?
No, they are normally stripped out.
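For illustration, here is a minimal sketch of what "stripped out" could mean in practice. It assumes a hypothetical message format where each assistant turn may carry a `reasoning` field holding its thinking trace; the field name and structure are assumptions, not any specific provider's API:

```python
# Hypothetical message format: assistant turns may carry a 'reasoning'
# field with the chain-of-thought produced alongside the visible reply.

def build_context(history: list[dict]) -> list[dict]:
    """Rebuild the message list for the next model call, dropping the
    reasoning traces attached to earlier assistant messages."""
    context = []
    for msg in history:
        # Keep role/content, strip the (assumed) 'reasoning' key.
        cleaned = {k: v for k, v in msg.items() if k != "reasoning"}
        context.append(cleaned)
    return context

history = [
    {"role": "user", "content": "What is 17 * 24?"},
    {"role": "assistant", "content": "408",
     "reasoning": "17*24 = 17*20 + 17*4 = 340 + 68 = 408"},
    {"role": "user", "content": "And divided by 2?"},
]
print(build_context(history))  # no 'reasoning' keys remain
```

Only the visible replies survive into the next turn's context, so long thinking traces don't accumulate across the conversation.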
> I don't think this is referring to real-time summarizing for the users' sake.
That's exactly what it's referring to.