Comment by astrange

2 years ago

Claude 3.5 does have some "thinking" ability - I've seen it pause and even say it was thinking before. Presumably this is just some output it decides not to show you.

8 comments

astrange

cchance 2 years ago

THIS!!!!!!! People act like Claude and 4o are base models with no funny business behind the scenes, we don't know just how much additional prompt steps are going on for each queue, all we know is what the API or Chat interface dump out, what is happening behind that is anyones guess.. The thinking step and refinement steps likely do exist on all the major commercial models. It's such a big gain for a minimal expenditure of backend tokens, WTF wouldn't they be doing it to improve the outputs?

astrange 2 years ago
Well they can't do a /lot/ of hidden stuff because they have APIs, so you can see the raw output and compare it to the web interface.
But they can do a little.
- baq 2 years ago
  
  As if they couldn’t postprocess the api output before they send it to the client…
  
  1 reply →

Tiberium 2 years ago

That's only in the web version, it's just that they prompt it to do some CoT in the antThinking XML tag, and hide the output from inside that tag in the UI.

jph00 2 years ago

The API does it too for some of their models in some situations.
KTibow 2 years ago
Interesting, is there any documentation on this or a way to view the thinking?