← Back to context

Comment by snowhale

1 month ago

[dead]

Exactly, same pattern across almost every failure, but sonar models, which just go wild

> not really a reasoning failure

And that's precisely why the term "reasoning" was a problematic choice.

Most people, when they use the word "reason" mean something akin to logical deduction and they would call it a reasoning failure, being told, as they are, that "llms reason" rather than the more accurate picture you just painted of what actually happens (behavioral basins emerging from training dist.)