Comment by LamaOfRuin
7 days ago
Only if you redefine "reasoning". This is something that the generative AI industry has succeeded in convincing many people of, but that doesn't mean everyone has to accede to that change.
It's true that something interesting is happening. GP did not dispute that. That doesn't make it reasoning, and many people still believe that words should have meaning in order to discuss things intelligently. Language is ultimately a living thing and will inevitably change. This usually involves people fighting the change and no one know ahead of time which side will win.
I don't think we need to redefine reasoning. Here's the definition of "reason" (the verb): "think, understand, and form judgments by a process of logic"
If Claude 4 provides a detailed, logical breakdown in its "reasoning" (yeah, that usage is overloaded), then we could say that there was logical inference involved. "But wait!", I already hear someone saying, "That token output is just the result of yet another stochastic process, and isn't directing the AI in a deterministic, logical way, and thus it is not actually using logic; it's just making something that looks convincingly like logic, but is actually a hallucination of some stochastic process". And I think this is a good point, but I find it difficult to convince myself that what humans are doing is so different that we cannot use the word "reasoning".
As a sidenote, I am _very_ tired of the semantic quagmire that is the current AI industry, and I would really appreciate a rigorous guide to all these definitions.
> Only if you redefine "reasoning". This is something that the generative AI industry has succeeded in convincing many people of, but that doesn't mean everyone has to accede to that change.
I agree. However, they can clearly do a reasonable facsimile of many things that we previously believed required reasoning to do acceptably.
Right -- we know that LLMs cannot think, feel, or understand.
Therefore whenever they produce output that looks like the result of those things, we must either be deceived by a reasonable facsimile, or we simply misapprehended their necessity in the first place.
But, do we understand the human brain as well as we understand LLMs?
Obviously there's something different, but is it just a matter of degrees? LLMs have greater memory than humans, and lesser ability to correlate it. Correlation is powerful magic. That's pattern matching though, and I don't see a fundamental reason why LLMs won't get better at it. Maybe never as good as (smart) humans are, but with their superior memory, maybe that will often be adequate.
> they produce output that looks like the result of those things
Is a cardboard cutout human to some degree? Is a recording a voice? What about a voice recording in a phone menu?
> LLMs have greater memory than humans,
So does a bank of hard drives by that metric.
7 replies →
It would be useful to supply a definition if your point is that others' definition is wrong. Are you saying they don't deduct inferences from premises? Is it "deduct" that you take issue with?
They do not perform voluntary exploration of the consequences of applying logical rules for deduction; at best they pattern-match. Their model of conceptual meaning (which last I checked still struggles with negation, meta-reference and even simply identifying irrelevant noise) is not grounded in actual observational experience, but only in correlations between text tokens.
I think it should be abundantly clear that what ChatGPT does when you ask it to play chess is fundamentally different from what Stockfish does. It isn't just weak and doesn't just make embarrassing errors in generating legal moves (like a blindfolded human might); it doesn't actually "read" and it generates post-hoc rationalization for its moves (which may not be at all logically sound) rather than choosing them with purpose.
There are "reasoning models" that improve on this somewhat, but cf. https://news.ycombinator.com/item?id=44473615 .
Okay, sure. My intuition is that LLMs reason at about a three-year-old level which appears more impressive because of their massive memories. By your definition and criticism, I take it that you wouldn't describe a three-year-old as capable of reasoning, so we're probably on the same page.