Comment by varispeed

3 days ago

There is nothing smart about current LLMs. They just regurgitate text compressed in their memory based on probability. None of the LLMs currently have actual understanding of what you ask them to do and what they respond with.

22 comments

varispeed

adamtaylor_13 3 days ago

If LLMs just regurgitate compressed text, they'd fail on any novel problem not in their training data. Yet, they routinely solve them, which means whatever's happening between input and output is more than retrieval, and calling it "not understanding" requires you to define understanding in a way that conveniently excludes everything except biological brains.

fennecbutt 1 day ago

I somewhat agree with you but I also realise that there are very few "novel" problems in the world. I think it's really just more complex problem spaces is all.
Same relative logic, just more of it/more steps or trials.
sfn42 3 days ago
Yes there are some fascinating emergent properties at play, but when they fail it's blatantly obvious that there's no actual intelligence nor understanding. They are very cool and very useful tools, I use them on a daily basis now and the way I can just paste a vague screenshot with some vague text and they get it and give a useful response blows my mind every time. But it's very clear that it's all just smoke and mirrors, they're not intelligent and you can't trust them with anything.
- pennomi 3 days ago
  
  When humans fail a task, it’s obvious there is no actual intelligence nor understanding.
  Intelligence is not as cool as you think it is.
  
  2 replies →
- atomicthumbs 2 days ago
  
  you'd think with how often Opus builds two separate code paths without feature parity when you try to vibe code something complex, people wouldn't regard this whole thing so highly
otabdeveloper4 2 days ago
> they'd fail on any novel problem not in their training data
Yes, and that's exactly what they do.
No, none of the problems you gave to the LLM while toying around with them are in any way novel.
- adamtaylor_13 2 days ago
  
  None of my codebases are in their training data, yet they routinely contribute to them in meaningful ways. They write code that I'm happy with that improves the codebases I work in.
  Do you not consider that novel problem solving?
  
  1 reply →
varispeed 3 days ago
They don't solve novel problems. But if you have such strong belief, please give us examples.
- ainch 2 days ago
  
  Depends how precisely you define novel - I don't think LLMs are yet capable of posing and solving interesting problems, but they have been used to address known problems, and in doing so have contributed novel work. Examples include Erdos Problem #728[0] (Terence Tao said it was solved "more or less autonomously" by an LLM), IMO problems (Deepmind, OpenAI and Huang 2025), GPT-5.2 Pro contributing a conjecture in particle physics[1], systems like AlphaEvolve leveraging LLMs + evolutionary algorithms to generate new, faster algorithms for certain problems[2].
  [0] https://mathstodon.xyz/@tao/115855840223258103
  [1] https://huggingface.co/blog/dlouapre/gpt-single-minus-gluons
  [2] https://deepmind.google/blog/alphaevolve-a-gemini-powered-co...

bsenftner 3 days ago

We know that, but that does not make them unuseful. The opposite in fact, they are extremely useful in the hands of non-idiots.We just happen to have a oversupply of idiots at the moment, which AI is here to eradicate. /Sort of satire.

visarga 3 days ago

So you are saying they are like copy, LLMs will copy some training data back to you? Why do we spend so much money training and running them if they "just regurgitate text compressed in their memory based on probability"? billions of dollars to build a lossy grep.

I think you are confused about LLMs - they take in context, and that context makes them generate new things, for existing things we have cp. By your logic pianos can't be creative instruments because they just produce the same 88 notes.

flamedoge 2 days ago

I have a gut feeling, huge portion of deficiencies we note with AI is just reflection of the training data. For instance, wiki/reddit/etc internet is just a soup of human description of the world model, not the actual world model itself. There are gaps or holes in the knowledge because codified summary of world is what is remarkable to us humans, not a 100% faithful, comprehensive description of the world. What is obvious to us humans with lived real world experience often does not make it into the training data. A simple, demonstrable example is whether one should walk or drive to car wash.

small_model 3 days ago

Thats not how they work, pro-tip maybe don't comment until you have a good understanding?

fyltr 3 days ago
Would you mind rectifying the wrong parts then?
- retsibsi 3 days ago
  
  Phrases like "actual understanding", "true intelligence" etc. are not conducive to productive discussion unless you take the trouble to define what you mean by them (which ~nobody ever does). They're highly ambiguous and it's never clear what specific claims they do or don't imply when used by any given person.
  But I think this specific claim is clearly wrong, if taken at face value:
  > They just regurgitate text compressed in their memory
  They're clearly capable of producing novel utterances, so they can't just be doing that. (Unless we're dealing with a very loose definition of "regurgitate", in which case it's probably best to use a different word if we want to understand each other.)
- mhl47 3 days ago
  
  The fact that the outputs are probabilities is not important. What is important is how that output is computed.
  You could imagine that it is possible to learn certain algorithms/ heuristics that "intelligence" is comprised of. No matter what you output. Training for optimal compression of tasks /taking actions -> could lead to intelligence being the best solution.
  This is far from a formal argument but so is the stubborn reiteration off "it's just probabilities" or "it's just compression". Because this "just" thing is getting more an more capable of solving tasks that are surely not in the training data exactly like this.
100721 3 days ago
Huh? Their words are an accurate, if simplified, description of how they work.
- fragmede 2 days ago
  
  The simplification is where it loses granularity. I could describe every human's life as they were born and then they died. That's 100% accurate, but there's just a little something lost by simplifying that much.

beyondCritics 3 days ago

Just HI slop. Ask any decent model, it can explain what's wrong this this description.