Comment by zamalek

5 days ago

I used the mathematics example only because the GP did. There are many other examples of LLMs failing to reason, including in some papers (as recent as this February).

There are many examples of current limitations, but do you see a reason to think they are fundamental limitations? (I'm not saying they aren't, I'm curious what the evidence is for that.)

  • It's because of how transformers work, especially the fact that the output layer produces a probability distribution over tokens from which we quite literally make a weighted random choice (a minimal sketch of that step is below). My hunch is that diffusion models would have a better chance of doing real reasoning - or something like a latent space for reasoning.

    Thinking that LLMs are intelligent arises from an incomplete understanding of how they work or, alternatively, from having shareholders to keep happy.
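
For what it's worth, here is a minimal sketch of that "weighted random choice" step. The four-token vocabulary, the logit values, and the function name `sample_next_token` are made up for illustration; real models apply a softmax over tens of thousands of tokens, usually with temperature and top-k/top-p filtering on top.

```python
import math
import random

def sample_next_token(logits, temperature=1.0):
    """Weighted random choice over a toy vocabulary.

    `logits` stands in for the raw per-token scores an LLM's
    output layer produces for the next-token position.
    """
    # Softmax with temperature: lower T sharpens the distribution
    # toward the highest-scoring token, higher T flattens it.
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # The weighted random draw the comment describes:
    return random.choices(range(len(logits)), weights=probs, k=1)[0]

# Purely illustrative values, not from any real model.
vocab = ["cat", "dog", "the", "ran"]
logits = [2.0, 1.5, 0.3, -1.0]
print(vocab[sample_next_token(logits, temperature=0.8)])
```

Run it a few times and the output varies: the same logits can yield "cat" on one call and "dog" on the next, which is the randomness the comment is pointing at.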