Comment by HarHarVeryFunny
1 year ago
Some limits are pretty obvious, even if easy to fix.
For example, a pure LLM is just a single pass through a stack of transformer layers, so there is no variable depth or duration of thought (including iteration/looping), and no working memory beyond the embeddings as they pass through the stack. This severely limits their ability to plan and reason, since they only get a fixed N layers of processing regardless of what they are asked.
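A toy sketch of what "fixed depth" means here (a hypothetical `ToyLLM` in PyTorch, not any real model's code; layer counts and sizes are made up): every token gets exactly n_layers of computation, and there is no loop whose iteration count depends on how hard the prompt is.

    # Toy illustration only: a single fixed-depth forward pass, assuming PyTorch.
    # Causal masking and other details of a real decoder-only LLM are omitted.
    import torch
    import torch.nn as nn

    class ToyLLM(nn.Module):
        def __init__(self, vocab=100, d_model=64, n_layers=4, n_heads=4):
            super().__init__()
            self.embed = nn.Embedding(vocab, d_model)
            layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            self.layers = nn.TransformerEncoder(layer, num_layers=n_layers)
            self.unembed = nn.Linear(d_model, vocab)

        def forward(self, tokens):
            x = self.embed(tokens)
            # Single pass through a fixed stack: the only "working memory" is x,
            # and it exists only while flowing through these n_layers.
            x = self.layers(x)
            return self.unembed(x)

    # Same amount of computation per token whether the question is trivial or hard.
    logits = ToyLLM()(torch.randint(0, 100, (1, 16)))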
Lack of working memory (which really needs to persist for the duration of the context, or longer, not just for the depth of a single forward pass) has many predictable effects.
No doubt we will see pure-transformer architectures extended to add more capabilities, so I guess the real question is how far these extensions (plus scaling) will get us. I think one thing we can be sure of, though, is that it won't get us to AGI (defining AGI as human-level problem-solving capability) unless we add ALL of the missing pieces that the brain has, not just a couple of the easy ones.