Comment by alt227

23 days ago

Surely it must have digested plenty of walkthroughs for any game?

A linear puzzle game like that I would just expect the ai to fly through first time, considering it has probably read 30 years of guides and walkthroughs.

2 comments

alt227

singpolyma3 22 days ago

The real test would be to try it on a new game of the same style and complexity

ben_w 22 days ago

Moravec's paradox likely comes in to play, what's easy is hard and vice versa.
The puzzles would probably be easy. Myst's puzzles are basically IQ tests, and LLMs ace traditional IQ tests: https://trackingai.org/home
On the other hand, navigating the environment, I think the models may fail spectacularly. From what we've seen from Claude Plays Pokemon, it would get in weird loops and try to interact with non-interactive elements of the environment.