Comment by Folcon

11 hours ago

   "One thing I found fascinating about watching Claude play is it wouldn't play around and experiment the way I'd expect a human to? It would stand still still trying to work out what to do next, move one square up, consider a long time, move one square down, and repeat. When I'd expect a human to immediately get bored and go as far as they could in all directions to see what was there and try interacting with everything. Maybe some cognitive analogue of boredom is useful for avoiding loops?"
    - FiftyTwo[0]

I'm wondering if this is a function of our training methods? They're penalised so heavily for making "wrong moves" that they don't experiment?

[0]: https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-i...