← Back to context

Comment by rybosworld

16 hours ago

So an elaborate version of Asimov's Laws of Robotics?

A bit worrying that model safety is approached this way.

One has to wonder, what if a pedophile had an access to nuclear launch codes, and our only hope would be a Claude AI creating some CSAM to distract him from blowing up the world.

But luckily this scenario is already so contrived that it can never happen.

Isn't it a good sign? The Laws of Robotics seems like a slam dunk baseline, and the issues and subtleties of it has been very thoughtfully mapped out in Asimovs short story collection.

  • The whole point of those books was to explore the places where those laws produced unexpected behaviour, so they are clearly not sufficient. I would argue those books are actually about demonstrating that it is very hard to build an ethical system out of rules.