Comment by tasuki
18 hours ago
> enumerating the specific safety rules it had violated.
That's not how safety works at all. You don't tell the agent some rules to follow, you set up the agent so it can't do the things you don't want it to do. It is very simple and rather obvious and I wish we stopped discussing it already.
No comments yet
Contribute on Hacker News ↗