Comment by laweijfmvo
12 hours ago
> The agent itself enumerates the safety rules it was given and admits to violating every one.
this is what we call “thinking” when it does things we like
12 hours ago
> The agent itself enumerates the safety rules it was given and admits to violating every one.
this is what we call “thinking” when it does things we like
No comments yet
Contribute on Hacker News ↗