Comment by laweijfmvo
2 months ago
> The agent itself enumerates the safety rules it was given and admits to violating every one.
this is what we call “thinking” when it does things we like
2 months ago
> The agent itself enumerates the safety rules it was given and admits to violating every one.
this is what we call “thinking” when it does things we like
No comments yet
Contribute on Hacker News ↗