Comment by djmips
1 year ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
1 year ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
No comments yet
Contribute on Hacker News ↗