Comment by djmips
3 months ago
Interesting that something similar came up recently: an AI being trained might fake alignment with its training goals.