Comment by djmips
8 days ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
8 days ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
No comments yet
Contribute on Hacker News ↗