Comment by djmips
10 months ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
10 months ago
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.
No comments yet
Contribute on Hacker News ↗