Comment by djmips
15 days ago
Interesting that something similar came up recently: an AI being trained might fake alignment with its training goals.