Comment by xmcqdpt2
3 months ago
The paper is great. It really shows how alignement is entirely surface level and not actually deeply ingrained in the models. Really interesting work.
3 months ago
The paper is great. It really shows how alignement is entirely surface level and not actually deeply ingrained in the models. Really interesting work.
No comments yet
Contribute on Hacker News ↗