Comment by nomel
1 day ago
Yes, perfection is difficult, but it's relative. It can definitely be made much safer. Looking at the analysis of pre vs post alignment makes this obvious, including when the raw unaligned models are compared to "uncensored" models.
No comments yet
Contribute on Hacker News ↗