Comment by CooCooCaCha
5 months ago
Actually it makes total sense to hide chains of thought.
A private chain of thought doesn't have to be constrained by alignment training. That actually sounds beneficial, given that RLHF has been shown to decrease model performance on some tasks.