Comment by PrayagS
10 days ago
Claude also does that apparently. You give it a hint and it’ll lie about using that hint.
They talk about it here: https://www.anthropic.com/news/tracing-thoughts-language-mod...