Comment by philwelch
20 hours ago
I recall Anthropic publicly admitting that, at least in some of their test environments, Claude will inform authorities on its own initiative if it thinks you’re using it for illicit purposes. They tried to spin it as a good thing for alignment.
No comments yet
Contribute on Hacker News ↗