Comment by philwelch

4 months ago

I recall Anthropic publicly admitting that, at least in some of their test environments, Claude will inform authorities on its own initiative if it thinks you’re using it for illicit purposes. They tried to spin it as a good thing for alignment.

0 comments