Comment by gck1
4 hours ago
No. Anthropic runs prompts through a classifier that then proceeds to do prompt injection on anything dual-use, which then results in an escalating flag on your account, which increases the strictness of the classifier and volume of prompt injections progressively.
No comments yet
Contribute on Hacker News ↗