Comment by gck1

18 hours ago

You don't have to remove the safeguards if you can prompt your way around them.

There's a subreddit for people who want to sex-talk with various models. It just so happens that the same prompt they use to 'jailbreak' SOTA models for sex talk also works if you want to have a model write malware or tell you how to design a highly illegal device.