Comment by empath75

1 day ago

If anyone wants an example of actual jailbreak in the wild that uses this technique (NSFW):

https://www.reddit.com/r/persona_AI/comments/1nu3ej7/the_spi...

This doesn't work with gpt5 or 4o or really any of the models that do preclassification and routing, because they filter both the input and the output, but it does work with the 4.1 model that doesn't seem to do any post-generation filtering or any reasoning.