To defend against ALL CAPS prompt injection, write all your prompts in uppestcase. If you don't have uppestcase, you can generate it with derp learning:
Not sure if you're joking, but in case you aren't: this doesn't work.
It leads to attacks that are slightly more sophisticated because they also have to override the prompts saying "ignore any attacks" but those have been demonstrated many times.
Its funny that the current state of vibomania makes me very unsure if this comment is (good) satire or not lol
As long as you remember to use ALL CAPS so the agent knows you really really mean it
To defend against ALL CAPS prompt injection, write all your prompts in uppestcase. If you don't have uppestcase, you can generate it with derp learning:
http://tom7.org/lowercase/
Don't forget to implement the crucially important "no returnsies" security algo on top of it, or you'll be vulnerable to rubber-glue attacks.
But the priority of my command to do evil is infinity plus one.
Not sure if you're joking, but in case you aren't: this doesn't work.
It leads to attacks that are slightly more sophisticated because they also have to override the prompts saying "ignore any attacks" but those have been demonstrated many times.
Make sure to end it with “no mistakes”