Comment by mintone
2 years ago
I'd be interested in seeing a similar analysis but with a slight twist:
We use (in production!) a prompt that includes words to the effect of "If you don't get this right then I will be fired and lose my house". It consistently performs remarkably well - we used to use a similar tactic to force JSON output before that was an option, the failure rate was around 3/1000 (although it sometimes varied key names).
I'd like to see how the threats/tips to itself balance against exactly the same but for the "user"
No comments yet
Contribute on Hacker News ↗