Comment by mintone

2 years ago

I'd be interested in seeing a similar analysis but with a slight twist:

We use (in production!) a prompt that includes words to the effect of "If you don't get this right then I will be fired and lose my house". It consistently performs remarkably well. We used a similar tactic to force JSON output before that was a built-in option; the failure rate was around 3 in 1,000, although it sometimes varied the key names.
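
For anyone wanting to measure something like that failure rate themselves, here is a rough sketch of how it could be counted. Everything in it is an assumption for illustration: the exact suffix wording (the real prompt only says something "to the effect of" this), the function names, and the choice to count key-name drift separately from outright invalid JSON. It does not call any model; it only scores responses you have already collected.

```python
import json

# Assumed wording; the production prompt is only described as
# "words to the effect of" this sentence.
STAKES_SUFFIX = "If you don't get this right then I will be fired and lose my house."


def build_prompt(task: str) -> str:
    """Append a JSON-only instruction and the stakes sentence to a task."""
    return f"{task}\n\nRespond with a single JSON object only. {STAKES_SUFFIX}"


def classify_output(raw: str, expected_keys: set[str]) -> str:
    """Return 'ok', 'wrong_keys', or 'invalid_json' for one model response."""
    try:
        parsed = json.loads(raw)
    except json.JSONDecodeError:
        return "invalid_json"
    # Valid JSON that is not an object (e.g. a list) still counts as a failure.
    if not isinstance(parsed, dict):
        return "invalid_json"
    return "ok" if set(parsed) == expected_keys else "wrong_keys"


def failure_rate(outputs: list[str], expected_keys: set[str]) -> float:
    """Fraction of responses that are not a valid JSON object.

    Key-name drift is deliberately not counted here, so it can be
    reported separately from hard parse failures.
    """
    if not outputs:
        return 0.0
    bad = sum(classify_output(o, expected_keys) == "invalid_json" for o in outputs)
    return bad / len(outputs)
```

Running `failure_rate` over a few thousand saved responses per prompt variant would give a number comparable to the 3 in 1,000 figure, with the `wrong_keys` cases tallied on the side.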

I'd like to see how threats/tips aimed at the model itself compare against exactly the same ones aimed at the "user".
