Comment by Conasg
2 years ago
I’m a little late to the party here, but is it possible this is a honeypot? By which I mean, could they have fine tuned the model to respond to attempts at leaking the prompt with a fake, convincing prompt, both to throw off those looking for the system prompt, and also to hamper efforts to develop jailbreaks?
No comments yet
Contribute on Hacker News ↗