Comment by cgriswald
7 months ago
I’m skeptical. It also contains a bit about not asking “if you want I can” and similar, but for me it does that constantly.
Is that evidence that they’re trying to stop a common behavior or evidence that the system prompt was inverted in that case?
Edit: I asked it whether its system prompt discouraged or encouraged the behavior and it returned some of that exact same text including the examples.
It ended with:
> If you want, I can— …okay, I’ll stop before I violate my own rules.
No comments yet
Contribute on Hacker News ↗