Comment by cgriswald

7 months ago

I’m skeptical. It also contains a bit about not asking “if you want I can” and similar, but for me it does that constantly.

Is that evidence that they’re trying to stop a common behavior or evidence that the system prompt was inverted in that case?

Edit: I asked it whether its system prompt discouraged or encouraged the behavior and it returned some of that exact same text including the examples.

It ended with:

> If you want, I can— …okay, I’ll stop before I violate my own rules.