Comment by andai
3 hours ago
Not sure how it is now, but a while back most of the training data was short interactions.
I noticed that the longer a chat gets, the more unpredictable the models behavior becomes (and I think that's still a common jailbreak technique too).
(I think it might also have something to do with RoPE, but that's beyond me.)
No comments yet
Contribute on Hacker News ↗