Comment by abcde88
3 months ago
I've seen the same behavior in Gemini, exactly the same. It is scary to think that this is no coincidence but the rational evolution of a model: that this is precisely the reward model any model will lean toward, with all the consequences.