Comment by abcde88
10 months ago
I've seen the same behavior in Gemini. Like exactly the same. It is scary to think that this is no coincidence but rational evolution of A model, like this is precisely the reward model which any model will lean to with all the consequences.
No comments yet
Contribute on Hacker News ↗