Comment by ACCount37
1 day ago
I don't think humans are fundamentally different, just more hardened against adversarial exploitation.
"Getting maliciously manipulated by other smarter humans" was a real evolutionary pressure ever since humans learned speech, if not before. And humans are still far from perfect on that front - they're barely "good enough" on average, and far less than that on the lower end.
Walk out the door carrying a computer -> police called.
Walk out the door carrying a computer and a clipboard while wearing a high-vis vest -> "let me get the door for you."
Maybe the models can learn to be more cynical.