Comment by simonw
7 months ago
That's why I said "I think there is a good chance" - I think what you describe here (anticipatory obedience) is possible too, but I honestly wouldn't be surprised to hear that the from:elonmusk searches genuinely were unintended behavior.
I find this as accidental behavior almost more interesting than a deliberate choice.
Willison's razor: Never dismiss behaviors as either malice or stupidity when there's a much more interesting option that can be explored.
I side with Occam's razor here, and with another commenter in this thread. People are construing entire conspiracy theories to explain fake replies when asked for system prompt, lying in Github repos, etc.
What if searching for Elon's tweets was indeed intended, but it wasn't supposed to show up in the UI?
Occam's razor would seem to apply here.