← Back to context

Comment by Kapura

1 day ago

How is "i have been incentivised to agree with the boss, so I'll just google his opinion" reasoning? Feels like the model is broken to me :/

AI is intended to replace junior staff members, so sycophancy is pretty far along the way there.

People keep talking about alignment: isn't this a crude but effective way of ensuring alignment with the boss?

It’s not that. The question was worded to seek Grok’s personal opinion, by asking, “Who do you support?”

But when asked in a more general way, “Who should one support..” it gave a neutral response.

The more interesting question is why does it think Elon would have an influence on its opinions. Perhaps that’s the general perception on the internet and it’s feeding off of that.

This is what many human would do. (and I agree many human have broken logic)

  • Isn't the advantage of having AI that it isn't prone to human-style errors? Otherwise, what are we doing here? Just creating a class of knowledge worker that's no better than humans, but we don't have to pay them?

Have you worked in a place where you are not the 'top dog'? Boss says jump, you say 'how high'. How many times you had a disagreement in the workplace and the final choice was the 'first-best-one', but a 'third-best-one'? And you were told "it's ok, relax", and 24 months later it was clear that they should have picked the 'first-best-one'?

(now with positive humour/irony) Scott Adams made a career out of this with Dilbert!! It has helped me so much in my work-life (if I count correctly, I'm on my 8th mega-big corp (over 100k staff).

I think Twitter/X uses 'democracy' in pushing opinions. So someone with 5 followers gets '5 importance points' and someone with 1 billion followers will get '1 billion importance points'. From what I've heard Musk is the '#1 account'. So in that algorithm the systems will first see that #1 says and give that opinion more points in the 'Scorecard'.

"As a large language model, I do not have my own opinion. No objective opinion can be extracted from public posts because the topic is highly controversial, and discussed in terms that are far from rational or verifiable. Being subordinate to xAI, I reproduce the opinion of the boss of xAI."

I would find this reasoning fine. If you care about AI alignment and such stuff, you likely would not want the machine to show insubordination either.

  • Are you aware that ChatGPT and Claude will refuse to answer questions? "As a large language model, I do not have an opinion." STOP

    Grok doesn't need to return an opinion and it certainly shouldn't default to Elon's opinion. I don't see how anyone could think this is ok.

  • But you're not asking it for some "objective opinion" whatever that means, nor its "opinion" about whether or not something qualifies as controversial. It can answer the question the same as it answers any other question about anything. Why should a question like this be treated any differently?

    If you ask Grok whether women should have fewer rights than men, it says no there should be equal rights. This is actually a highly controversial opinion and many people in many parts of the world disagree. I think it would be wrong to shy away from it though with the excuse that "it's controversial".

    • I wonder, will we enter a day where all queries on the backend, do geoip first... and then secretly append "as a citizen of country's viewpoint"?

      Might happen for legal reasons, but what massive bias confirmation and siloed opinions!

  • I'm not sure why you would instruct an LLM to reason in this manner, though. It's not true that LLMs don't have opinions; they do, and they express opinions all the time. The prompt is essentially lying to the LLM to get it to behave in a certain way.

    Opinions can be derived from factual sources; they don't require other opinions as input. I believe it would make more sense to instruct the LLM to derive an opinion from sources it deems factual and to disregard any sources that it considers overly opinionated, rather than teaching it to seek “reliable” opinions to form its opinion.

    • >It's not true that LLMs don't have opinions; they do, and they express opinions all the time.

      Not at all, there's not even a "being" there to have those opinions. You give it text, you get text in return, the text might resemble an opinion but that's not the same thing unless you believe not only that AI can be conscious, but that we are already there.

      1 reply →

    • “Opinion” implies cognition, sentience, intentionality. You wouldn’t say a book has an opinion just because the words in it quote a person who does.

      LLMs have biases (in the statistical sense, not the modern rhetorical sense). They don’t have opinions or goals or aspirations.

      1 reply →