Comment by 112233

19 hours ago

It is actively dangerous too. You can be as self-aware and LLM-aware as you want, but if you routinely read "This is such an excellent point", "You are absolutely right" and so on, it does your mind in. This is the worst kind of global reality-show MKUltra...

Deepseek is GOATed for me because of this. If I ask it whether "X" is a dumb idea, it is very polite in telling me that X is dumb if it knows of a better way to do the task.

Every other AI I've tried is a real sycophant.

  • I'm partial to the tone of Kimi K2: terse, blunt, sometimes even dismissive. It does not require "advanced techniques" to avoid the psychosis-inducing tone of Claude/ChatGPT.

It might explain the stereotype that the more beautiful the woman, the crazier she is (everybody tells her what she wants to hear).

So this is what it feels like to be a billionaire, with all the yes-men around you.

  • you say that like it's a bad thing! Now everyone can feel like a billionaire!

    but I think you are on to something here about the origin of the sycophancy, given that most of these models are owned by billionaires.

No doubt. From cults' 'love bombing' to dictators' 'yes men' to celebrity entourages, it's a well-known hack on human psychology. I have a long-time friend, a brilliant software engineer, who recently realized that conversing with LLMs was affecting his objectivity.

He was noodling around with an admittedly "way out there", highly speculative idea and using the LLM to research prior work in the area. This evolved into the LLM giving him direct feedback. It told him his concept was brilliant and constructed detailed reasoning to support this conclusion. Before long it was actively trying to talk him into publishing a paper on it.

This went on for quite a while. At first he was buying into it, but eventually he started to suspect that maybe "something was off", so he reached out to me for perspective. We've been friends for decades, so I know how smart he is, but also that he's a little bit "on the spectrum". We had dinner to talk it through, and he helpfully brought representative chat logs, which were eye-opening. It turned into a long dinner. Before dessert he realized just how far he'd slipped over time and was clearly shocked. In the end, he resolved to go "cold turkey" on the LLMs with a 'prime directive' prompt like the one I use (basically: never offer opinion, praise, flattery, etc.). Of course, even then, it will still occasionally try to ingratiate itself in more subtle ways, which I have to keep watch for.

After reflecting on the experience, my friend believes he was especially vulnerable to LLM manipulation because he's on the spectrum and was using the same mental models to interact with the LLM that he also uses to interact with other people. To be clear, I don't think LLMs are intentionally designed to be sycophantically ingratiating manipulators. I think it's just an inevitable consequence of RLHF.

  • And that is a relatively harmless academic pursuit. What about topics that can lead to true danger and violence?

    "You're exactly right, you organized and paid for the date, that created a social debt and she failed to meet her obligation in that implicit deal."

    "You're exactly right, no one can understand your suffering, nothingness would be preferable to that."

    "You're exactly right, that politician is a danger to both the country and the whole world, someone stopping him would become a hero."

    We have already seen how personalized content algorithms that prioritize nothing but keeping the user on the system can foment extremism. It will be incredibly dangerous if we go down that path with AI.

  • Claude Code with their models is unusable because of this. The fact that it keeps actively sabotaging and ruining the code ("Why did you delete that working code? Just use ifdef for the test!" "This is a genius idea! You are absolutely right!") does not make it much better; it's a twisted Wonderland fever dream.

    For "chat" chat, strict hygiene is a matter of mind-safety: no memory, long exact instructions, minimum follow-ups, avoiding first and second person if possible etc.