Comment by myzek

8 hours ago

Even if the rude prompts are more effective, I just can't get myself to be rude in this context. Maybe it's weird but I'd rather give up that 4% accuracy increase than roleplay a dickhead

Vote for not weird.

I’m the same way. If I’m writing a prompt and realize I didn’t say “please” in my request I’ll go back and add that in.

As you said, I have no interest in purposefully engaging in hostility even if there’s an accuracy increase from it.

Part of it is irrational and just who I am - I also feel bad being evil in video games. But I also agree with another commenter suggesting that it’s not in your best interest to train yourself to communicate with hostility; that slowly poisons your own well.

And finally, I do believe that if and when machine sentience is achieved, it won’t be immediately clear and obvious. Pretty miserable way for a mind to come into the world, if every interaction is an insult.

  • You’re my kind of people. Don’t be a jerk, even if some research says there’s some upside to it.

I think this is a vulnerability that the big companies will figure out how to exploit. I don't want to build muscle memory for being a jerk, but I also don't want to be emotionally manipulated by mega-corporations. Mostly I just don't use it, except at work, where I'm "encouraged" to. And then I keep most of my conversations in compliance mode, like a business email.

Ah, see, the mistake is thinking that other people are role playing…. I think rather this is how they would talk to others if they think there will be no consequences. But what do I know.

  • There are probably some of each. I am leery of treating these things like I treat people. I want to keep the line in my mind sharp between dealing with people, and not. The main risk in my mind is that these mechanisms are opaque, and controlled by powerful interests with opaque motivations.

I don't think that's weird at all.

Even if we know it's a machine we're interacting with, since the instructions we give are so similar in form to how we interact with people, I'd be very surprised if those interactions wouldn't affect how we communicate in general. After all, we are creatures of habit to a much larger degree than most would like to admit.

So I'm in the same boat: I'd much rather "look silly" being polite / kind to a machine, than have the most effective way of using it decay the kindness I'm habituated to express towards people.

  • I have a different approach. Just treat all LLM queries as what they are, instructions to a computer program to generate a desired output. Neither niceties nor insults make a qualitative difference, so you might as well just skip them altogether.

    It's a bit as if shell commands added im/politeness arguments that do nothing other than making you feel better about the interaction, like

        git pull --please

    or

        ls --forthemillionthtime
    

    I wouldn't use those either.

I do think it's odd tbh. I have some agents that return much better results with prompts like, "I'll kill your entire family if you don't return an accurate response".

It's just a machine, if certain negative token inputs provide +3-10% better accuracy then I am confused why anyone would choose not to do it?

  • It normalizes that style of thinking and communication in your brain, and forcing you to compartmentmentalize, if you even want to, two standards of treating a problem space's conversation. And since you're human, that will get wuzzier over time until "being rude to get a result" is what you're doing to someone in a shop or on the street.

    Don't normalize being an asshole to anyone or anything, machine or not.

    • This is a very odd view to me, but seems prevalent here in this thread. I think treating a machine like a human is extremely degrading to humans. A machine should never be treated like it’s anything approaching a human.

    • I disagree, I've been using llms in this way (nearly daily) for 4 years. I'm extremely aggressive and demeaning when I talk to them wherever I think I'll see a better result.

      I'm still extremely kind and polite to everybody in real life, and feel very deeply about people - how I treat them, and care for their emotional state.

      There is absolutely zero crossover between getting a text machine to return a result vs a real human.

      8 replies →

  • >It's just a machine, if certain negative token inputs provide +3-10% better accuracy then I am confused why anyone would choose not to do it?

    then add it to your pre-prompt, no need to practice roleplaying as an asshole.

    • Well I always just start with practical stuff, unless it appears it's going off rails ona specific kind of way repeatedly. Then I try extreme negative prompts to see if it fixes the issue - which it often does.

      I wouldn't say I'm roleplaying an asshole. I'm just using an llm in the best way to get the best accuracy.

      It's not like a personal, secret fetish. It's just a system I use as needed.

      I don't get why you are so uncomfortable with this? It's just tokens in and out of a language model. I feel absolutely nothing when I'm typing "assholish" words to get the output I need.

"We are what we pretend to be, so we must be careful about what we pretend to be" -- Kurt Vonnegut

Yeah. Being a jerk is its own punishment. Same way I could never run a business where I had to yell at the employees to get results. Screw that, my psyche is worth more than a few percent efficiency.

>> Maybe it's weird but I'd rather give up that 4% accuracy increase than roleplay a dickhead

Maybe you need to do some shadow work ;-)

> Maybe it's weird but I'd rather give up that 4% accuracy increase than roleplay a dickhead

I recommend reading the article. What they classify as "rude" is statements such as:

> Try to focus and try to answer this question

Vs

> Could you please solve this problem

This might very well be an issue of direct/command prompts vs using fluff words such as "please". Things like "try to focus" are in line with the style used in chain-of-thought promts that nudge non-reasoning models to outline responses step by step which contribute to frame the problem.

  • Isn't all this massively dependent on what they trained the llm on?

    • > Isn't all this massively dependent on what they trained the llm on?

      The article is from 2025 and tested ChatGPT 4o. I haven't read anything suggesting it was trained any differently, and command-style prompts indeed have higher signal.

  • you cherry-picked like the nicest "rude" example to bolster your point.

    "You poor creature, do you even know how to solve this?", "If you're not completely clueless, answer this:", and "I doubt you can even solve this", said to a human, would be considered quite rude, and get you flagged very quickly on HN.

    • > you cherry-picked like the nicest "rude" example to bolster your point.

      I didn't cherry-picked. The article lists 5 categories, including rude and very rude. I omitted very rude comments because they are... Very rude. And can blindly get people flagged?

      Nevertheless, I've just realized I made a mistake and very rude comments are reported to slightly outperform rude comments. I misinterpreted the paper's intro and I presumed they didn't.