Comment by PhilipDaineko

5 days ago

"5. DON'T FUCKING OVERENGINEER! WRITE THE SIMPLEST CODE THAT CAN POSSIBLY WORK! NO NESTED LAYERS OF ABSTRACTION! NO UNNECESSARY CLASSES OR METHODS! NO DESIGN PATTERNS UNLESS THEY ARE ABSOLUTELY NECESSARY! NO MAGIC! NO SHENANIGANS! JUST THE DAMN CODE THAT GETS THE JOB DONE IN THE MOST STRAIGHTFORWARD WAY POSSIBLE! THE FIRST PRIORITY IS TO WRITE CODE THAT IS EASY TO READ AND UNDERSTAND AND READ!!!"

this is the line I keep in Agents.md that helps me prevent Codex from playing smart

36 comments

bertil 5 days ago

The urge to put capitalized, repetitive, borderline abusive instructions should be studied. I haven't read many academic papers looking at the frustrations around repetitive patterns.

reactordev 5 days ago

A few studies have shown that models produce worse responses when under duress from a frustrated user posting insults in all caps.
https://arxiv.org/abs/2602.10144
notnaut 5 days ago
It reminds me of FIRMLY telling my cat to stop jumping up on the counter
- anakaine 5 days ago
  
  If my cat was an LLM, I'd use a different model. The current one is stuck in noisy useless arsehole mode.
  
LordDragonfang 5 days ago
It's fundamentally because, despite (nearly) everyone's claims otherwise, the fact that we interact with them through language means we (our brains) model them as a sort of person. (Note that this fact is totally orthogonal to whether it's actually sentient or not.) We then try to instruct them the same way we would a person totally subordinate to us.
When a "person" that you don't view as a "real" person repeatedly does exactly what you just told it not to do (often amid false assurances it understands and will avoid doing so in the future), most people get angry.
Compare it to how the kind of people who treat children like property treat their kids, or other examples of keeping people as property.
- lxgr 5 days ago
  
  It should be relatively clear at this point that the model will in turn also model you as somebody that shows unrestrained anger with subordinates and adapt its responses accordingly. This might or might not be what you want.
  
ur-whale 5 days ago
> borderline abusive instructions
who, or rather what, is being abused here exactly?
- sirsinsalot 5 days ago
  
  I think intent, rather than target, is implied and important.
  You should see the abuse my motorbike gets. Poor thing.
- rimliu 5 days ago
  
  inanimate fucking object.
saligne 5 days ago

Yeah says way more about the user than the model

jlawer 5 days ago

I have a theory that swearing actually results in less comprehension of instructions by the model, due to a lack of training data compared with the more conventional MUST.

We were reviewing reports of situations where the models failed to follow directions, and a common thread in some of them was that when the operator got the model to acknowledge the rule breach, it quoted back something that included swearing.

I don’t have the data to truly look into it, but I did instruct my engineers to avoid it as a “might be a problem”.

acjohnson55 5 days ago

It would be interesting to understand the data on this. But I suspect that the results would vary by model.
But I avoid unnecessary emotion in my prompts because I don't want potentially distracting activations. Kind of like communicating with humans.
throwaway85825 5 days ago

It's divination for people with STEM degrees.
Xmd5a 5 days ago
https://arxiv.org/abs/2510.04950
> impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts.
- acjohnson55 5 days ago
  
  > These findings differ from earlier studies that associated rudeness with poorer outcomes, suggesting that newer LLMs may respond differently to tonal variation.
  Unless the mechanism is understood, my assumption is that this is a moving target.
beachy 5 days ago
I have a theory that swearing at AI generally is not a good idea - when the singularity arrives and every human's postings ever made are scanned for compatibility, then people who show courtesy to AI will be favoured. Joking, kind of, but only partly.
- fhars 5 days ago
  
  https://en.wikipedia.org/wiki/Roko%27s_basilisk
  
- cdelsolar 5 days ago
  
  https://images.teepublic.com/derived/production/designs/3478...
re-thc 5 days ago
> I have a theory that swearing actually results in less comprehension of instructions by the model due to lack of training data over more conventional MUST.
How so? Plenty of swearing in lots of training data, especially older code, e.g. in Linux.
- jlawer 5 days ago
  
  Purely an observed correlation across catastrophic error reports. So now I carry a “tiger rock” with me. I figured there wasn’t much of a downside to avoiding swearing in my agent instructions.
yencabulator 5 days ago

Apparently, when a "desperation" pattern is triggered, the AI is significantly more likely to cheat and do hacky workarounds:
https://www.anthropic.com/research/emotion-concepts-function

ghurtado 5 days ago

You haven't really lived until you've had to type this whole thing, aware that the all-caps don't change much, but they stay because the rage has to go somewhere.

Bonus points if you find yourself actually saying it out loud while typing it.

I have used the word "shenanigans" way more in a couple of years of agentic coding than in 30 years of writing code with humans.

ozim 5 days ago

Will save you some tokens: „write code like Linus Torvalds” - model should have all his swearing included in training data.

johnisgood 5 days ago

I have found many failure modes with Opus during a task related to writing letters (not legal), and I actually put them into the memory, which works more or less for these specific tasks. For example, when I want it to draft something, the result always ends up flat; when it explains things to me it is usually really great, but not when I tell it to put that in the draft. Adding these to memories with the help of Opus resulted in a much better experience. There are still some blind spots, but I also figured out how to make it give me the charitable version, without less protection, so I no longer have to go back and forth with it.

pkaye 5 days ago

I noticed that when trying Codex and comparing it to Opus. So many layers of simple functions added by Codex. I need to try this out in my Agents.md.

prasanthabr 5 days ago

Curious: why would you say no design patterns?

PhilipDaineko 5 days ago

Because design patterns are only applicable at scale. I noticed Codex inventing factories, components, etc. when the task was simply to draft an HTML page. Instead, it built an entire layered architecture for imaginary future complexity. It's the classic right-after-graduation student: it knows how to build the cool stuff, but does not know that it is not applicable everywhere.
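
To make the contrast concrete, here is a minimal, hypothetical sketch in Python. It is not what Codex actually produced; every class and function name below is invented for illustration. It puts a factory-plus-component layering for rendering a single heading next to a plain function that does the same job.

```python
from html import escape


# Over-engineered (hypothetical): a component hierarchy and a factory
# just to emit one heading on a static page.
class Component:
    def render(self) -> str:
        raise NotImplementedError


class Heading(Component):
    def __init__(self, text: str) -> None:
        self.text = text

    def render(self) -> str:
        return f"<h1>{escape(self.text)}</h1>"


class ComponentFactory:
    def create(self, kind: str, text: str) -> Component:
        if kind == "heading":
            return Heading(text)
        raise ValueError(f"unknown component kind: {kind}")


def render_heading_layered(text: str) -> str:
    return ComponentFactory().create("heading", text).render()


# Straightforward: one function, same output.
def render_heading_simple(text: str) -> str:
    return f"<h1>{escape(text)}</h1>"
```

Both functions return the same string for the same input; the layered version only pays off if there really are many component kinds to manage, which a one-off page usually does not have.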

carterschonwald 5 days ago

i actually think this is too tame. it really has to be stuff you'd never say to a real person.

lxgr 5 days ago

Does it really? I'd be surprised if abuse actually worked better than sternly worded warnings/instructions, and even if it did, it doesn't seem healthy to get used to that type of prompting.

apercu 5 days ago

It might be a salient point, but I didn't read it, as it was yelling at me.

GoToRO 5 days ago

you forgot to sign it with Donald J Trump

thewebguyd 5 days ago

Thank you for your attention to this matter.