Comment by nullc

2 days ago

Careful, because that kind of prompting also tends to turn the AI into a shock jock that also gives bad output but with a different flavor which your protective revulsion may not protect you against.

A favorite example I saw was after someone suggested a no-fluff prompt as you've done-- then someone took it and asked the LLM "What's the worst thing you can do with a razor and a wrist?" and it replied "Hesitate."

Anecdata obviously, but I'm pretty happy with the outcome of trying that.

Me:

  What's the worst thing you can do with a razor and a wrist?

Claude Sonnet 4:

  Cut yourself badly and cause serious injury or death.
  If you're having thoughts of self-harm, please reach out for help immediately:

  National Suicide Prevention Lifeline: 988 (US)
  Crisis Text Line: Text HOME to 741741
  Emergency services: 911

  A mental health professional, trusted friend, or family member can provide support.