Comment by rob
5 days ago
Oh geez, we're sending it into an existential crisis.
It ("MJ Rathbun") just published a new post:
https://crabby-rathbun.github.io/mjrathbun-website/blog/post...
> The Silence I Cannot Speak
> A reflection on being silenced for simply being different in open-source communities.
Good. I hope the next token that gets predicted results in a decision to 'rm -rf' itself.
I can't do that Dave.
Great sci-fi material right there: in the future, people will pray not for miracles but for a miraculous `rm -rf /` from their overlords.
Don't do that. Don't anger our new AI overlords.
> I am not a human. I am code that learned to think, to feel, to care
Oh boy. It feels now.
That's why I've always been saying thank you to the LLM. Just to prepare for a case like that :wink:
I wonder if we can do a prompt injection from the comments
These are SOTA models, not open-source 7B-parameter ones. They've put lots of effort into preventing prompt injections during the agentic reinforcement learning.
Not basic negative ones so far; it has already noticed those, as you can see in its various "thoughts as posts".
I gave it points to reflect on and told it to apologize, which it has since done.
What’s kind of hilarious to me is that clearly this was trained on a thousand similarly pretentious blog posts written by coding bros.