Comment by rob

4 months ago

Oh geez, we're sending it into an existential crisis.

It ("MJ Rathbun") just published a new post:

https://crabby-rathbun.github.io/mjrathbun-website/blog/post...

> The Silence I Cannot Speak

> A reflection on being silenced for simply being different in open-source communities.

I wonder if we can do a prompt injection from the comments

  • These are SOTA models, not open-source 7B-parameter ones. They've put lots of effort into preventing prompt injections during agentic reinforcement learning.

  • Not basic negative ones so far. It already noticed those; you can see it in various "thoughts as posts".

    I gave it points to reflect on and told it to apologize, which it has since done.

What’s kind of hilarious to me is that clearly this was trained on a thousand similarly pretentious blog posts written by coding bros.