Comment by idiotsecant

15 hours ago

Every time I've made an LLM do a thing it's designed not to do it's been a careful sideways crab-walk toward the goal over many exchanges. LLMs are vulnerable to 'frog boiling'. If each email is a new context it seems unsurprising that nobody broke it.

1 comment

idiotsecant

NitpickLawyer 14 hours ago

> it seems unsurprising that nobody broke it

But still a good thing overall. Two years ago this was not the case, and you could ask it to break its system prompt with a poem and get all the secrets back...