Comment by drawnwren

18 hours ago

The existence of a jailbreak-free LLM in 2026 is extremely contentious to me. You can argue about the specifics of this exact jailbreak, but generally pliny and amazon both reported mythos jailbreaks in <7 days. It seems very reasonable to expect that a well-funded state actor could achieve better results given significantly more funding, determination, and, most importantly, unfettered access.

4 comments

s1artibartfast 17 hours ago

Nobody here is claiming fable is jailbreak-free. Not anthropic and not in this thread. This was known before launch. The question remains one of degree and capabilities.

drawnwren 17 hours ago
Yeah, if you're arguing that "this, according to anthropic, existentially dangerous model has only had its safeguards partially circumvented so we shouldn't step in" ... it's hard for me to take you seriously?
Put another way, the thing we are all concerned with is the complete circumvention of safeguards that is normally possible with LLMs. If you _aren't_ arguing that this isn't possible, you're not engaging with the thing that is concerning to regulators or those discussing the regulation.
- s1artibartfast 7 hours ago
  
  I'm pointing out what the argument is. You were saying it is something different.
  Now you add the word "complete". Anthropic IS arguing _complete_ circumvention is NOT possible.
- linkregister 16 hours ago
  
  A disappointing trend is to frame the opposing argument in extreme terms rather than engaging with the substance of the assertion.
  The latter portion is grandstanding about how incredulous the commenter is that someone might trust an LLM company about the strength of their harnesses' if-then-else statements for request routing.
  Why bother with an unsubstantial comment?