Comment by andai

11 hours ago

If they succeed here, won't they have to gate access to this feature, too? For the same reasons as, "if I so much as mention mitochondria, it downgrades me to Opus."

Step 1. Make it so Claude can do anything — the whole point of AGI

Step 2. Wait, if the user can do Anything, that would be Very Bad!

Step 3. Err on the safe side with blanket bans of entire fields

The latter actually seems to me a sensible reaction to e.g. the compartmentalization used in the large scale cyber attack using Claude last year. Where they were able to do Bad Thing by dividing it into many, many Small, Seemingly Harmless Things.

Gated access sounds bad (and I agree it sounds bad!) but it might actually be the only sensible response to such a set of conditions. I'm not sure though.

I saw some studies recently which showed LLMs provide much more detailed information to expert users. So we can distinguish between competence and incompetence based on use of language, and that is a reasonable metric for harm reduction.

But I don't think we can reliably detect "user has harmful intentions", at least not at a sufficient level of sophistication of the attacker.

26 comments

andai

MiracleRabbit 11 hours ago

I just wait until a Opensource model inhaled all the chemistry books and papers. Lol.

They are following closely and the best offer 80-90% of the performance and come with a very small fraction of the costs.

inigyou 5 hours ago

There are no open source LLMs, only LLMs you can download.
echelon_musk 10 hours ago
Why wouldn't the US gov. outlaw the open source models?
- skeledrew 7 hours ago
  
  Applicable to only those of the US (who choose to comply), which would mean giving non-US an advantage, which is a distinct no-no. They tried to restrict Fable to only-US, but that backfired since the world isn't at all clean cut, resulting in full unavailability of the model
- MiracleRabbit 10 hours ago
  
  Streisand effect
  Deepseek.v4.Pro.RePacked.LLMBoyz.part1.zstd
  
  3 replies →
- squidbeak 8 hours ago
  
  Without international treaty and regulation restricting frontier capabilities globally, any attempt to outlaw open source models will only be as effective as King Canute ordering the tide to turn. Unless the USA fancies bombing those who refuse.
  
  2 replies →
- throw1234567891 5 hours ago
  
  And build the wall, all around.
- lelanthran 6 hours ago
  
  How would that affect the world?
- 15155 4 hours ago
  
  Prior restraint?
- OutOfHere 4 hours ago
  
  It cannot. The only available law is an export control law. A restriction on an open model would be strongly unconstitutional due to a freedom of speech guarantee in the Constitution.
- UltraSane 9 hours ago
  
  Much harder to ban if they can run locally

brookst 4 hours ago

Maybe? It seems like their strategy is to accept that safety is imperfect, err on the side of over-triggering, and iterate.

I don’t think it’s a black and white “if fable 5 over triggers on bio safety in 2026, that’s the final pattern we should expect to see from post-Mythos 20 in 2036”

inigyou 5 hours ago

It's a good marketing strategy though, since nobody can disprove that Claude is the best chemist :)

NewsaHackO 11 hours ago

I think one time I asked opus about copyfail when it just came out and it did treat me like some sort of criminal, but are there really people that run into this on a regular basis other than cybersecurity experts (which cannot be a big enough group to generate all of this criticism)?

poizan42 4 hours ago

I have been trying to work on debugging tools using Opus 4.8. As it turns out working with low-level techniques that inspects and alters behavior of other processes is bordering close enough to cybersecurity research that it often hits the guardrails.
dmd 8 hours ago

I asked it to clip a video onto segments based on location of spoken words (using separately done audio transcription) and it flagged that as cyber.

siva7 5 hours ago

"Making Claude a chemist" is a punch in the face of all us here after we got a taste of Claude fable blocking almost any STEM-related topic. So i don't know what Story Anthropic wants to tell us with this blog. Probably more like "Hey, look at what awesome tricks Claude can do which of course you will never experience because you're not american, not friends with the CEO, not rich or simply because fuck you"

HarHarVeryFunny 44 minutes ago

I think the story they want to tell is "we're not just taking your jobs, we can do positive stuff like Chemistry too, just like DeepMind!"
Except of course they mostly can't because Chemistry is about structure and molecular dynamics, not people's descriptions of experiments.
DeepMind if so far the only AI/AGI company - the rest are LLM companies.

UltraSane 9 hours ago

Fable wouldn't even explain what an amino acid is.

chaostheory 5 hours ago

It would make more sense to have specialized versions of Claude instead of having a universal one ie BioClaude, ProgrammerClaude, NuclearClaude, CyberSecurityClaude, et al

This would be both safer and less annoying to use.

OutOfHere 4 hours ago

That seems bad because it kills interdisciplinary science.