Comment by mips_avatar

6 days ago

It's bad that Anthropic can determine what this means. If you're building a modern app you're likely training your own embedding models and now anthropic can just silently sabotage your training pipelines?

>We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations

At the scale of API requests that Anthropic sees, I think the affected organization count might be substantial, and they might not be getting the full model capability that they're paying top $$$ for.

Also, wonder how they arrived at that estimation.

I have no idea how you came to that conclusion. Unless your training pipeline involves actively querying one of Anthropic models, no they can't. And if it does you're distilling their model.

  • The crocodile tears of companies who've hoovered up everything possible, regardless of permissions or legality, now crying that someone else is stealing their hard work is comical.

    I don't even think they can believe it themselves, it's in reality they are just trying to throw fear, uncertainty and doubt about potentially cheaper offerings.

  • That is not what their policy states. It specifically says they will sabotage even non-distillation attempts, such as distributed training pipeline design. And given that they are so far very nonperformant in classification accuracy, expect it to randomly include far more topics wide of the mark.

    The fun part is that you will never know if your neural net classification project is getting silently sabotaged because their classifier doesn't work!

  • Opus 4.8 (or a classifier in front of it) flagged my account and refused to comply when I told it to kill the process. Reasoning summary was complete bananas.

    With this in mind, I don't want model to be proactively instructed and encouraged to sabotage without telling me.

  • Like if you're using claude code on a feature tangential to your training pipeline it's allowed to nerf itself and damage your AI work.

  • Read the examples Anthropic gave in the model card. They refer to extremely broad technology used across AI and ML.