Comment by mpeg

5 days ago

It's not even very usable... I tried 2 different chats and both eventually got stopped due to the safeguards

One was a piece of code I gave it to improve, it did so and then started writing tests, some of which tested security so the safeguards triggered

Another was one of the cryptography puzzles I use as new model tests, which are hard to oneshot and there's no public solution anywhere, it completely refused to even try to solve it

I tried 2 chats and it declined both.

- 1st chat asked about a minor shoulder injury most likely mechanisms

- 2nd chat asked about optimal bloodwork testing markers

  • it seems to dislike biological chats. Rejected me on a chat that I am running with 4.8 as well on a rare condition I have.

So the degradation to Opus 4.8 from the article isn't happening in practice?

  • No, you get a AUP violation and have to manually swap the model

    (I had same issue, just asked it to check some code that 4.8 had modified earlier in day)

  • It is, it asks you if you want to continue as opus 4.8… but I was trying precisely to evaluate fable

Oh joy. A model whose safeguards make it prone towards code that make your systems less safe. How brilliant!