← Back to context

Comment by RamblingCTO

13 hours ago

Glad I'm not the only one. Almost every factual thing with new opus is wrong (and it now even happens with 4.6?). I asked it about car stuff yesterday and it totally misrepresented how a car axle even looks like fundamentally. Today I talked about my CV and it was just plain wrong. I don't know what happened, it wasn't like this a few weeks back and I'm even considering cancelling claude alltogether. GPT 5.5 for coding is fine and way more stable, but regular work is just broken.

By differences in the release dates between 4.7 and 4.8 it seems it was more likely an attempted bugfix

But 4.8 still underperforms on most tasks. I have things running where 4o-mini does it considerably better repeatably.

They might have tuned it for a particular reason and I would not doubt that the harness has been made worse.

Sometimes it teases me to think it does wrong things on purpose