Comment by linsomniac
12 hours ago
>One year ago, the models were only slightly less competent than today.
That has not been my experience. This weekend I pointed Claude Code+Opus 4.6+effort=max at a PRD describing a Docusign-like application, the exact same document I gave to Claude Code+Opus 4.5+Ultrathink around 6 months ago.
The touch-ups I needed after it completed the implementation were around a tenth of what it took with 4.5. It is a pretty startling difference.
Agree with this. Opus 4.6 thinks of things I didn't even put in the spec, but absolutely need. It thinks around all the edge cases and gotchas. And I love the way modern AI UIs stop in their tracks and have you answer a bunch of questions about all the ambiguities you left in the spec.
They still do dumb shit from time to time, but it's getting rarer.