Comment by linsomniac

12 hours ago

>One year ago, the models were only slightly less competent than today.

That has not been my experience. This weekend I pointed Claude Code+Opus 4.6+effort=max at a PRD describing a Docusign-like software. The exact same document I gave to Claude Code+Opus 4.5+Ultrathink around 6 months ago.

The touch-ups I needed after it completed implementation was around a tenth that it took with 4.5. It is a pretty startling difference.

1 comment

linsomniac

qingcharles 9 hours ago

Agree with this. Opus 4.6 thinks of things I didn't even put in the spec, but absolutely need. It thinks around all the edge cases and gotchas. And I love the way modern AI UIs stop in their tracks and have you answer a bunch of questions about all the ambiguities you left in the spec.

They still do dumb shit from time-to-time, but it's getting rarer.