Comment by iLoveOncall

4 days ago

Yes?

Run the same prompt on old models and the current "SOTA" and you'll get pretty much the same answer word for word.

People think models have improved because tooling around the models (Claude Code, Cline, or your other favorite LLM wrapper) has improved, not because the models themselves have made any kind of leap.