Comment by tw1984

3 months ago

well, this is what anthropic wants you to believe.

all public benchmark results and user feedback paint a quite different picture. Chinese have coding agents on par with Claude Code, they could easily FT/RL to future improve its specific capability if they want, yet anthropic refuses to even acknowledge the reality.

1 comment

tw1984

yawnxyz 3 months ago

yeah probably they're just benchmarking whatever they have across all providers including their own - i mean that's what everyone's doing anyway