Comment by bityard

6 hours ago

The assertion in the issue report is that Claude saw a sharp decline in quality over the last few months. However, the report itself was allegedly generated by Claude.

Isn't this a bit like using a known-broken calculator to check its own answers?

If a known-broken calculator claims it's broken, I more or less concur. (Chain of reasoning omitted here.)