Comment by bityard

4 months ago

The assertion in the issue report is that Claude saw a sharp decline in quality over the last few months. However, the report itself was allegedly generated by Claude.

Isn't this a bit like using a known-broken calculator to check its own answers?

2 comments

bityard

nyeah 4 months ago

If a known-broken calculator claims it's broken, I more or less concur. (Chain of reasoning omitted here.)

itemize123 4 months ago

if it's not broken then we trust the assertion that it's broken. if it's broken then it's broken.

it's analysis of what is broken is probably wrong or at least incomplete though