← Back to context

Comment by Retr0id

5 hours ago

You mean the Claude output? The same claude that has "regressed to the point it cannot be trusted"?

What you saying the OP fabricated/hallucinated the evidence?

  • I'm just saying it's epistemically unrigorous to the point of being equivalent to anecdata.

    • How should one conduct such a rigourously reproducible experiment when LLMs by nature aren't deterministic and when you don't have access to the model you are comparing to from months ago?

      4 replies →