← Back to context

Comment by vntok

17 hours ago

Reproducing experimental results across models and vendors is trivial and cheap nowadays.

Not if anthropic goes further in obfuscating the output of claude code.

  • Why would you test implementation details? Test what's delivered, not how it's delivered. The thinking portion, synthetized or not, is merely implementation.

    The resulting artefact, that's what is worth testing.

    • > Why would you test implementation details

      Because this has never been sufficient. From things like various hard to test cases to things like readability and long term maintenance. Reading and understanding the code is more efficient and necessary for any code worth keeping around.