Comment by vntok

17 hours ago

Reproducing experimental results across models and vendors is trivial and cheap nowadays.

3 comments

BoredPositron 16 hours ago

Not if Anthropic goes further in obfuscating the output of Claude Code.

vntok 15 hours ago
Why would you test implementation details? Test what's delivered, not how it's delivered. The thinking portion, synthesized or not, is merely implementation.
The resulting artefact, that's what is worth testing.
- hghbbjh 14 hours ago
  
  > Why would you test implementation details
  Because this has never been sufficient, from various hard-to-test cases to things like readability and long-term maintenance. Reading and understanding the code is more efficient, and necessary for any code worth keeping around.