Comment by jeremyloy_wt
10 hours ago
> we as humans can guide the LLM toward a rigorous test suite, rather than one that has a lot of "coverage" but doesn't actually provide sound guarantees about behavior.
I have a hard enough time getting humans to write tests like this…
No comments yet
Contribute on Hacker News ↗