Comment by gblargg
5 days ago
Seems like this will backfire. Now when developers encounter problems with Claude Fable, they will have an easy explanation: it did it deliberately and intentionally vaguely. There's no way to falsify that. It's reasonable to expect false positives, where it invokes this when it shouldn't.