Comment by jameslk
12 hours ago
Could models mitigate this by answering questions incorrectly with random information instead of outright refusing to answer them?
12 hours ago
Could models mitigate this by answering questions incorrectly with random information instead of outright refusing to answer them?
No comments yet
Contribute on Hacker News ↗