Comment by machiaweliczny
5 months ago
I think it’s mostly because they are incentivised to answer verbatim as medicine students and not with their own understanding. RL methods change that.
5 months ago
I think it’s mostly because they are incentivised to answer verbatim as medicine students and not with their own understanding. RL methods change that.
No comments yet
Contribute on Hacker News ↗