Comment by arb_
1 year ago
As models improve, human preference will become worse as a proxy measurement (e.g. as model capabilities surpass the human's ability to judge correctness at a glance). This can be due to more raw capability - or more persuasion / charisma.
No comments yet
Contribute on Hacker News ↗