← Back to context Comment by revel 4 hours ago Running benchmarks at scale and protecting against reward hacking is non-trivial. 0 comments revel Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗