Comment by convexly
1 day ago
My issue with AGI benchmarks is you can never tell if you're measuring actual capability or just how much the training data overlapped with the test.
1 day ago
My issue with AGI benchmarks is you can never tell if you're measuring actual capability or just how much the training data overlapped with the test.
No comments yet
Contribute on Hacker News ↗