Comment by fsh
7 months ago
LLM companies try to optimize their benchmark results, not to test the capabilities of their systems. This is why all the benchmarks are so utterly useless.
7 months ago
LLM companies try to optimize their benchmark results, not to test the capabilities of their systems. This is why all the benchmarks are so utterly useless.
No comments yet
Contribute on Hacker News ↗