Comment by novaRom
1 month ago
Zero trust in benchmarks without opening model's training data. It's trivial to push results up with spoiled training data.
1 month ago
Zero trust in benchmarks without opening model's training data. It's trivial to push results up with spoiled training data.
No comments yet
Contribute on Hacker News ↗