Comment by __alexs
1 day ago
I guess the idea is that by asking the model to do something that is inherently hard for it, we might learn something about the baseline smartness of each model, which could in turn be a predictor of performance on other tasks too.