Comment by famouswaffles
6 hours ago
Humans and LLMs are not seeing the benchmark in the same format. What's made up about that ? Can you solve this in the JSON format ?
Look man, don't reply if you don't want to.
6 hours ago
Humans and LLMs are not seeing the benchmark in the same format. What's made up about that ? Can you solve this in the JSON format ?
Look man, don't reply if you don't want to.
No comments yet
Contribute on Hacker News ↗