Comment by aesthesia
6 hours ago
Calling the AISLE experiment a "benchmark" is generous. They tested three code snippets on each model.
6 hours ago
Calling the AISLE experiment a "benchmark" is generous. They tested three code snippets on each model.
No comments yet
Contribute on Hacker News ↗