Comment by aesthesia
5 hours ago
Calling the AISLE experiment a "benchmark" is generous. They tested three code snippets on each model.