Comment by cyberclimb
1 day ago
Note that these results are specific to GPT-4o, so it's unclear how much they generalize.
They note at the end that they're also testing "GPT o3, and Claude", but no empirical results are included.