← Back to context

Comment by tonfa

4 hours ago

Afaik this is because there is a community and database around them.

Interesting. OpenAI could also be trying to solve other problems, but Erdos problems maybe falling first?

  • No, Erdos problems were accepted as sort of a benchmark. There's a bunch of reasons they're favorable for this task:

    1. They have a wide range of difficulties. 2. They were curated (Erdos didn't know at first glance how to solve them). 3. Humans already took the time to organize, formally state, add metadata to them. 4. There's a lot of them.

    If you go around looking for a mathematics benchmark it's hard to do better than that.