← Back to context

Comment by mvkel

7 hours ago

If you read the charter of the eval (or any eval, really), this statement is pretty silly.

The whole point of each eval version is to identify a chunk of challenges that humans do well that AI can't. When AI gets to ~80, you move to the next chunk. When you run out of challenges, you have AGI.

HN occasionally devolves into “supremely pedantic and nitpicky” mode. Today is one of those days.