← Back to context

Comment by ealready_value

5 days ago

This is the reply I look for in all the new model announcements. Its fun to tell people that I judge models based on pelicans.

I also look for this reply because i like seeing the follow-up reply saying that this is not a benchmark anymore because labs have gotten it in their training data.

that reply never failed to come it's basically a meme at this point