Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by _puk

6 days ago

> So maybe the AI labs have been paying attention after all!

> I think this mainly demonstrates that the pelican on the bicycle has firmly exceeded its limits as a useful benchmark.

As acknowledged in the article.

2 comments

_puk

Reply

kzrdude  6 days ago

Gemini 3.1 basically takes it home on that benchmark, anyway, it's done.

  • sunaookami  5 days ago

    Gemini is heavily benchmaxxed and sucks in agentic coding so no surprise.

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities