← Back to context

Comment by 0x3f

24 days ago

> quickly demonstrating the ability to answer questions no human has been able to answer.

Such as?

5.5 Pro has been leveraged to solve at least two previously-unsolved Erdos problems [1]. Whether these were unsolved due to being seriously untried by humanity, or because of their difficulty, isn't relevant; no human proved able to answer them, while our synthetic intelligence systems did.

In my personal life, I have leveraged these systems to design code that I don't believe I would ever have been able to designed. And, because no other human may attempt to, this means the same thing: That no human would have been able to do it. Things like reverse-engineering niche APIs and digging into binary files to diagnose weird format conversion issues.

[1] https://x.com/DavidTurturean/status/2054942008817451195

  • > Whether these were unsolved due to being seriously untried by humanity, or because of their difficulty, isn't relevant

    That seems very relevant to my evaluation. I can pull out my calculator right now and solve a problem no human ever has.

    • True! But calculators are already priced in. LLMs now unlock a whole new class of problems that we can automate the solutions to; what we're seeing is the markets trying to figure out how to price that.