Comment by 7373737373

2 days ago

Interesting new contender for simplest to state unsolved problem: The Antihydra

Does this program halt?

  a = 8
  b = 0
  while b != -1:
      if a % 2 == 0:
          b += 2
      else:
          b -= 1
      a += a//2

(// being integer division, equivalently a binary shift one to the right: >> 1)

https://www.sligocki.com/2024/07/06/bb-6-2-is-hard.html

https://bbchallenge.org/antihydra

Fwiw, ChatGPT is able to say that it doesn't. I wonder what other classes of programs it's able to state if it halts?

  • The math community surely expects a proof of that, and ChatGPT surely doesn't (yet) have one. (Maybe some day it will, as Kevin Buzzard and others are experimenting with asking language models to produce formal proofs.)

    You could get LLMs to opine on many unresolved math conjectures, but I doubt much credence should be given to their responses, when not accompanied by a proof.

  • Most LLMs I've tried come up with invalid reasoning, many confuse empirical evidence (of simulating it for a few steps and it 'most probably not halting') with definite proof that it never does, some create invalid probabilistic mathematical arguments to the same effect

    Others I've tried are caught in a loop of trying to prove the same, insufficient approach over and over again, lacking explorative and "creative" behavior

    Generally it seems that LLMs lack the 'motivation' to actually try to solve unsolved problems especially if they know that they are unsolved or difficult