
Comment by CGMthrowaway

6 days ago

This is the critical bit (paraphrasing):

Humans have worked out the amplitudes for integer n up to n = 6 by hand, obtaining very complicated expressions, which correspond to a “Feynman diagram expansion” whose complexity grows superexponentially in n. But no one has been able to greatly reduce the complexity of these expressions, providing much simpler forms. And from these base cases, no one was then able to spot a pattern and posit a formula valid for all n. GPT did that.

Basically, they used GPT to refactor a formula and then generalize it for all n. Then verified it themselves.

I think this was all already figured out in 1986 though: https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.56... see also https://en.wikipedia.org/wiki/MHV_amplitudes

  > I think this was all already figured out in 1986 though

They cite that paper in the third paragraph...

  Naively, the n-gluon scattering amplitude involves order n! terms. Famously, for the special case of MHV (maximally helicity violating) tree amplitudes, Parke and Taylor [11] gave a simple and beautiful, closed-form, single-term expression for all n.

It also seems to be a main talking point.

I think this is a prime example of how easy it is to believe something is solved when looking at it from a high level, and to reach an erroneous conclusion through lack of domain expertise. Classic "Reviewer 2" move. Though I'm not a domain expert either, so if there really is no novelty over Parke and Taylor, I'm pretty sure this will get thrashed in review.

  • You're right. Parke & Taylor showed the simplest nonzero amplitudes have two minus helicities, while one-minus amplitudes vanish (generically). This paper claims that vanishing theorem has a loophole: a new hidden sector exists, and one-minus amplitudes are secretly there, but distributional.

    • > simplest nonzero amplitudes have two minus helicities while one-minus amplitudes vanish

      Sorry, but I just have to point out how this field of maths reads like Star Trek technobabble to me.



It bears repeating that modern LLMs are incredibly capable, and relentless, at solving problems that have a verification test suite. It seems like this problem did (at least for some finite subset of n)!

This result, by itself, does not generalize to open-ended problems, though, whether in business or in research in general. Discovering the specification to build is often the majority of the battle. LLMs aren't bad at this, per se, but they're nowhere near as reliably groundbreaking as they are on verifiable problems.
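
To make the "verification test suite" point concrete, here is a minimal sketch of the propose-and-verify loop, in Python. Everything in it is a toy stand-in I made up (the target is a closed form for 1 + 2 + ... + n), not anything from the paper or any particular model's API:

    # Toy illustration of "propose, then verify against a finite test suite".
    # All names and candidates here are hypothetical stand-ins; nothing in
    # this snippet is about gluon amplitudes.

    def target(n: int) -> int:
        """Ground truth, computed the slow, brute-force way."""
        return sum(range(1, n + 1))

    # Candidate closed forms a generator (human or LLM) might propose.
    candidates = [
        lambda n: n * n,
        lambda n: n * (n + 1) // 2,  # the correct closed form
        lambda n: 2 ** n - 1,
    ]

    def verify(f, cases=range(1, 7)) -> bool:
        """Pass only if the candidate matches every base case (here n = 1..6)."""
        return all(f(n) == target(n) for n in cases)

    winner = next((f for f in candidates if verify(f)), None)
    print(winner(100) if winner else "no candidate survived")  # prints 5050

Note that the loop only certifies the finite base cases; the leap to "valid for all n" still has to be closed by a human or a proof, which is exactly the caveat above.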

  • > modern LLMs are incredibly capable, and relentless, at solving problems that have a verification test suite.

    Feels like this is close to what I tried to express a few weeks ago https://news.ycombinator.com/item?id=46791642 namely that we are just pouring computational resources into verifiable problems, then claiming it's astonishing when it sometimes works. Sure, LLMs have a slight edge in that they rely on statistics, so it's not purely brute force, but the approach is still pretty much the same: throw stuff at the wall, see what sticks, and once something finally does, report it as grandiose and claim it is "intelligent".

    • > throw stuff at the wall, see what sticks, and once something finally does, report it as grandiose and claim it is "intelligent".

      What do we think humans are doing? I think it's not unfair to say our minds are constantly trying to assemble the pieces available to them in various ways, whether we're actively thinking about a problem or letting it run in the background as we go about our day.

      Every once in a while the pieces fit together in an interesting way and it feels like inspiration.

      The techniques we’ve learned likely influence the strategies we attempt, but beyond all this what else could there be but brute force when it comes to “novel” insights?

      If it’s just a matter of following a predefined formula, it’s not intelligence.

      If it’s a matter of assembling these formulas and strategies in an interesting way, again what else do we have but brute force?


  • Yes, this is where I just cannot imagine completely AI-driven software development of anything novel and complicated without extensive human input. I'm currently working in a space where none of our data models are particularly complex, but the trick is all in defining the rules for how things should work.

    Our actual software implementation is usually pretty simple; often writing up the design spec takes significantly longer than building the software, because the software isn't the hard part - the requirements are. I suspect the same folks who are terrible at describing their problems are going to need help from expert folks who are somewhere between SWE, product manager, and interaction designer.

  • Even more generally than verification: just being tied to a loss function that represents something we actually care about. E.g. compiler and test errors, Lean verification in Aristotle, basic physics energy configurations in AlphaFold, or win conditions in RL, as in AlphaGo.

    RLHF is an attempt to push LLMs pre-trained with a dopey reconstruction loss toward something we actually care about: imagine if we could find a pre-training criterion that actually cared about truth and/or plausibility in the first place!

That paper from the 80s (which is cited in the new one) is about "MHV amplitudes" with two negative-helicity gluons, so "double-minus amplitudes". The main significance of this new paper is to point out that "single-minus amplitudes" which had previously been thought to vanish are actually nontrivial. Moreover, GPT-5.2 Pro computed a simple formula for the single-minus amplitudes that is the analogue of the Parke-Taylor formula for the double-minus "MHV" amplitudes.
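
For reference, the double-minus benchmark the new result is being compared against is the Parke-Taylor formula. Written schematically in spinor-helicity notation (the standard textbook form, not quoted from the new paper; gluons i and j carry the two negative helicities, and the coupling and momentum-conserving delta function are stripped off):

    A_n^{\mathrm{MHV}}(i^-, j^-)
      = \frac{\langle i\,j \rangle^4}
             {\langle 1\,2 \rangle \langle 2\,3 \rangle \cdots \langle n\,1 \rangle}

A single-term rational expression like this, replacing a sum over order n! Feynman diagrams, is the kind of simplification the thread is discussing; the new paper claims an analogous closed form for the single-minus case.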

You should probably email the authors if you think that's true. I highly doubt they didn't do a literature search first though...

  • You should be more skeptical of marketing releases like this. This is an advertisement.

  • It's hard to get someone to do a literature search first when they get free publicity by skipping it and claiming some major AI-assisted breakthrough...

    Heck, it's hard to get authors to do a literature search, period: never mind thoroughly looking for prior art, even well-known disgraced papers continue to get positive citations all the time...

  • Don't underestimate the willingness of physicists to skimp on literature review.

    • After last month's handling of the Erdős problems by LLMs, everyone writing papers, even physicists, should be aware by now that literature checks are approximately free.

> But no one has been able to greatly reduce the complexity of these expressions, providing much simpler forms.

Slightly OT, but wasn't this supposed to be largely solved with amplituhedrons?

Still pretty awesome though, if you ask me.

  • I think even a “non-intelligent” solver like Mathematica is cool - so hell yes, this is cool.

  • Big difference between “derives new result” and “reproduces something likely in its training dataset”.

Sounds somewhat similar to the groundbreaking application of a computer to prove the four color theorem: there, the researchers wrote a program to find and formally check the numerous particular cases; here, the computer finds a simplifying pattern.

I'm not sure whether GPT's ability goes beyond a formal math package's in this regard, or whether it's just way more convenient to ask ChatGPT than to use that software.