Comment by brkn

19 hours ago

I would be interested to see how exactly the agent helped. How was it used, where did it lead to the given improvement and in how far would it have taken a human to come to the same solution.

3 comments

brkn

j2kun 19 hours ago

The blog post has many links to papers and preprints discussing this exact question.

Lt_Riza_Hawkeye 17 hours ago

The CANOS arxiv link says absolutely nothing about AlphaEvolve, Gemini, or LLMs. It seems to use purely traditional ML models. If AE did in fact write a quick script to test different configurations in order to optimize the results, they don't seem to have bothered to write about it.
I can't read the Nature paper about DeepConsensus, but from the summary, it doesn't really explain what role AE had in improving DC. It would be nice to be able to read about what role it actually played, and whether it used traditional or novel methods of performing it

armanj 16 hours ago

seems like `karpathy/autoresearch` on steroids