Comment by yodon

7 days ago

The real question is "could she have written that problem statement a year ago, or is a year's learning by the team baked into that prompt?"

There are lots of distributed agent orchestrators out there. Rarely is any of them going to do exactly what you want. The hard part is usually defining the problem to be solved.

There’s definitely a strong Clever-Hans-the-counting-horse effect. Language models perform vastly better on questions to which the question writer already knew the answer.

Even the Google engineer points out that this isn't an engineering problem, but an alignment problem among the Google devs themselves.