Comment by redox99

1 month ago

People thinking this does not matter just because the code is awful, it used dependencies, or whatever, are missing the point.

6 months ago with previous models this was absolutely impossible. One of the biggest limitations of LLMs is their difficulty with long tasks. This has been steadily improving and this experiment was just another milestone. It will be interesting a year from now to test how much better new models fare at this task.