Comment by grey-area

5 days ago

Sorry I meant the code a year ago - it took a bit more hand-holding at that point and it was a mishmash of different things, but I feel it’s just slightly easier now - still similar. Haven’t looked into this one just had a quick play. Thanks for trying it out!

I think his article is for the last 6 months - my feeling is progress with LLMs has stalled recently and generated code still has problems with accuracy and coherence and subtle bugs, but everyone has a different experience.

I agree with that. Right now, you choose:

Subtle bugs in understanding the spec but strong arch and coding (codex)

Or

Subtle bugs in implementation but good understanding of the spec (claude).