Comment by prodigycorp
5 hours ago
That could be it. I still see LLMs fail a set of static typing challenges that I created a couple of years ago as a benchmark. Google's models still fail it. I wonder if the lack of type annotations in much of the training data makes Python harder to reason about?