Comment by NateEag

15 hours ago

> It is pretty clear that initial accuracy issues will become less and less of a problem as these technologies mature.

What do you base this on?

As someone who sees both the amazing things genAI can do and how utterly flawed most genAI output is, I don't find it obvious.

I work with Claude (Opus 4.7) every day and review a steady stream of PRs from coworkers who are all-in, unlike me, who uses it only because of a corporate mandate. The unending stream of stupidity and incomprehension from these bots astonishes me.

Claude recently output this to me:

"I've made those changes in three files:

- File 1

- File 2"

That is a vintage hallucination that could've come right out of GPT 2.0.

> That is a vintage hallucination that could've come right out of GPT 2.0.

That's because, despite the many claims to the contrary, the models haven't actually gotten any smarter. They are still just token prediction engines at the end of the day, without any understanding of what they are doing. That's why one should not rely on them.