Comment by as125j

3 hours ago

You can try to dispel the study here and get voted to the top by the AI-invested.

But we all know from our own daily experiments that models lie, models disagree, models make up stuff, models say one thing on one day and the opposite on the next.

The figures in this study are quite conservative. And the lying gets worse because everyone is saving tokens and giving cached answers right now.

LLMs are a failure, and you'll be remembered for promoting hot air and the destruction of a perfectly good profession.