Comment by supjeff

5 hours ago

given how often these llms are wrong, doesnt it make sense that they are less confident?

Indeed. But I've had experiences with gemini-2.5-pro-exp where its thoughts could be described as "rejected from the prom" vibes. It's not like I abused it either, it was running into loops because it was unable to properly patch a file.