Comment by hereme888

19 hours ago

Early ArtificialAnalysis.ai results show GPT 5.5 is still the better bang-for-your-buck.

OpenAI solves tasks with about 50% less output tokens.

https://artificialanalysis.ai/?intelligence=coding-index&int...

I give Codex a try with every new version, and we don't match, so this isn't true for everyone.

Claude would need to be much more expensive for me to switch.

  • People be saying these things with certainity. 99% of the time one has just inspired more confidence through sycophancy, or just good varience in outputs for a session/prompt.

    Slop heads be swearing by one slot machine one week and swearing it off the next like an addicted gambler describing their favorite slot machines from week to week.

    This isn't a coincidence, these companies hire UX designers from mobile gaming and online gambling to help engineer their addictiveness.

    Its all in your head, and the output is no matter what always going to be worse than learning how to do something yourself and putting care into it.

    Handmade watches > mass manufactured watches. There's nothing special about the skills needed for the guy who runs a conveyer belt at a watch manufacturer in China. The watch made by the guy who makes one watch a month in Switzerland is prized and beloved.

    • So you're trying to convince a community of mostly engineers — using the example of a terribly outdated technology that stays overpriced purely through symbolization because the luxury industry's bubble keeps holding — that flashy looks, advertising, and fancy concepts aren't really beloved and worthy? Fascinating.

    • > the guy who makes one watch a month

      That's the thing, though. Most people alive today will never be able to possess such an object, no matter how prized and beloved it is. Still, if people want to be able to tell the time from their wrist in a reliable fashion, there are _plenty_ of far cheaper options available to them. The craftsmanship does have inherent value, yes. That does not mean the practical solution is worthless.

      There can be practices incorporated in the production of software, involving AI use in a responsible fashion (difficult, of course), that produces practical solutions to real world problems far faster than a group of industry-hardened veterans painstakingly polishing their codebase in pursuit of craftsmanship. Those who appreciate how it is made will pay for the crafstmanship. Those who cannot afford to do so, and only care about a solution working well enough for the tasks they want to accomplish, the production line is good enough.

Codex with 5.4/5.4 is great Idk havent seen anything more crazy with claude + more expensive

  • GPT 5.5 and 5.4 are such great models. I just tried opus 4.8 and took 30 minutes to be confronted with a bit laziness that makes me go crazy. 5.5 just doesn’t have this issue.

    • How do you compare them to 5.3 Codex? I am using 5.3 Codex for a while, I subjectively think it does better job than Opus 4.6/4.7, with a fraction of the cost, and I did give 5.5 a try and it seems a bit better but magnitudes more expensive.

This is true but OpenAI has been slowly boiling the frog here, too. $20 and $100 on their plan doesn't get you nearly as far as it did two, three months ago.

I use their (newish) 5x $100 plan and I routinely run out of weekly limits about a two days before the end of the week.

This has also goaded me into upgrading to $200 once before... and then had them hand out limits resets to everyone. Argh.