Comment by lacoolj

11 hours ago

OpenAI, ChatGPT, Codex

So many of the things that pioneered the way for the truly good (Claude, Gemini) to evolve. I am thankful for what they have done.

But the quality is gone, and they are now in catch-up mode. This is clear, not just from the quality of GPT-5.x outputs, but from this article.

They launch something new and flashy that should get the attention of all of us. And yet, they only launch to Apple devices?

Then, there are typos in the article. Again. I can't believe they would be sloppy about this with so much on the line. EDIT: since I know someone will ask, couple of examples - "7MM Tokens", "...this prompt initial prompt..."

And why are they not giving the full prompt used for these examples? "...that we've summarized for clarity" but we want to see the actual prompt. How unclear do we need to make our prompts to get to the level that you're showing us? Slight red flag there.

Anyway, good luck to them, and I hope it improves! Happy to try it out when it does, or at the very least, when it exists for a platform I own.

7 comments


halflings 9 hours ago

The main thing I noticed in the video is that they have heavily sped up all the code generation sections... seems to be on 5x speed or more. (because people got used to how fast and good Sonnet, and especially Gemini 3.0 Flash, are)

rirze 10 hours ago

Not sure when you last evaluated the tools, but I strongly prefer Codex to Claude Code and Gemini.

Codex gets complex tasks right and I don't keep hitting usage limits constantly. (this is comparing the 20$ ChatGPT to the 200$ Claude Pro Max plans fwiw)

The tooling around ChatGPT and Codex is less, but their models are far more dependable imo than Anthropic's at this very moment.

girvo 9 hours ago

I don’t hit Codex limits because it’s so much slower, is what I’ve found personally.

touristtam 9 hours ago

I am not sure how those TUIs are going to fare against multi-provider ones like opencode.

nl 7 hours ago

> truly good (Claude, Gemini) to evolve

Claude yes, but Codex is much better than Gemini in every way that matters except speed in my experience.

Gemini 3 Flash is an amazing model, but Gemini 3 Pro isn't great. It can do good work, but it's pretty random whether it will, or whether it will go off the rails and do completely the wrong thing. OTOH GPT 5.2 Codex with high thinking is the best model currently available (slightly better than Opus 4.5)

deepfriedbits 10 hours ago

I can't speak to the typos, but launching first for macOS is not something new for OpenAI. They did the same with their dedicated desktop client.

adeelk93 3 hours ago

What’s the “7MM Tokens” typo?