Comment by ant6n

1 year ago

I recently did some OCRing with OpenAI. I found o3-mini-hi to be imagining and changing text, whereas the older (?) o4 was more accurate. It’s a bit worrying that some of the models screw around with the text.

8 comments

ant6n

jazzyjackson 1 year ago

There’s GPT4, then GPT4o (o for Omni, as in multi modal) and then GPT o1 (chain of thought / internal reasoning) then o3 (because o2 is a stadium in London that I guess is very litigious about its trademark?), o3-mini is the latest but yes optimized to be faster and cheaper

polshaw 1 year ago
o2 is the UK's largest mobile network operator. They bought naming rights to what was known as the millennium dome (not even a stadium).
- jazzyjackson 1 year ago
  
  Ahh makes sense :)
dotancohen 1 year ago
What is the o3 model good for? Is it just an evolution of o1 (chain of thought / internal reasoning)?
- KTibow 1 year ago
  
  Yes
  (albeit I believe o3-mini isn't natively multimodal)
  
  1 reply →
ant6n 1 year ago
Which one is the smartest, and most knowledgeable? (Like least likely to make up facts)
- wrsh07 1 year ago
  
  4o is going to be better for a straight up factual question
  (But eg I asked it about something Martin Short / John Mulaney said on SNL and it needed 2 prompts to get the correct answer..... the first answer wasn't making anything up it was just reasonably misinterpreting something)
  It also has web search which will be more accurate if the pages it reads are good (it uses bing search, so if possible provide your own links and forcibly enable web search)
  Similarly the latest Anthropic Claude Sonnet model (it's the new Sonnet 3.5 as of ~Oct) is very good.
  The idea behind o3 mini is that it only knows as much as 4o mini (the names suck, we know) but it will be able to consider its initial response and edit it if it doesn't meet the original prompt's criteria