← Back to context

Comment by codingwagie

15 days ago

Really? I have been using 4o, and its flawless at OCR

give it a shot with a few of the examples in the blog! or better yet, find some financial statements from Goldman/morgan Stanley and run it through the model.

Check the output again, there will be small mistakes if your text is large enough.

I used it once, was given a screenshot that contained a SHA1 hash and needed it in text. Maybe this is a case where ChatGPT can do a small task quickly for me and save me squinting?

It still fails on this today (the "bdbdffdf" part). Not allowed to share a chat with a picture it seems, my prompt was to upload the file below and "Image to text please.". Just the free 4o model, maybe the paid stuff is better.

https://postimg.cc/m1jNPL0j

  • Amusingly it tried to write a Python script to OCR it first, decided there were errors and tried to correct it.... it did correct some stuff and nearly got it, but I was able to spot an error 3/4 through with my eyeballs after a couple minutes.

    https://i.imgur.com/UuO3JxM.png