Comment by ks2048
9 days ago
I've been doing some experiments with the OCR API on macOS lately and wonder how it compares to these LLMs.
Overall, it's very impressive, but makes some mistakes (on easy images - i.e. obviously wrong) that require human intervention.
I would like to compare it to these models, but this benchmark is beyond OCR - extracted structured JSON.
No comments yet
Contribute on Hacker News ↗