Comment by ck_one

17 days ago

What object detection model do you use?

Is tesseract even ML based? Oh, this piece of software is more than 19 years old, perhaps there are other ways to do good, cheap OCR now. Does Gemini have an OCR library, internally? For other LLMs, I had the feeling that the LLM scripts a few lines of python to do the actual heavy lifting with a common OCR framework.

Custom trained yolo v8. I've moved on since then and the work was done in 2023. You'd get better results for much less today.