Comment by wahnfrieden

3 months ago

Do any LLM OCRs give bounding boxes anyway? Per character and per block.

Try MinerU 2.5 with two-step parsing. It gives good results with bounding boxes per block. Not sure if you can get it to do more detailed such as word or character level.