Comment by jakderrida

8 months ago

Holy crap! You were right about PaddleOCR. My personal benchmark for OCR tools is to submit several random pages from the first edition of Moody's Manual for Railroads.

https://imgur.com/r2RsJeH

The reason I use it is to test whether a tool is just reading letter by letter (even if it claims to do more) or is actually reading each letter and word in context. If it's letter by letter, I get hilariously awful results.

Sure, it got things wrong. But it also figured out some things even I couldn't decipher.
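
If anyone wants to put a number on "hilariously awful", here's a rough sketch of how the comparison could be scored: character error rate against a hand-typed transcription of the same page. This is not something I actually ran, the function names are just ones I picked, and the sample strings are made up rather than taken from the manual. It's plain Python and doesn't touch any OCR library; you'd paste in whatever text your tool spits out.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, and substitutions to turn a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def char_error_rate(reference: str, ocr_output: str) -> float:
    """Edit distance divided by the length of the hand-typed reference."""
    if not reference:
        raise ValueError("reference transcription must not be empty")
    return levenshtein(reference, ocr_output) / len(reference)


# Made-up strings for illustration: one wrong character out of eight.
print(char_error_rate("Railroad", "Rai1road"))  # 0.125
```

A tool that reads in context should score noticeably lower on pages like these than one that guesses each letter on its own, but the score only means something relative to other tools on the same pages.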