← Back to context

Comment by makeworld

6 months ago

I wonder if you could add error correction to get around OCR failures.

You could simply add a par2 file but the default setting makes it pretty big. I just tried on an 876 kB Word file. And I got a bunch of par files totaling 1158 kB. The man page says it'll correct up to 100 errors.

  • You could replicate the Word file twice, using 1752 kB, and it'll correct up to ≈7 million one-bit errors as long as there is at most one error at three equivalent bit offsets.

    • And that is why for error correction, the number of error it is guaranteed to always correct is a pretty important metric too!

  • How well do par2 files handle insertion errors?

    100 errors in an 876kB file would be about an 0.0012% errror rate. You are going to need another level of ECC before that.