Comment by carlosjobim
4 days ago
That doesn't make much sense, since a typewriter will type neither Calibri nor Times New Roman. And OCR should only be needed for typewritten documents, because any document made with Calibri or TNR is already digital.
printed documents, images, horribly inaccessible pdfs, horribly inaccessible websites
> Printed documents - Use the original, which is digital.
> Images - Use the original, which is digital.
> horribly inaccessible pdfs - Use the original, which has real text in the PDF.
> horribly inaccessible websites - All text on any website is digital. Nobody uses OCR on a website.
A massive paper producer like the government shouldn't adapt their typesetting to people who are using technology wrongly.
an example from today (pdf warning): https://www.ntsb.gov/news/Documents/National%20Defense%20Aut...
It's easier to mandate a font than to excise every process within the federal bureaucracy that results in these.
Whether an image is digital has no bearing on whether it needs OCR.
We have a process at work where clients export information from their database as a PDF, which they email to us so that we can OCR it and insert it into our database.
No one else seems to think this is batshit insane.