Comment by 6510
4 days ago
I've often wanted a bulk tool that takes the title or some other easy to find value from a pdf and renames the file to that.
4 days ago
I've often wanted a bulk tool that takes the title or some other easy to find value from a pdf and renames the file to that.
Appreciate you sharing that requirement!
The need for batch processing to pull out targeted data points from PDFs (rather than converting the whole document) is a valuable insight.
While the current tool focuses on full conversion to Markdown, enhancing https://pdftomarkdown.pro/ to handle specific data extraction tasks like yours is definitely something I'll consider carefully for the future roadmap. Thanks for highlighting it!
Unfortunately, PDFs are right buggers to work with and there often isn't an "easy to find value" for anything
You're absolutely right, PDFs can be incredibly tricky. That lack of a consistent, easily parsable structure for arbitrary data is the core challenge.
I mean easy in the PDF sense. I have folders full of randomString.pdf and name(15).pdf but those that share a folder all have the same layout.