Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by vr46

7 months ago

I’ll have to test this against my local Python pipeline which does all this without an LLM in attendance. There are a ton of existing Python libraries which have been doing this for a long time, so let’s take a look..

2 comments

vr46

Reply

thegabriele  7 months ago

Care to share the best ones for some use cases? Thanks

  • vr46  7 months ago

    MinerU

    PDFQuery

    PyMuPDF (having more success with older versions, right now)

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities