Comment by kachau

5 months ago

can you please share some details how are you using docling? This looks very promising but I am not sure how to use this one basically we have built document parser for all type of documents to extract texts and then feed these texts to llms to further find out semantics of these texts? do you think docling will help here with efficiency and latency?

1 comment

kachau

rapjul 5 months ago

Docling works quite well for me to convert a scanned book PDF to Markdown text.

On the command line, first install `uv` from https://github.com/astral-sh/uv?tab=readme-ov-file#installat..., then run `uv tool install -U "docling[tesserocr,ocrmac,vlm]"` (first includes the tesserocr, ocrmac (macOS only), and vlm (for running a small Image-to-Text model to get descriptions of images).

You go here https://github.com/DS4SD/docling/blob/main/pyproject.toml#L1... to see all the extra installation options.

For cached/offline use, run `docling-tools models download` to download their models.