Comment by maleldil
7 days ago
The API is pretty nice and easy to get started, but I couldn't get good results with parsing scientific paper PDFs, unfortunately (including OCR). Are there plans to use other backends? Docling works alright, and LLMs like Gemini Flash are interesting too.
Yes, there have already been several suggestions here for other backend etc.
You should try using a different PSM to see if you get better results.
If it's scientific texts specifically, look at grobid