Comment by jjuliano
8 days ago
If you are interested, I also made an AI assisted OCR API - https://github.com/kdeps/examples
It combines Tesseract (for images) and Poppler-utils (PDF). A local open-source LLMs will extract document segments intelligently.
It can also be extended to use one or multiple Vision LLM models easily.
And finally, it outputs the entire AI agent API into a Dockerized container.
No comments yet
Contribute on Hacker News ↗