Comment by constantinum
3 days ago
Reading from the comments, some of the common questions regarding document extraction are:
* Run locally or on premise for security/privacy reasons
* Support multiple LLMs and vector DBs - plug and play
* Support customisable schemas
* Method to check/confirm accuracy with source
* Cron jobs for automation
There is Unstract that solves the above requirements.
No comments yet
Contribute on Hacker News ↗