Comment by aaln
1 year ago
Extract all the text from the PDF, turn the pdf into images, send the text for each page along with the image to an LLM with a desired output strucutre.
1 year ago
Extract all the text from the PDF, turn the pdf into images, send the text for each page along with the image to an LLM with a desired output strucutre.
You are not doing any of the fancy table extractor stuff?