Comment by simonw

4 months ago

The models really are getting better though. Compare Gemini 1.5 and Gemini 2.5 on the same PDF document (I've done this a bunch) and you can see the difference.

The open question is how much better they need to get before they can be deployed for situations like this that require a VERY high level of reliability.

2 comments

simonw

lysecret 4 months ago

I fully agree. My point was more a lot of commenters seem or implicitly compare the llm based approach with some “better” or “simpler” approach which really doesn’t exist from my estimation LLMs are sota for this kind of extractions (though they still have issues).

hoosieree 4 months ago

People don't respect the chasm between "obviously no mistakes" and "no obvious mistakes".