Comment by h0l0cube

1 year ago

FTA:

> This week, there was a viral blog about Gemini 2.0 being used for complex PDF parsing, leading many to the same hypothesis we had nearly a year ago at this point. Data ingestion is a multistep pipeline, and maintaining confidence from these nondeterministic outputs over millions of pages is a problem.

2 comments

h0l0cube

password4321 1 year ago

Yes and per the poster's opening comment:

https://news.ycombinator.com/item?id=42966958#42966959

h0l0cube 1 year ago

It seemed you were implying the article was naive to the earlier post, whereas the OP poses itself as a rebuttal. Perhaps a fault of my inference.