Comment by themafia

3 months ago

> To benchmark LLM handwriting accuracy, last year Dr. Lianne Leddy and I developed a set of 50 documents comprising some 10,000 words—we had to choose them carefully and experiment to ensure that these documents were not already in the LLM training data (full disclosure: we can’t know for sure, but we took every reasonable precaution).

You're drawing conclusions from _this_? Let alone pretending that "it did something else unexpected that can only be described as genuine, human-like, expert level reasoning."

Give me a break. This entire industry is impossible to take seriously.