← Back to context

Comment by gcanyon

1 day ago

> So that is essentially the ceiling in terms of accuracy.

I think this is mistaken. I remember... ten years ago? When speech-to-text models came out that dealt with background noise that made the audio sound very much like straight pink noise to my ear, but the model was able to transcribe the speech hidden within at a reasonable accuracy rate.

So with handwritten text, the only prediction that makes sense to me is that we will (potentially) reach a state where the machine is at least probably more accurate than humans, although we wouldn't be able to confirm it ourselves.

But if multiple independent models, say, Gemini 5 and Claude 7, both agree on the result, and a human can only shrug and say, "might be," then we're at a point where the machines are probably superior at the task.

That depends on how good we get at interpretability. If the models can not only do the job but also are structured to permit an explanation of how they did it, we get the confirmation. Or not, if it turns out that the explanation is faulty.