← Back to context

Comment by vardump

2 days ago

> Tesla's forward facing cameras are effectively monocular

Notably, human perception is effectively monocular in driving situations at distances of 60 feet or farther. It's best in the area where your limbs can reach.

We don't need stereoscopic vision to drive.

"precise" stereo vision is 30m, but the limit of depth perception is around 200m (some people are 500m)

crucially we have excellent implied depth, and object detection, something that even non-realtime state of the art tracking doesn't have.

human depth is much more complex than just parallax, which some poeple use as an argument that "pure vision" monocular depth is possible to do robustly. It will be, but not for a while. Especially as depth is only part of the problem. object categorisation is the other.