← Back to context

Comment by crazygringo

2 hours ago

> when we talk relative in this scenario we usually (always?) mean from the perspective of the target or "owner".

I dunno... I feel pretty confident 99% percent of people would do the same thing, and put the strawberry in the eye socket to our left, the viewer's.

You really have to be trained explicitly to put yourself in the subject's shoes, and very few people are. To me, the model is correctly following the instructions most people will mean.

And it's not even incorrect. "The left x" is linguistically ambiguous. If you say "the left flower", it's obviously the flower to our left. So when you say "the left eye socket", the eye socket to our left is a valid interpretation. If they had said their or its left eye socket, then it's more arguable that it must be from the subject's side. But that's not the case in this example.

There's a puzzle in the latest Indiana Jones game that exploits the fact that yes, most people would do the same thing.