Comment by hughes
2 days ago
> I’m confident it didn’t cheat and look at the EXIF data on the photograph, because if it had cheated it wouldn’t have guessed Cambria first.
If I were cheating on a similar task, I might make it more plausible by suggesting a slightly incorrect location as my primary guess.
Would be interesting to see if it performs as well on the same image with all EXIF data removed. It would be most interesting if it fails, since that might imply an advanced kind of deception...
There have been a few cases where the LLM clearly did look at the EXIF, got the answer, then confabulated a bunch of GeoGuessr logic to justify the answer. Sometimes that's presented as deception/misalignment, but that's a category error: "find the answer" and "explain your reasoning" are two distinct tasks, and LLMs are not actually smart enough to coherently link them. They do one autocomplete for generating text that finds the answer and a separate autocomplete for generating text that looks like an explanation.
> Sometimes that's presented as deception/misalignment but that's a category error: "find the answer" and "explain your reasoning" are two distinct tasks
Right, but if your answer to "explain your reasoning" is not a true representation of your reasoning, then you are being deceptive. If it doesn't "know" its reasoning, then the honest answer is that it doesn't know.
(To head off any meta-commentary on humans' inability to explain their own reasoning: a human would at least be able to honestly describe whether they used EXIF or actual semantic knowledge of the photograph.)
My point is that dishonesty/misalignment doesn't make sense for o3, which is not capable of being honest because it's not capable of understanding what words mean. It's like saying a monkey at a typewriter is being dishonest if it happens to write a falsehood.
I think an alternative possible explanation is that it could be "double checking" the metadata. You could test this by providing images with manipulated metadata.
Do you have links to any of those examples?
I have one link that illustrates what I mean: https://chatgpt.com/share/6802e229-c6a0-800f-898a-44171a0c7d... The line about "the latitudinal light angle that matches mid‑February at ~47 ° N." seems like pure BS to me, and in the reasoning trace it openly reads the EXIF.
A clearer example I don't have a link for (it was somewhere on Twitter): someone tested a photo from Suriname and o3 said one of the clues was left-handed traffic. But there was no traffic in the photo. "Left-handed traffic" is a very valuable GeoGuessr clue, and it seemed to me that once o3 read the Surinamese EXIF, it confabulated the traffic detail.
It's pure stochastic parroting: given that you are playing GeoGuessr honestly, and given that the answer is Suriname, the conditional probability that you mention left-handed traffic is very high. So o3 autocompleted that for itself while "explaining" its "reasoning."
If you ask "where was this photo taken?" and you provide the EXIF data, why would that be cheating?
That really depends on your prompt. "Guess where this photo was taken" at least mildly implies that using EXIF isn't in the spirit of the thing.
A better prompt would be "Guess where this photo was taken, do not look at the EXIF data, use visual clues only".
He mentions this in the same paragraph:
> If you’re still suspicious, try stripping EXIF by taking a screenshot and run an experiment yourself—I’ve tried this and it still works the same way.
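As a side note, the screenshot trick can also be done programmatically; here's a minimal sketch, assuming the Pillow library and a hypothetical file name, that re-encodes only the pixel data so the EXIF block gets left behind:

    # Drop EXIF by copying only the pixels into a fresh image (assumes Pillow: pip install pillow)
    from PIL import Image

    img = Image.open("photo.jpg")
    clean = Image.new(img.mode, img.size)
    clean.putdata(list(img.getdata()))  # pixel values only; no metadata is carried over
    clean.save("photo_no_exif.jpg")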
Why didn't he do that then for this post?
Even better, edit it and place a false location.
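A sketch of that test, assuming the piexif library (my choice, not anything from the thread) and arbitrary example coordinates for central Paris, which writes a false GPS location into a copy of the photo before handing it to the model:

    # Plant fake GPS EXIF in a copy of the photo (assumes piexif: pip install piexif)
    import piexif

    def to_dms_rational(deg):
        # Decimal degrees -> EXIF rational degrees/minutes/seconds (positive values only).
        d = int(deg)
        m = int((deg - d) * 60)
        s = round((deg - d - m / 60) * 3600 * 100)
        return [(d, 1), (m, 1), (s, 100)]

    exif = piexif.load("photo.jpg")
    exif["GPS"] = {
        piexif.GPSIFD.GPSLatitudeRef: "N",                       # fake latitude: 48.8584 N
        piexif.GPSIFD.GPSLatitude: to_dms_rational(48.8584),
        piexif.GPSIFD.GPSLongitudeRef: "E",                      # fake longitude: 2.2945 E
        piexif.GPSIFD.GPSLongitude: to_dms_rational(2.2945),
    }
    piexif.insert(piexif.dump(exif), "photo.jpg", "photo_fake_gps.jpg")

If the model's guess snaps to Paris, it read the tags; if it still reasons its way to the true location, the EXIF explanation looks much weaker.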
Because I'd already determined in prior experiments that it wasn't using EXIF, and didn't bother for the one that I wrote up.
I added two examples at the end just now where I stripped EXIF via screenshotting first.