Comment by wongarsu

4 days ago

That's (scarily) pretty standard for most LLMs by now. Paste the same images into ChatGPT and you will get a very accurate guess

It's also pretty fun to do this with Gemma 4 with its very pretty and structured reasoning output (which SotA model providers hide). For example for one picture that it misidentified as being taken inside the "Long Room of the Old Library at Trinity College Dublin" I can see that it did consider the correct answer (Duke Humfrey's Library in Oxford) early on as one of three candidates, but was apparently mislead by the ceiling height and a window in the background