Comment by xg15
1 year ago
This is already amazing, but one possible improvement: use the metadata (time and coordinates) to look up landmarks in the area, or events/gatherings/conferences/etc. that took place near that location at that time, then add those to the prompt.
I posted some images that showed a well-known local landmark during a Christmas fair, as well as a view of a nearby city.
The model accurately described the architectural details of the landmark that could be inferred from the photo, mentioned that there seemed to be some event going on, and speculated about the city in the background - but of course it had no way of knowing, purely from the photo, which landmark, event and city it was looking at.
I think this slightly underestimates the amount of information you can extract from a photo: if you have a GIS database, it's not hard to know this stuff (or at least get a list of likely candidates) - and the kind of actors that this project is warning against very likely have one.
Also I'd be interested to see if the model could combine the context and the details from the photo to make some interesting additional observations.
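To make the lookup idea concrete, here's a rough sketch of what I have in mind. Everything in it is an assumption for illustration: the `Place` and `Event` records stand in for whatever a real GIS database would return, and the function names, the 1 km radius and the prompt wording are made up, not anything the project actually does.

```python
import math
from dataclasses import dataclass
from datetime import datetime

EARTH_RADIUS_KM = 6371.0


@dataclass
class Place:
    """Hypothetical landmark record, as a GIS lookup might return it."""
    name: str
    lat: float
    lon: float


@dataclass
class Event:
    """Hypothetical event record with a location and a time window."""
    name: str
    lat: float
    lon: float
    start: datetime
    end: datetime


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearby_context(lat, lon, taken_at, places, events, radius_km=1.0):
    """Return candidate landmarks and events close to the photo's EXIF data.

    Landmarks are filtered by distance only and sorted nearest first;
    events must also be running at the time the photo was taken.
    """
    def dist(item):
        return haversine_km(lat, lon, item.lat, item.lon)

    near_places = sorted((p for p in places if dist(p) <= radius_km), key=dist)
    near_events = [
        e for e in events
        if dist(e) <= radius_km and e.start <= taken_at <= e.end
    ]
    return near_places, near_events


def build_prompt_context(near_places, near_events):
    """Format the candidates as extra text to put in front of the image prompt."""
    lines = []
    if near_places:
        lines.append("Possible landmarks near the photo location: "
                     + ", ".join(p.name for p in near_places))
    if near_events:
        lines.append("Events taking place nearby at that time: "
                     + ", ".join(e.name for e in near_events))
    return "\n".join(lines)
```

The output is only a list of candidates, so the prompt should present them as possibilities and leave it to the model to decide which ones actually match what's visible in the photo.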