← Back to context

Comment by zozbot234

2 years ago

OP says that Gemini had still images as input, not video - and the dev blog post shows it was instructed to reply to each input in relevant terms. Needless to say, that's quite different from what's implied in the demo, and at least theoretically is already within GPT's abilities.

3 comments

zozbot234

Reply

valine 2 years ago

How do you think the cup demo works? Lots of still images?

watusername 2 years ago
A few hand-picked images (search for "cup shuffling"): https://developers.googleblog.com/2023/12/how-its-made-gemin...
- valine 2 years ago
  
  Holy crap that demo is misleading. Thanks for the link.