← Back to context

Comment by pineapple_opus

6 days ago

All I see is mention of how various models generate image of "pelican riding bicycle(s)"

6 comments

pineapple_opus

Reply

emil-lp 6 days ago

Yes, the "pelican riding a bicycle" is the ultimate test of not understanding how LLMs work.

Well, a combination of that and believing that replication of test data is a good measure of progress.

vessenes 6 days ago
Spicy — why does it show ultimate non-understanding?
- JohnKemeny 5 days ago
  
  because success comes from reproducing a memorized pattern rather than transferable reasoning?
  At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.
  What is it exactly that such a test is testing?
  In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle?
  
  1 reply →

ClikeX 6 days ago

We all know the true test of AI is Will Smith eating spaghetti.

ActionHank 5 days ago

Wait, are you saying you don't handcraft svgs of pelicans riding bicycles?