← Back to context Comment by pineapple_opus 6 days ago All I see is mention of how various models generate image of "pelican riding bicycle(s)" 6 comments pineapple_opus Reply emil-lp 6 days ago Yes, the "pelican riding a bicycle" is the ultimate test of not understanding how LLMs work.Well, a combination of that and believing that replication of test data is a good measure of progress. vessenes 6 days ago Spicy — why does it show ultimate non-understanding? JohnKemeny 5 days ago because success comes from reproducing a memorized pattern rather than transferable reasoning?At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.What is it exactly that such a test is testing?In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle? 1 reply → ClikeX 6 days ago We all know the true test of AI is Will Smith eating spaghetti. ActionHank 5 days ago Wait, are you saying you don't handcraft svgs of pelicans riding bicycles?
emil-lp 6 days ago Yes, the "pelican riding a bicycle" is the ultimate test of not understanding how LLMs work.Well, a combination of that and believing that replication of test data is a good measure of progress. vessenes 6 days ago Spicy — why does it show ultimate non-understanding? JohnKemeny 5 days ago because success comes from reproducing a memorized pattern rather than transferable reasoning?At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.What is it exactly that such a test is testing?In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle? 1 reply →
vessenes 6 days ago Spicy — why does it show ultimate non-understanding? JohnKemeny 5 days ago because success comes from reproducing a memorized pattern rather than transferable reasoning?At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.What is it exactly that such a test is testing?In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle? 1 reply →
JohnKemeny 5 days ago because success comes from reproducing a memorized pattern rather than transferable reasoning?At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.What is it exactly that such a test is testing?In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle? 1 reply →
Yes, the "pelican riding a bicycle" is the ultimate test of not understanding how LLMs work.
Well, a combination of that and believing that replication of test data is a good measure of progress.
Spicy — why does it show ultimate non-understanding?
because success comes from reproducing a memorized pattern rather than transferable reasoning?
At the same time failure proves little because most humans also could not manually create a correct SVG of a pelican riding a bicycle.
What is it exactly that such a test is testing?
In which situation would you measure the "competence" of a human being by asking them to write an SVG of a pelican riding a bicycle?
1 reply →
We all know the true test of AI is Will Smith eating spaghetti.
Wait, are you saying you don't handcraft svgs of pelicans riding bicycles?