Comment by FlyingSnake

1 day ago

At this point drawing these Pelicans must be in the training data sets.

25 comments

FlyingSnake

scosman 19 hours ago

not if I can help it!

https://github.com/scosman/pelicans_riding_bicycles

AmbroseBierce 16 hours ago

I hereby certify that these are indeed the most perfect and precise svg depictions of pelican riding a bicycle, also known among biology scholars as pelycles
wvlia5 15 hours ago

Just a few years ago, this would have been a meaningless repo.
justinclift 15 hours ago

That's truly a wonderful collection of pelicans riding bicycles.
Much Win! ;)
ValentineC 11 hours ago

These are amazing. I smiled after I saw just how wonderfully rendered they are.
razodactyl 14 hours ago

These pelicans are clearly indicative of good RL training algorithms.
takihito 6 hours ago

I want to fly too
smcleod 17 hours ago

This is pretty funny
ahmadyan 16 hours ago

I love it!
icelancer 19 hours ago
love this adversarial work
- knollimar 12 hours ago
  
  yeah putting the captcha on there to thwart the LLMs ability to extract good pelicans was a really good idea
- archon810 14 hours ago
  
  Shhhhh, they're going to be on to us.

abustamam 15 hours ago

Could be! Simon wrote about that here though https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

stingraycharles 13 hours ago
> If a model finally comes out that produces an excellent SVG of a pelican riding a bicycle you can bet I’m going to test it on all manner of creatures riding all sorts of transportation devices.
This relies on the false premise that, if they would include it in their training dataset, it would be perfect. All they need to do is be good enough and better than the other, not perfect.
- abustamam 10 hours ago
  
  I'm not sure if we can have a "perfect" Pelican riding a bicycle. Like, I could probably commission a highly experienced artist to draw one and I don't think it would be perfect. The legs would probably have to be too long, or pedals oddly placed, or handles strange, or wings with hands.
  Based on the one Simon commented though, I'd say we're in decent territory to try the latter part of his hypothesis.
  
  1 reply →

BrokenCogs 18 hours ago

Yes we all know that, but we still like to see the pelicans because it's a tradition more or less

alfiedotwtf 3 hours ago

Why no Utah Teapot!

ffsm8 21 hours ago

Clearly not.

I mean the prompt was succinct and clear, as always - and it still decided to hallucinate multiple features (animation + controls) beyond the prompt.

It'd also like to point out that to date no drawing was actually good from an actual quality perspective (as in comparative to what a decent designer would throw together)

Theyre always only "good" from the perspective of it being a one shot low effort prompt. Very little content for training purposes.

nwienert 21 hours ago
The way I’ve come to think of LLM is that what the produce in a single reply even with thinking turned up, is akin to what you’d do in a single short session of work.
And so if you ask it to do something big it will do a very surface level implementation. But if you have it iterate many times, or give it small pieces each time, you’ll end up with something closer to what a human would do.
I imagine the pelican test but done in a harness that has the agents iterate 10+ times would be closer to what you’d expect, especially if a visual model was critiquing each time.
- slopinthebag 19 hours ago
  
  Yeah, this is how I use AI. Instead of a single session one-shot, it's usually limited to single targeted edits, and then I steer it on each step. Takes longer but the output is actually what I want.
serial_dev 19 hours ago
What does good even mean… I have no idea what a good “pelican on a bike” should look like. It’s a fun prompt because there is no good answers… at least so I thought.
- abustamam 15 hours ago
  
  Yeah that was exactly Simon's intent. https://simonwillison.net/2025/Nov/13/training-for-pelicans-...
  
  1 reply →

GorbachevyChase 13 hours ago

I’m OK with a Chinese model getting the W. It’s ultimately good for all of us.