Comment by nl

5 days ago

What on earth do you mean?

I live near an area with lots of pelicans. If you look up at one flying overhead this is what they look like.

Here is a photo for comparison: https://commons.wikimedia.org/wiki/File:American_white_pelic...

Sure, something like that. Not like the examples you posted.

  • I'm very confused. The SVGs show the beak, wing, tail, feet and body as though viewed from directly underneath.

    They look similar to the photo, but meet the instructions better ("from underneath").

    What are you expecting exactly?

    • They don't look anything like the photo.

      They're not 'oblique' - they're 'squared' views and none of the anatomy looks appropriately adjusted.

      The model has no ability to 'rotate a figure in 3d space' and conceptualize how all of the elements work together.

      It's 'pattern matching'.

      This is a good intuition for how LLMs work. It's not a perfect one, because a lot of 'synthetic reasoning' can obviously be done.

      And they probably never will be able to. LLMs are not the right tool for this kind of task.

      Think about how they can investigate massive code-bases and find arcane bugs, but can't draw a duck from arbitrary oblique angles etc.

      That said, with enough examples they probably could.