Comment by whateveracct 12 hours ago [flagged] 5 comments
fastball 11 hours ago It's just Simon Willison (the person you are replying to) who always makes a pelican, as his personal flippant benchmark. It's not that deep.
dewey 11 hours ago No benchmark will be perfect, especially if it's public, but it's a fun experiment to visually see how these models get better and better.
simonw 11 hours ago Thanks for the "scientific air" remark, that gave me a genuine LOL.
a96 9 hours ago "The difference between screwing around and science is writing it down" -- Adam Savage
post-it 11 hours ago Why is it so wrong?