Comment by whateveracct 9 hours ago [flagged] 5 comments

fastball 9 hours ago
It's just Simon Willison (the person you are replying to) who always makes a pelican, as his personal flippant benchmark. It's not that deep.

dewey 9 hours ago
No benchmark will be perfect, especially if it's public but it's a fun experiment to visually see how these models get better and better.

post-it 9 hours ago
Why is it so wrong?

simonw 9 hours ago
Thanks for the "scientific air" remark, that gave me a genuine LOL.

  a96 6 hours ago
  "The difference between screwing around and science is writing it down" -- Adam Savage