← Back to context

Comment by vunderba

17 hours ago

Yeah I think that's a fair critique. It kind of looks like a bad cut-and-replace job (if you zoom in you can even see part of the neck is missing). I might give it some more attempts to see if it can do a better job.

I agree that Seedream could definitely be called out as a fail since it might just be a trick of perspective.

Have you ever considered a “partial pass”?

Perhaps it would be an easy cop out of making a decision if you had to choose something outside of pass/fail.

  • That's not a bad suggestion. I thought about adding a numerical score but it felt like it was bit overwhelming at the time. Maybe I should revisit it though in the form of:

      Fail = 0 points
      Partial = 0.5 points
      Success = 1 point
    

    There's definitely a couple of pictures where I feel like I'm at the optometrist and somehow failing an eye exam (1 or 2, A... or B).

    • I agree with this, some of those are "passing" and others are really passing. Specially with how much better some of the new model is compared to old ones.

      I think the paws one is a good example where I think the new model got 100% while the other was more like 75%