Comment by andrepd
6 days ago
So they can cherry pick the 1 out of 10 times that it actually performs in an impressive manner? That's the essence of most AI demos/"benchmarks" I've seen.
Testing for myself has always yielded unimpressive results. Maybe I'm just unlucky?
Livestream would be fair.