Comment by semanticintent 5 hours ago [flagged]
esperent 4 hours ago
> A validator that checks "did the assistant reply?" instead of "was the reply correct?" was never a benchmark. It was a participation trophy
People can't even write a two-paragraph comment without AI now.