Comment by ashirviskas
2 hours ago
I created this sheet to get proper model accuracy using the the lenz data, check it out.
Note: It may still not be perfectly accurate representation of truth as it uses user submitted data. I also used AI to build the sheet.
https://docs.google.com/spreadsheets/d/e/2PACX-1vSnZlURmyYX3...
Awesome. We do plan to human-label the 1,000 claims and then compare Lenz' performance vs the 5 models. We've done some limited internal research with 150 claims, but more are needed for statistical significance.