← Back to context Comment by llmslave 3 hours ago The benchmarks on all these models are meaningless 2 comments llmslave Reply alchemist1e9 3 hours ago Why and what would a good benchmark look like? moffkalast 2 hours ago 30 people trying out all models on the list for their use case for a week and then checking what they're still using a month after.
alchemist1e9 3 hours ago Why and what would a good benchmark look like? moffkalast 2 hours ago 30 people trying out all models on the list for their use case for a week and then checking what they're still using a month after.
moffkalast 2 hours ago 30 people trying out all models on the list for their use case for a week and then checking what they're still using a month after.
Why and what would a good benchmark look like?
30 people trying out all models on the list for their use case for a week and then checking what they're still using a month after.