Comment by i_have_an_idea
2 days ago
Just because it is performing rather poorly by comparison, it doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
2 days ago
Just because it is performing rather poorly by comparison, it doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
It isn't benchmaxxed because they are using human preference as an evaluation.