← Back to context

Comment by triceratops

7 hours ago

Why would token speed matter for anything other than getting work done faster? It's in the name - "speed".

This would be true if the models were capable of always completing the tasks. But, since their failure rate is fairly high, going in a wrong direction for longer could mean that you take more time than a faster model, where you can spot it going wrong earlier.