amosjyng 10 hours ago
How are you collecting your metrics on token usage and reliability?

vidarh 8 hours ago
They are from my own runs, with reliability measured in terms of passing extensive test suites. So the caveat is that this applies to my specific use and might well vary greatly.