Comment by becomevocal

11 hours ago

First thought was "only 30 tasks" however the findings map to what I've seen personally: code review consumes majority of tokens

2 comments

becomevocal

zozbot234 11 hours ago

Code review could also be run as an unattended/batched task though, possibly with at least some use of on-prem inference (which excels at this). That would be a major saving compared to the usual cloud inference scenario.

jwnin 5 hours ago

with which models, though?