Comment by becomevocal

11 hours ago

My first thought was "only 30 tasks", but the findings map to what I've seen personally: code review consumes the majority of tokens.

Code review could also be run as an unattended/batched task though, possibly with at least some use of on-prem inference (which excels at this). That would be a major saving compared to the usual cloud inference scenario.