Comment by solenoid0937
3 hours ago
I suspect for many companies, the sunk cost of tokens relative to the output gain is low. The productivity gain we get from AI is such that using the latest Opus or GPT far outweighs the cost savings using a non frontier Chinese model.
Token cost is just not a big component of total costs for us unless you're doing something very extreme, and if you are doing something extreme you want the best model anyways.
I'm doubtful that the companies telling their employees to burn more tokens are doing careful evaluations of cost versus benefit. People on an expense account don't shop around much.
Maybe they'll penny-pinch later after running through their AI budgets?
Did anybody compared these directly using exactly same prompts and harness? I assume V4 Pro could be real frontier model, and if it's true, it'd be better to use it in automation or routine steps instead of simple models (e.g. haiku or even sonnet if V4pro is better)