lostmsu 3 months ago
The implementation details are irrelevant to the discussion of the true cost of running the models.
mzl 3 months ago
The cost of running things like prompt caching is determined by the implementation, since the implementation is what sets the infrastructure costs.