← Back to context

Comment by cubefox

17 days ago

Why is Gemini Flash so much cheaper than other models here?

probably a mix of economies of scale (google workspace and search are already massive customers of these models meaning the build out is already there), and some efficiency dividends from hardware r&d (google has developed the model and the TPU hardware purpose built to run it almost in parallel)