Comment by glaslong

3 hours ago

Definitely has that smell... At the same time though, they NEED inference cost to drop substantially, and even better for them if it only happens for their models on their hardware.

I assume they're doing everything they can to make that happen model-side, but coming at it from the other end makes sense too if they can swing it.