Comment by iwontberude

1 day ago

As long as you use models trained or distilled for your use case, there is no need to waste compute on trillions of parameters. Anthropic and OpenAI are proving that the “everything” models are not a sustainable business model.

Just-In-Time or dynamic precompute of distilled models have already begun reducing the use of these frontier models for task inference.