Comment by itake
6 days ago
Cost, latency, and performance are huge reasons why my company chooses to fine-tune models. We start by using a base model for a task, and as our traffic grows, we tune a smaller model, resulting in huge performance and cost savings.
I think something similar to Self-Adapting Language Models will automate this task soon.
https://arxiv.org/pdf/2506.10943