Comment by ai_slop_hater
8 hours ago
> You can add a bias to make it more or less likely for a model to generate a particular token, and that's how budgets work
Do you have a source for this? I am interested in learning more about how this works.
It's how temperature/top_p/top_k work. Anthropic also just put out a paper where they were doing a much more advanced version of this, mapping out functional states within the model and steering with them.
Huh, I wonder if that's why you cannot change the temperature when thinking is enabled. Do you have a link for the paper?
https://transformer-circuits.pub/2026/emotions/index.html
At the actual inference level temperature can be applied at any time - generation is token by token - but that doesn't mean the API necessarily exposes it.
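To make the "token by token" point concrete, here is a rough sketch of a single sampling step. It assumes you already have raw logits as a plain list; `sample_token` and its parameters are made up for illustration and are not any provider's API. It only shows that a per-token bias and a temperature are both adjustments applied to the logits at each step.

```python
import math
import random


def sample_token(logits, temperature=1.0, bias=None, rng=random):
    """Pick one token id from raw logits (hypothetical helper, not a real API)."""
    bias = bias or {}
    # Additive bias: raises or lowers the score of specific token ids.
    adjusted = [score + bias.get(i, 0.0) for i, score in enumerate(logits)]

    # Treat temperature <= 0 as greedy decoding (argmax).
    if temperature <= 0:
        return max(range(len(adjusted)), key=adjusted.__getitem__)

    # Temperature rescales the logits before the softmax.
    scaled = [score / temperature for score in adjusted]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]

    # random.choices normalizes the weights, so this is a softmax sample.
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


logits = [1.0, 2.0, 0.5]
print(sample_token(logits, temperature=0))                 # greedy pick
print(sample_token(logits, temperature=0, bias={2: 10.0})) # bias changes the pick
print(sample_token(logits, temperature=0.7))               # stochastic pick
```

Since this runs once per generated token, nothing stops an inference stack from using different `temperature` or `bias` values at different steps. Whether a hosted API lets you set them is a separate question.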