Comment by ceroxylon

12 hours ago

Strangely enough, my first test with Sonnet 4.6 via the API for a relatively simple request was more expensive ($0.11) than my average request to Opus 4.6 (~$0.07), because it used way more tokens than what I would consider necessary for the prompt.

2 comments

svachalek 10 hours ago

This is an interesting trend with recent models. The smarter ones get away with far fewer thinking tokens, which partially or even fully negates the speed/price advantage of the smaller models.
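To make the arithmetic concrete, here is a rough sketch of how a cheaper-per-token model can still cost more per request. All prices and token counts below are made-up placeholders, not the real rates or usage of any model, and `request_cost` is just a hypothetical helper name.

```python
def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the request cost in dollars; prices are per million tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical per-million-token prices (placeholders, not real pricing).
SMALL = {"input_price": 1.0, "output_price": 5.0}
LARGE = {"input_price": 2.0, "output_price": 10.0}

# Same prompt for both; assume the small model emits 4x the thinking/output tokens.
small_cost = request_cost(1_000, 20_000, **SMALL)
large_cost = request_cost(1_000, 5_000, **LARGE)

# Ignoring input tokens, the small model loses its price edge once its output
# token count exceeds this multiple of the large model's output token count.
break_even_ratio = LARGE["output_price"] / SMALL["output_price"]

print(f"small model: ${small_cost:.3f}")
print(f"large model: ${large_cost:.3f}")
print(f"break-even output-token ratio: {break_even_ratio:.1f}x")
```

With these placeholder numbers the small model is half the price per token, so it stops being cheaper once it needs more than about twice as many output tokens for the same task.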

smartbit 3 hours ago

Just like humans :-)
E.g., a smart person will automate a task instead of executing it repeatedly.