Comment by deadeye
7 days ago
I don't think compacting often is good for saving money. It generates more output tokens and then the input is no longer from cache, which is priced differently...typically very differently.
7 days ago
I don't think compacting often is good for saving money. It generates more output tokens and then the input is no longer from cache, which is priced differently...typically very differently.
No comments yet
Contribute on Hacker News ↗