Comment by cortesoft
19 hours ago
I don’t think these numbers are accurate. They seem to ignore the fact that the models have a cache for ongoing sessions, which means you (normally) aren’t actually processing all those tokens from scratch on every request. You only need to if you go too long between requests and the cache has expired.
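To make the point concrete, here is a rough sketch of the arithmetic. Everything in it is an assumption for illustration: the function name `fresh_tokens`, the turn sizes, the gaps, and the `cache_ttl` value are hypothetical and not taken from any provider's actual pricing or cache lifetime. It only counts how many tokens would have to be processed without a cache hit.

```python
def fresh_tokens(turn_sizes, gaps, cache_ttl=None):
    """Count tokens processed without a cache hit across one session.

    turn_sizes: tokens added to the conversation by each turn
    gaps: seconds since the previous request, per turn (first entry unused)
    cache_ttl: hypothetical cache lifetime in seconds; None disables caching
    """
    total = 0
    history = 0  # tokens already in the conversation before this turn
    for i, (size, gap) in enumerate(zip(turn_sizes, gaps)):
        # A hit requires caching, a previous request, and a short enough gap.
        cache_hit = cache_ttl is not None and i > 0 and gap <= cache_ttl
        if cache_hit:
            total += size            # only the new tokens are processed fresh
        else:
            total += history + size  # the whole history is reprocessed
        history += size
    return total


# Hypothetical session: four turns of 1,000 tokens each.
turns = [1000, 1000, 1000, 1000]

no_cache = fresh_tokens(turns, [0, 10, 10, 10])                  # 10000
warm_cache = fresh_tokens(turns, [0, 10, 10, 10], cache_ttl=300)  # 4000
one_expiry = fresh_tokens(turns, [0, 10, 600, 10], cache_ttl=300) # 6000
```

Under these made-up numbers, the uncached total grows with the square of the session length, while the cached total stays roughly linear unless a long pause forces the history to be reprocessed.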