Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library

Comment by wolttam

12 hours ago

It depends on the use-case. yes, 90% of cost is cache in agentic coding scenarios (actually 95% in my experience). But not when the model reasons for 200k+ tokens before answering a complex problem.

3 comments

wolttam

Reply

himata4113  11 hours ago

gemini models solve a problem in 80% less tokens so that's something to think about.

  • johaugum  10 hours ago

    Source?

    • himata4113  9 hours ago

      https://help.kagi.com/kagi/ai/llm-benchmark.html

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities