
Comment by eschluntz

5 months ago

We are definitely aware of this (and working on it for the web UI), and that's why Claude Code goes directly through the API!

I'm sure many of us would gladly pay more to get 3-5x the limit.

And I'm also sure that you're working on it, but some kind of auto-summarization of facts to reduce the context in order to avoid penalizing long threads would be sweet.

I don't know if your internal users are dogfooding the product that has user limits, so you may not have had this feedback - it makes me irritable/stressed to know that I'm running up close to the limit without having gotten to the bottom of a bug. I don't think a stress response in your users is a desirable thing :).

  • This is the main point I always want to communicate to the teams building foundation models.

    A lot of people just want the ability to pay more in order to get more.

    I would gladly pay 10x more to get relatively modest increases in performance. That is how important the intelligence is.

    • As a growth company, they would likely prefer a larger number of users, even with occasional rate limits, over a smaller pool of power users.

      As long as capacity is an issue, you can't have both.


The problem with the API is that it, as it says in the documentation, could cost $100/hr.

I would pay $50/mo or something to have reasonable use of Claude Code in a limited way (though not as limited as through the web UI), but all of these coding tools seem to work only with the API and are therefore either too expensive or too limited.

  • > The problem with the API is that it, as it says in the documentation, could cost $100/hr.

    I've used https://github.com/cline/cline to get a similar workflow to their Claude Code demo, and yes it's amazing how quickly the token counts add up. Claude seems to have capacity issues so I'm guessing they decided to charge a premium for what they can serve up.

    +1 on the too expensive or too limited sentiment. I subscribed to Claude for quite a while but got frustrated because the few times I used it heavily, I got stuck on the rate limits.

    I could stomach a $20-$50 subscription for something like 3.7 that I could use a lot when coding, and not worry about hitting limits (or, I suspect, being pushed onto a quantized/smaller model when I use it too much).

    • Claude Code does caching well fwiw. Looking at my costs after a few code sessions (totaling $6 or so), the vast majority is cache read, which is great to see. Without caching it'd be wildly more expensive.

      Like $5+ was cache read ($0.05 vs $3 per million tokens), so without caching it would have cost $300+
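
For anyone checking the math, here is a rough sketch of that estimate. The two prices are the figures quoted in the comment above and are taken as given, not as confirmed pricing. The function name `uncached_cost` is made up for illustration, and the sketch assumes every cached read would otherwise have been billed at the full input rate (it ignores cache writes and output tokens).

```python
def uncached_cost(cache_read_spend, cache_read_price, input_price):
    """Estimate what the cached reads would have cost at the full input price.

    cache_read_spend: dollars actually spent on cache reads.
    cache_read_price: price of a cache read (same unit as input_price).
    input_price: price of a normal, uncached input read.
    """
    if cache_read_price <= 0:
        raise ValueError("cache_read_price must be positive")
    # Same token volume, billed at the higher rate: scale by the price ratio.
    return cache_read_spend * input_price / cache_read_price


# Figures as quoted in the comment (assumed, not verified against a price list).
estimate = uncached_cost(cache_read_spend=5.00,
                         cache_read_price=0.05,
                         input_price=3.00)
print(f"Estimated cost without caching: ${estimate:,.0f}")
```

Only the ratio of the two prices matters here (60x with these numbers), so the result scales directly with whatever the real price ratio turns out to be.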