Comment by denysvitali
19 hours ago
OpenAI (Codex) keeps on resetting the usage limits each time they fuck up...
I have yet to see Anthropic doing the same. Sorry but this whole thing seems to be quite on purpose.
Can you clearly state what they messed up?
Suddenly burning up the quota ~4x faster than usual is not a mess up in your opinion?
It is not inherently their fault, though, because usage is controlled by both the user and the harness's behavior. So I was asking specifically what about the harness was messed up; can you provide that info?
You cannot reset usage across millions of users based off these AI slop reports.
Not the parent, but I can guess, having watched mostly from the sidelines.
They introduced a 1M-context model semi-transparently without realizing the effects it would have, then refused to "make it right" for the customer, which is a trait most people expect from a business when they spend money on it, especially in the US, and especially when the money spent is often in the thousands of dollars.
Unless Anthropic has some secret sauce, I refuse to believe that their models perform anywhere near as well at >300k context sizes as they do at 100k. People don't realize it, but even a small drop in success rate becomes very noticeable if you're used to having near 100%, i.e. 99% -> 95% is more noticeable than 55% -> 50%.
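The "99% -> 95% is more noticeable" claim has a simple arithmetic basis: what the user experiences is failures, and it's the multiplicative growth in the failure rate that matters. A quick sketch (function name is illustrative):

```python
def failure_growth(before: float, after: float) -> float:
    """Ratio of failure rates (after / before) implied by a drop in success rate."""
    return (1 - after) / (1 - before)

print(failure_growth(0.99, 0.95))  # failures go from 1% to 5%: 5x as many
print(failure_growth(0.55, 0.50))  # failures go from 45% to 50%: only ~1.11x
```

So the same 5-point drop produces roughly five times as many failures at the high end, but barely registers at the low end.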
I got my first Claude sub last month (it expires in 4 days) and I've used it on some biggish projects with opencode. It went from compacting after 5-10 questions to just expanding the context window. I personally notice it deteriorating somewhere between 200-300k tokens, and I either fork a previous context or start a new one after that, because at that size even compacting seems to generate subpar summaries. It currently no longer works with opencode, so I can't attest to how well it worked over the past week or so.
If the 1M-model introduction is at fault for this mass user perception that the models are getting worse, then it's Anthropic's fault for introducing confusion into the ecosystem. Even if there were zero problems introduced and the 1M model was perfect, if your response when users complain is to blame it on the user, then don't expect the user to be happy. Nobody wants to hear "you're holding it wrong", but it seems that Anthropic is trying to be the Apple of LLMs in all the wrong ways as well.
I still love Claude and have nothing but a ton of respect for Boris and the team building such a phenomenal product.
That said, I feel that things started to feel a bit off usage-wise after the introduction of 1M context.
I'd personally be happy to disable it and go back to auto-compacting because that seems to have been the happy medium.
Especially since Codex faced the same issue but the team decided to explicitly default to only ~200k context to avoid surprises and degradation for users.
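For what it's worth, the "auto-compacting as the happy medium" behavior the comments describe boils down to capping the effective context and summarizing older turns once the cap is hit. A minimal sketch of that decision; the names, the 200k threshold (mirroring the ~200k default mentioned above), and the summarization stub are all hypothetical, not Claude Code's actual logic:

```python
COMPACT_THRESHOLD = 200_000  # assumed token cap, not the real product default

def should_compact(history_tokens: int, threshold: int = COMPACT_THRESHOLD) -> bool:
    """Decide whether the conversation has outgrown the capped context."""
    return history_tokens >= threshold

def compact(messages: list[str], keep_recent: int = 5) -> list[str]:
    """Replace all but the most recent turns with a single summary placeholder."""
    summary = f"[summary of {len(messages) - keep_recent} earlier turns]"
    return [summary] + messages[-keep_recent:]
```

The trade-off the thread is circling: a cap like this bounds both cost and the quality degradation users report at very long contexts, at the price of occasionally lossy summaries.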
Different users do seem to be encountering problems or not based on their behavior, but for a rapidly-evolving tool with new and unclear footguns, I wouldn't characterize that as user error.
For example, I don't pull in tons of third-party skills, preferring to have a small list of ones I write and update myself, but it's not at all obvious to me that pulling in a big list of third-party skills (like I know a lot of people do with superpowers, gstack, etc...) would cause quota or cache miss issues, and if that's causing problems, I'd call that more of a UX footgun than user error. Same with the 1M context window being a heavily-touted feature that's apparently not something you want to actually take advantage of...
My colleagues and I have faced the same issues over the last month or so.
With a new version of Claude Code pretty much every day, constant changes to their usage rules (2x outside of peak hours, temporarily 2x for a few weeks, ...), hidden usage decisions (past 256k it looks like your usage consumes your limits faster), and model degradation (Opus 4.6 is now worse than Opus 4.5, as many have reported), I fail to see how it can be a user error.
The only user error I see here is still trusting Anthropic to be on the good side tbh.
If you need to hear it from someone else: https://www.youtube.com/watch?v=stZr6U_7S90
> past 256k it looks like your usage consumes your limits faster
This is false. My guess is what is happening is #1 above, where restarting a stale session causes a 256k cache miss.
That said, I hear the frustration. We are actively working on improving rate limit predictability and visibility into token usage.
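The cache-miss explanation above is worth making concrete. If prompt-cache reads are billed at a fraction of the full input price (Anthropic's docs describe roughly a 0.1x rate for cache hits; treat the exact multiplier here as an assumption), then restarting a stale 256k-token session re-bills the entire prefix at full rate, which would look exactly like usage "suddenly" burning faster:

```python
CACHE_READ_MULTIPLIER = 0.1  # assumed discount for cached input tokens

def effective_input_tokens(context_tokens: int, cache_hit: bool) -> float:
    """Full-price-equivalent input tokens charged for one request."""
    return context_tokens * (CACHE_READ_MULTIPLIER if cache_hit else 1.0)

warm = effective_input_tokens(256_000, cache_hit=True)   # ~25,600 equivalent
cold = effective_input_tokens(256_000, cache_hit=False)  # 256,000 equivalent
print(cold / warm)  # one cold restart costs ~10x a warm-cache turn
```

Under this model, a user who restarts long sessions a few times a day could plausibly see multi-x quota consumption without any change to the billing rules themselves.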
Why did it suddenly become an issue, despite prompt caching behavior being unchanged?
PEBKAC: Problem Exists Between Keyboard And Chair
Yes, same here. I've used CC almost constantly every day for months across personal and work Max/Team accounts, as well as directly via API on Google Vertex. I have hardly ever noticed an issue (aside from occasional outages/capacity issues, for which I switch to API billing on Vertex). If anything, it works better than ever.
You know that people are not using the same resources, right? It's like 9 out of 10 computers get borked, you have the 1 that seems okay, and you essentially say "My computer works fine, therefore all computers work fine." Come on, dude.
Money money money money