Comment by measurablefunc
11 days ago
Just to make sure I got this right. They serve millions of requests a day & somehow catastrophic error accumulation is what is causing the 10% degradation & no one at Anthropic is noticing it. Is that the theory?
FYI something in that region happened last august/September. Some inference bug triggered worse performance on TPUs vs GPU.