Comment by rootnod3

6 months ago

Sorry what?

"You can't measure my Cloud Service's performance correctly if my servers are overloaded"?

"Oh, you just measured me at bad times each day. On only 50 different queries."

So, what does that mean? I have to pick specific times during the day for Claude to code better?

Does Claude Code have office hours basically?

5 comments

rootnod3

johnsmith1840 6 months ago

This has been happening for years. Tgere's a great paper from microsoft on Deepspeed AI inference.

Basically the paper showed methods for how to handle heavy traffic load by changing model requirements or routing to different ones. This was awhile ago and I'm sure it's massively more advanced now.

Also why some of AI's best work for me is early morning and weekends! So yes, the best time to code with modern LLM stacks is when nobody else is. It's also possibly why we go through phases of "they neutered the model" some time after a new release.

kuboble 6 months ago

I wonder if my great experience with claude are partly due to the fact that my working hours don't overlap with the US west coast

swyx 6 months ago

chill out, ofir does not work for anthropic. he's just saying there's inherent variability in LLMs and you need to at least 30x the samples that OP is doing in order to make any form of statistically significant conclusions.

copilot_king 6 months ago

[flagged]

rootnod3 6 months ago

Verily, my vichyssoise of verbiage veers most verbose, so let me run that thing out of tokens fast.