Comment by hirako2000

17 hours ago

I faced the exact same problem with the API. It seems that it doesn't throttle early enough, and then it may accumulate the cool-off period, making it impossible to determine when to fire requests again.
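For what it's worth, here is a minimal sketch of the usual client-side workaround: retry on 429/503 with capped exponential backoff and jitter, and use a `Retry-After` header if one happens to be present. Whether the API actually sends that header is an assumption on my part, and `send`, `backoff_delay`, and `call_with_backoff` are hypothetical names, not part of any Google SDK.

```python
import random
import time
from typing import Callable, Mapping, Tuple

# Hypothetical response shape: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], str]

RETRYABLE = {429, 503}  # rate limited / service unavailable


def backoff_delay(attempt: int, headers: Mapping[str, str],
                  base: float = 1.0, cap: float = 60.0,
                  rng: Callable[[], float] = random.random) -> float:
    """Seconds to wait before the next try; `attempt` is 0-based."""
    retry_after = headers.get("Retry-After")
    if retry_after is not None:
        try:
            # Assumption: the server gives a delay in seconds.
            return min(cap, max(0.0, float(retry_after)))
        except ValueError:
            pass  # the HTTP-date form is not parsed in this sketch
    # "Full jitter": a random wait up to the capped exponential ceiling.
    return rng() * min(cap, base * 2 ** attempt)


def call_with_backoff(send: Callable[[], Response], max_attempts: int = 5,
                      sleep: Callable[[float], None] = time.sleep,
                      **delay_options) -> Tuple[int, str]:
    """Call `send` until it returns a non-retryable status or attempts run out."""
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status not in RETRYABLE:
            return status, body
        if attempt < max_attempts - 1:
            sleep(backoff_delay(attempt, headers, **delay_options))
    return status, body  # still failing after the last attempt
```

This doesn't fix the server-side behaviour, it only keeps the client from hammering the API while the cool-off period is unknown. The cap matters: without it, the waits keep doubling on every retry.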

Also, I noticed that Gemini (even Flash) has Google Search support, but only via the web UI or the native mobile app. Via the API, that would require a SERP tool via MCP or something of the sort, even with Gemini Pro.

Oh, and some models regularly face outages. 503s are not uncommon. There is no SLA page, no alerts, nothing whatsoever.

The reasoning feature is buggy: even when disabled, it sometimes triggers anyway.

It occurred to me the other day that Google probably has the best engineers, given how well Gemini performs, where it's coming from, and its context window, which is uniquely large compared to any other model. But it is likely operated by managers coming from AWS, where shipping half-baked, barely tested software was all it took to get a bonus.