Comment by Shakahs
17 hours ago
If you are paying API rates (not using Max subscriptions) there's no reason to use Anthropic's API directly, the same models are hosted by both AWS and Google with better uptime than Anthropic.
17 hours ago
If you are paying API rates (not using Max subscriptions) there's no reason to use Anthropic's API directly, the same models are hosted by both AWS and Google with better uptime than Anthropic.
How do things like prompt caching etc play into that? Would I theoretically have a more stable harness backing my usage?
Im seriously over the current claude experience. After seemingly fixing my 4.6 usage by disabling adaptive thinking and moving to max effort, it seems that the release of 4.7 has broken that workflow and Im 99% certain that disabling adaptive thinking does nothing even on 4.6 now. Just egregious errors in 2 days this week after coming back from vacation.
AWS Bedrock supports prompt caching, just note that if you use the Converse API you need to set the cache points manually.
> Would I theoretically have a more stable harness backing my usage?
If you don’t mind an opinionated harness that asks for a pretty specific workflow, but one that works well, use OpenCode.
If you want to spread your wings and feel the sweet kiss of freedom, use Pi.
Im looking at moving to Pi and I like the minimal nature, but I disagree with a handful of decisions they make. So Id likely need to maintain a fork which is less than ideal.
7 replies →
pi for the win, i have my own ai extend it when i want more specific features. vibe coded in 20 minutes shift+tab like claude code to add permission control.
1 reply →
you can use claude code with these other providers
The enterprise tier is API pricing only.
https://support.claude.com/en/articles/9797531-what-is-the-e...
Enterprise adds IAM, logging, and analytics, all of which AWS provides for free or for metered usage without needing an enterprise plan.
They'll cut you a private offer for bedrock tokens but bedrock has a 32k output limit
I use bedrock with 1M context every day. Not sure this is right
3 replies →
isnt that an input limit from api gateway?