Comment by piokoch, 2 months ago
It is not unlimited. If you are not careful with your context management, you hit the limits quickly.
abhijat, 2 months ago
Isn't the context window the same for all plans, 200k? Do you mean you would hit usage limits?
billyjobob, 2 months ago
If you send the full 200k tokens on every request, you will get very few requests before you hit the token limit. Caching reduces the number sent, but I don't know how much they can cache.
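To put rough numbers on billyjobob's point, here is a minimal sketch of the arithmetic. The 200k figure comes from the thread; the token budget and the smaller request size are made-up values for illustration only, not real plan limits, and `requests_before_limit` is a hypothetical helper. Caching is left out because the thread does not say how cached tokens are counted.

```python
def requests_before_limit(token_budget: int, tokens_per_request: int) -> int:
    """Return how many whole requests fit inside a token budget."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    return token_budget // tokens_per_request


# Hypothetical budget, chosen only to make the arithmetic easy to follow.
TOKEN_BUDGET = 2_000_000
FULL_CONTEXT = 200_000     # the 200k context window mentioned in the thread
TRIMMED_CONTEXT = 20_000   # assumed size of a carefully managed context

full_requests = requests_before_limit(TOKEN_BUDGET, FULL_CONTEXT)
trimmed_requests = requests_before_limit(TOKEN_BUDGET, TRIMMED_CONTEXT)

print(f"Full context each time: {full_requests} requests")
print(f"Trimmed context:        {trimmed_requests} requests")
```

Under these assumed numbers, sending the full window every time allows 10 requests, while a context one tenth the size allows 100. Whatever the actual limit is, the number of requests scales inversely with the tokens sent per request.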