Comment by piokoch 1 month ago It is not unlimited, being not careful with your context management, you hit the limits quickly. 2 comments piokoch Reply abhijat 1 month ago Isn't the context window the same for all plans, 200k? You would hit usage limits? billyjobob 1 month ago If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?
abhijat 1 month ago Isn't the context window the same for all plans, 200k? You would hit usage limits? billyjobob 1 month ago If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?
billyjobob 1 month ago If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?
Isn't the context window the same for all plans, 200k? You would hit usage limits?
If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?