Comment by pzo

1 year ago

I'm wondering how Gemini can OCR a big image correctly and with good quality. They charge ~250 tokens for an image as input, always the same no matter the size of the image you send. 250 tokens is ~200 words. Will OCR work if you send a 4K image that has a lot of text in a small font? What if the page has more than 200 words? Is Google selling it at cost?
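
To make the arithmetic behind the question concrete, here is a rough back-of-the-envelope sketch. It uses only the two estimates above (a flat ~250 tokens per image, and ~200 words per 250 tokens); the 1,000-word page and the function names are hypothetical, picked purely for illustration, and none of this reflects how Gemini actually encodes or bills images.

```python
# Both figures are the rough estimates from the comment, not official numbers.
IMAGE_INPUT_TOKENS = 250          # flat per-image charge, per the comment
WORDS_PER_TOKEN = 200 / 250       # "250 tokens is ~200 words"


def words_for_tokens(tokens: float) -> float:
    """Approximate number of words that a given token count corresponds to."""
    return tokens * WORDS_PER_TOKEN


def tokens_for_words(words: float) -> float:
    """Approximate number of text tokens needed to spell out that many words."""
    return words / WORDS_PER_TOKEN


if __name__ == "__main__":
    page_words = 1000  # hypothetical dense page, chosen only for illustration
    needed = tokens_for_words(page_words)
    print(f"Flat image charge covers about {words_for_tokens(IMAGE_INPUT_TOKENS):.0f} words")
    print(f"A {page_words}-word page would need about {needed:.0f} text tokens")
    print(f"That is about {needed / IMAGE_INPUT_TOKENS:.1f}x the flat image charge")
```

Under these assumptions, any page with more than roughly 200 words carries more text than the flat image charge would pay for as plain text tokens, which is the gap the question is pointing at.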