Comment by energy123
2 months ago
Yeah, once it gets converted into tokens how does "zooming in" somehow increase information content?
2 months ago
Yeah, once it gets converted into tokens how does "zooming in" somehow increase information content?
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.