Comment by nutrientharvest
8 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
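A rough sketch of the difference, in case it helps. The numbers and function names here are made up for illustration (a hypothetical encoder that caps input at 768 px per side and uses 16 px patches), not taken from any particular model:

```python
import math

# Hypothetical encoder limits, chosen only for illustration.
MAX_SIDE = 768   # largest side length the encoder accepts, in pixels
PATCH = 16       # each token covers a PATCH x PATCH block of resized pixels

def tokenize_shape(width, height, max_side=MAX_SIDE, patch=PATCH):
    """Return (downsample_factor, tokens_wide, tokens_high) for an image."""
    # Shrink only if the image is too large; never upscale.
    factor = max(1.0, max(width, height) / max_side)
    w, h = round(width / factor), round(height / factor)
    return factor, math.ceil(w / patch), math.ceil(h / patch)

def crop_then_tokenize(box, max_side=MAX_SIDE, patch=PATCH):
    """Crop the ORIGINAL pixels to box=(left, top, right, bottom), then
    tokenize the crop from scratch. No existing tokens are reused."""
    left, top, right, bottom = box
    return tokenize_shape(right - left, bottom - top, max_side, patch)

# Full 3072x2304 image versus a 1536x1152 region cut from its pixels.
full = tokenize_shape(3072, 2304)
crop = crop_then_tokenize((1024, 768, 2560, 1920))
print(full, crop)
```

Under these assumptions both calls produce the same token grid, but the crop is downsampled by half as much, so each token covers a smaller patch of the original pixels. Cropping the internal representation would instead keep a subset of the already-downsampled tokens and gain no detail.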