Comment by nutrientharvest
6 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
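A rough sketch of the difference, with made-up numbers: the patch size and the size cap below are assumptions for illustration, not any particular model's values. The idea is that the tokenizer sees a similar-sized token grid either way, but a crop taken from the original pixels needs less shrinking to fit, so each token covers less of the scene.

```python
import math

PATCH = 16       # hypothetical patch size in pixels (assumption)
MAX_SIDE = 1024  # hypothetical cap on the longer side before tokenizing (assumption)

def downsample_factor(width, height, max_side=MAX_SIDE):
    """How much an image must be shrunk to fit under max_side (1.0 = no shrink)."""
    return max(1.0, max(width, height) / max_side)

def token_grid(width, height, max_side=MAX_SIDE, patch=PATCH):
    """Token grid (columns, rows) after shrinking the image and splitting it into patches."""
    f = downsample_factor(width, height, max_side)
    return math.ceil(width / f / patch), math.ceil(height / f / patch)

full = (4096, 3072)  # the whole original image
crop = (1024, 768)   # a region cut out of the original pixels

full_factor = downsample_factor(*full)
crop_factor = downsample_factor(*crop)
full_grid = token_grid(*full)
crop_grid = token_grid(*crop)

print("full image:", full_factor, full_grid)
print("crop:      ", crop_factor, crop_grid)
```

With these numbers both calls yield the same grid size, but the crop is tokenized at its original resolution while the full image is shrunk 4x per side first. Cropping the internal representation instead would only select a subset of the already-downsampled tokens, so no detail would be recovered.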