Comment by nutrientharvest
9 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
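A rough sketch of the distinction, not the actual model's pipeline: `tokenize` and `crop` here are hypothetical stand-ins, and the "tokenizer" is just average pooling onto a fixed 4x4 token grid. The point is that re-tokenizing a crop of the original pixels with the same token budget makes each token cover fewer source pixels, whereas slicing the existing tokens would add no new detail.

```python
def tokenize(image, grid=4):
    """Hypothetical tokenizer: average-pool the image onto a grid x grid set of tokens.

    Returns the tokens plus the (height, width) in source pixels that each token covers.
    """
    h, w = len(image), len(image[0])
    assert h % grid == 0 and w % grid == 0, "sketch assumes evenly divisible sizes"
    ph, pw = h // grid, w // grid  # downsampling factor per axis
    tokens = []
    for gy in range(grid):
        for gx in range(grid):
            block = [
                image[y][x]
                for y in range(gy * ph, (gy + 1) * ph)
                for x in range(gx * pw, (gx + 1) * pw)
            ]
            tokens.append(sum(block) / len(block))
    return tokens, (ph, pw)


def crop(image, top, left, height, width):
    """Crop the ORIGINAL pixels, not the token grid."""
    return [row[left:left + width] for row in image[top:top + height]]


# Toy 16x16 "image" where each pixel value encodes its position.
image = [[y * 16 + x for x in range(16)] for y in range(16)]

# Full view: 16 tokens, each averaging a 4x4 pixel block.
full_tokens, full_coverage = tokenize(image)

# Zoomed view: crop the top-left 8x8 pixels, then tokenize again.
# Same 16-token budget, but each token now averages only a 2x2 block.
zoom_tokens, zoom_coverage = tokenize(crop(image, 0, 0, 8, 8))

# Cropping the internal representation instead would just select existing
# tokens, which still carry 4x4-averaged information.
cropped_representation = full_tokens[:2] + full_tokens[4:6]
```

Here `zoom_coverage` comes out smaller than `full_coverage`, which is the "less downsampling" part; `cropped_representation` is only a subset of tokens the model already had.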