Comment by nutrientharvest
10 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
10 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
No comments yet
Contribute on Hacker News ↗