Comment by nutrientharvest
2 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
2 months ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.
No comments yet
Contribute on Hacker News ↗