Comment by tacotime
19 days ago
I could imagine a vision-enabled transformer model being useful to create a customizable “reading mode”, that adjusts page layout based on things like user prefs, monitor/window size, ad recognition, visual detail of images, information density of the text, etc.
Maybe in an alternate universe where every user-agent enabled browser had this type of thing enabled by default, most companies would skip site design all together and just publish raw ad copy, info, and images.
No comments yet
Contribute on Hacker News ↗