Comment by tsimionescu
2 months ago
I think it's quite obvious that any textual description that had any hope of being converted to video in this way would be entirely useless for a human mind. It wouldn't say something like "the fastener is on the under side of the chair about 3/5s of the way", it would say somerhing like "there is a square-shaped object in view 5cm from the top of the view and 120cm from the right; the object is 2cm x 2.2cm, color 0x7F325A".
> entirely useless for a human mind.
You may be right, although, of course, current LLMs often do the right thing with "about 3/5ths of the way."
OTOH, as someone who has done CAD and schematic drawings by programming, I am not 100% convinced about the inevitability of unreadability.
In any case, though, the bar is not really whether any human can interpret the text, but whether the average human will interpret the text or video faster, and here, to your point, yes, the video probably still wins handily.
The closest analogy I can think of is animated math gifs like these:
https://en.wikipedia.org/wiki/User:LucasVB/Gallery
Which can be a huge aid in learning.
But this leads to another conundrum. Where do animated GIFs end and video begin? Because I could see a simple line-drawing style animated GIF being sufficient for most purposes.