Comment by dbbk
15 hours ago
That sounds weird. Why does it need a video feed? The computer can already generate an accessibility tree, the same way Playwright uses one for webpages.
So that it can use GUIs and interfaces designed for humans. Think of a video editing program, for example.
Yes. GUIs expose an accessibility tree.
Not all of them do, and not all of the ones that do expose enough to be useful to the AI.
Even if they do (often not the case), the tree will be far from exhaustive and likely won’t reflect the structure of the application very well. Vision-based testing is often combined with accessibility-based testing.
I feel like a legion of blind computer users could attest to how bad accessibility is online. If you added AI agents to the users of accessibility features, you might even see a purposeful regression in the space.