Comment by lorddumpy
15 hours ago
With vision models (SOTA models like Gemini and ChatGPT can do this), you can take a picture/screenshot of the button layout, upload it, and have it work from that. Feeding it current documentation (eg a pdf of a user manual) helps too.
Referencing outdated documentation or straight up hallucinating answers is still an issue. It is getting better with each model release though
No comments yet
Contribute on Hacker News ↗