Comment by lorddumpy

1 month ago

With vision models (SOTA models like Gemini and ChatGPT can do this), you can take a picture/screenshot of the button layout, upload it, and have it work from that. Feeding it current documentation (eg a pdf of a user manual) helps too.

Referencing outdated documentation or straight up hallucinating answers is still an issue. It is getting better with each model release though