Comment by wonjunhwang
10 days ago
Do LLM Agents really understand Linux?.
I am working on world model for computer systems. I am designing experiment and benchmark for LLM Agents to see if they possess understanding of "Linux". World model for computer systems will be crucial next step for computer use agents to reliably plan their actions over long horizon.
Links to draft: https://open.substack.com/pub/disastermanagementtechnologies...
No comments yet
Contribute on Hacker News ↗