Comment by johnys
17 days ago
Curious: what are people using as the best open source and locally hosted versions to have agents browse the web?
17 days ago
Curious: what are people using as the best open source and locally hosted versions to have agents browse the web?
Playwright, same thing we use when doing non-ai automation
Fun fact, ai can use the same tools you do, we don't have to reinvent everything and slap a "built for ai" label on it
We love these tools but they were designed for testing, not for automation. They are too low-level to be used as they are by AI.
For example, the playwright MCP is very unreliable and inefficient to use. To mention a few issues, it does not correctly pierce through the different frames and does not handle the variety of edge cases that exist on the web. This means that it can't click on the button it needs to click on. Also, because it lacks control over the context design, it cannot optimize for contextual operations and your LLM trace gets polluted with incredible amount of useless tokens. This increases cost, task complexity for the LLM, and latency
On top of that, these tools rely on the accessibility tree, which is just not a viable approach for a huge number of websites
again (see other comment), you are not listening to users and asking questions, you are telling them they are wrong
You describe problems I don't have. I'm happy with Playwright and other scraping tools. Certainly not frustrated enough to pay to send my data to a 3rd party
2 replies →
Chrome devtool mcp, this is the most reliable way to give claude to control my browser.