Comment by azinman2

2 days ago

The demo buying toothpaste shows the difficulty of these tasks. Toothpaste itself was very underspecified, and it essentially randomly chose from a huge list. Some tasks may have past actions that could help guide, others won't have any to inform. Failure cases abound -- maybe the toothpaste you previously bought is no longer available. Then what? Ultimately how much time did this particular example save since you need to double check the result anyway? This is what doomed Alexa for the purchasing experience that Amazon assumed it would enable in the first place.

I think it'd be better to show more non-trivial examples where the time savings is clear, and the failure cases are minimized... or even better how it's going to recover from those failure cases. Do I get a bespoke UI for the specific problem? Talk to it via chat?

This whole world is non-trivial. Good luck!

3 comments

azinman2

felarof 2 days ago

Great points! For sure, the whole agentic browsers space is still super early.

We are also just getting started and trying to narrow down on a high-value niche use-case.

There are few repetitive, boring use-cases where time saving could be meaningful -- one example: Walmart 3rd-party sellers routinely (multiple times a day) keep checking prices of the competitor products to price their products appropriately. This could be easily automated with current agentic browsers.

byteknight 2 days ago
But in reality, would much more consistently be automated by a single playwright script.
- felarof 2 days ago
  
  True, there are plenty of libs already available to do such an automation if you are (or can hire a) dev.
  But for non-technical folks, agentic browsers seems like a good UX to build such and many more automations.