← Back to context

Comment by patrickaljord

2 days ago

my guess is it's just an electron app or chromium wrapper with an ollama wrapper to talk to it (there are plenty of free open source libs to control browsers).

We are a chromium "wrapper"

But we are much more performant than other libs (like playwright) which are written in JS, as we implement bunch of changes at chromium source code level -- for example, we are currently implementing a way to build enriched DOMtree required for agent interactions (click, input text, find element) directly at C++ level.

We also plan to expose those APIs to devs.

“Just” is a four-letter word :)

When someone in their infinite wisdom decides to refactor an api and deprecate the old one, it creates work for everyone downstream.

Maybe as an industry we can agree to do this every so often to keep the LLMs at bay for as long as possible. We can take a page out of the book of the maintainers of moviepy for shuffling their apis around, it definitely keeps everyone on their toes.