Comment by netsharc

19 hours ago

Cool, almost a "build your own ecosystem" Siri/voice assistant...

The 2 line explanation is sort of vague, but from the code I surmise the Python "app" watches a webpage (configured as https://www.doubao.com/chat/624642496948226) and every time the DOM there is modified, it sees that new prompt, looks for the word "note", and if so, creates an Obsidian note with the transcription of the prompt.

    CHAT_URL: str = "https://www.doubao.com/chat/624642496948226"
    [...]
    await page.goto(CHAT_URL, timeout=120000, wait_until="domcontentloaded")

Alexa has "build your own app", this seems less convoluted.

Google Gemini also records my prompts (under My Activity), I guess with an always-listening Gemini Assistant and a similar Python script that monitors https://myactivity.google.com/product/gemini (I'm guessing this page needs a hard reload to update), it's possible to build something similar.

I don't have my phone to respond to "Hey Google", but I have an alarm clock that has that (not Gemini, but Google Assistant), and I often tell it to "Remind me about [...] in x hours". I just tested the phrase "Add a note about...", and it added a note in Google Keep. But with an analog Python script one could trigger many more things.