Comment by lowsong

2 days ago

> YouTube gaining an “ask a question about this video” button, this is a perfect example of how to sprinkle AI on an existing product.

> Extremely slow, but the obvious incremental addition of Gemini to Docs is another example.

These are great examples of insulting and invasive introductions of LLMs into already functional workflows. These are anti-features.

The Ask button in YouTube is a game changer for the use case of "what timestamp in this hour-long video talks about topic x?".

What's the existing functional workflow for that? Downloading the captions and querying with a local LLM or a very fuzzy keyword search?

  • Perhaps this is a difference in terminology, but in no way do you need a LLM for fuzzy search. Semantic search, fuzzy keyword search, and text to speech have existed for years and predate the technology for an LLM. In your use-case, do you really need a chatbot to "ask the video" about this, wouldn't a "search in video" function that does the same thing be better?

    • Yeah, I was using the term "fuzzy" a bit loosely/incorrectly. I want to be able to say "where's the joke about British people?" even if the transcript only uses related but distinct terms like "Scotsman". I know you can get close to that with Transformers sans LLM, but LLMs are kinda like regex where they're often a non optimal but adequate way to solve a problem quickly.