Comment by Rebelgecko
12 hours ago
The Ask button in YouTube is a game changer for the use case of "what timestamp in this hour-long video talks about topic x?".
What's the existing functional workflow for that? Downloading the captions and querying with a local LLM or a very fuzzy keyword search?
Perhaps this is a difference in terminology, but in no way do you need a LLM for fuzzy search. Semantic search, fuzzy keyword search, and text to speech have existed for years and predate the technology for an LLM. In your use-case, do you really need a chatbot to "ask the video" about this, wouldn't a "search in video" function that does the same thing be better?