Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by mt_

9 months ago

You can just dump the youtube link video in Google AI studio and ask it to transcribe the video with speaker labels and even ask it it to add useful visual clues, because the model is multimodal for video too.

2 comments

mt_

Reply

MaxDPS  9 months ago

Can I ask what you mean by “useful visual clues”?

  • mt_  9 months ago

    What is the speaker showcasing in its slides, what is it's body language and so on.

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities