Comment by Philpax

4 days ago

Hate to be that guy, but which LLM was doing the generation? GPT-4 Turbo / Claude 3.x have not really let me down in generating ffmpeg commands - especially for basic requests - with most of their failures resulting from domain-specific vagaries that an expert would need to weigh in on m

GPT-4

  • Fair enough. If you remember what you were testing with, I'd love to try it again to see if things are better now.

    • You have a fair point. Some LLMs are better at some tasks, and prompts can make a difference no doubt.

      Perhaps at some point there will be a triage LLM to slurp up the problem and then decide which secondary LLM is most optimal for that query, and some tertiary LLMs that execute and evaluate it in a virtual machine, etc.

      Maybe someday

      3 replies →

Hate to be that guy, but which model works without fail for any task that ffmpeg can do?

  • I don't think there's a single human on or outside of this planet that can meet that requirement, but Claude has been pretty good to me. It's certainly a much better starting point than pouring over docs and SO posts.

    • In my experience you still get a lot of stuff that used to work or stuff that it just makes up.

  • I know I struggled on getting a good command to “simply” make the videos from my Z8 smaller (in file size).

    Usually the color was wrong and I don’t care enough to learn about colorspaces to figure out how to fix it and it’s utterly insane how difficult it is even with LLMs.

    Just reencode it as is but a little more lossy. Is that so hard?

  • This doesnt exist in reality so in one sense, you could challenge the relevance

    • I think in the non LLM world though you at least have the trail of documentation you can unwind once you're in a bind. I don't care for prompt-a-mole fighting.