← Back to context

Comment by handoflixue

14 hours ago

I mean, there's an obvious difference between "distributing copies" (which is what the law was designed to prevent) and "training an LLM". We already managed "banning LLM output that contains copyrighted text" - it's much easier to just pirate a copy of the text. So I think the copyright lawyers will continue to have work as long as human written texts are worth buying.

> I mean, there's an obvious difference between "distributing copies" (which is what the law was designed to prevent) and "training an LLM".

What's the difference between me/you downloading an mp3 through torrents for personal use (not distributing) while risking criminal punishment in most of the western world and BigCorp downloading petabytes worth of copyrighted works "to train an LLM" and resell it?

Can me/you do the same, when police comes to mine/your door?

"Dear police, don't lock me up - I was just going to train an LLM!"

  • Well, uh, the BigCorps already went to court and paid that cost and aren't doing it anymore? Whereas you and I are apparently still pirating MP3s and probably haven't ever been to court?