Comment by hk__2
1 day ago
Do I have to publish my book for free because I got inspiration from 100's of other books I read during my life?
1 day ago
Do I have to publish my book for free because I got inspiration from 100's of other books I read during my life?
Humans are punished for plagiarism all the time. Myriad examples exist of students being disenrolled from college, professionals being fired, and personal reputations tarnished forever.
When a LLM is trained on copyright works and regurgitates these works verbatim without consent or compensation, and then sells the result for profit, there is currently no negative impact for the company selling the LLM service.
Issue to me is that I or someone else bought those books. Or in case of local libraries the authors got money for my borrowing copy.
And I can not copy paste myself to discuss with thousands or millions of users at time.
To me clear solution is to make some large payment to each author of material used in traing per training of model say 10k to 100k range.
false equivalence because machines are not human beings
a lossy compression algorithm is not "inspired" when it is fed copyrighted input
> lossy compression algorithm is not "inspired" when it is fed copyrighted input
That's exactly what happens when you read. Copyrighted input fed straight into your brain, a lossy storage and processing machine.
I think it’s a pretty easy principle that machines are not people and people learning should be treated differently than machines learning
4 replies →
Following your analogy, parrots should be considered human.
If you are plagiarizing, “for free” doesn’t even save you.
If your book reproduces something 95% verbatim, you won't even be able to publish it.
Exactly. We assess plagiarism by checking the output (the book), not the input (how many book I’ve read before). It’s not an issue to train LLM on copyrighted resources if their output is randomized enough.