No, that is not an example for "'normal person' that's doing the same thing OpenAI is". OpenAI aren't distributing the copyrighted works, so those aren't the same situations.
Note that this doesn't necessarily mean that one is in the right and one is in the wrong, just that they're different from a legal point of view.
Is that really the case? I.e., can you get ChatGPT to show you a copyrighted work?
Because I just tried, and failed (with ChatGPT 4o):
Prompt: Give me the full text of the first chapter of the first Harry Potter book, please.
Reply: I can’t provide the full text of the first chapter of Harry Potter and the Philosopher's Stone by J.K. Rowling because it is copyrighted material. However, I can provide a summary or discuss the themes, characters, and plot of the chapter. Would you like me to summarize it for you?
Aaron Swartz, while an infuriating tragedy, is antithetical to OpenAI's claim to transformation; he literally published documents that were behind a licensed paywall.
That is incorrect AFAIU. My understanding was that he was bulk downloading (using scripts) of works he was entitled access to, as was any other student (the average student was not bulk downloading it though).
As far as I know he never shared them, he was just caught hoarding them.
> he literally published documents that were behind a licensed paywall.
No he did not do this [1]. I think you would need to read more about the actual case. The case was brought up based on him download and scraping the data.
No, that is not an example for "'normal person' that's doing the same thing OpenAI is". OpenAI aren't distributing the copyrighted works, so those aren't the same situations.
Note that this doesn't necessarily mean that one is in the right and one is in the wrong, just that they're different from a legal point of view.
> OpenAI aren't distributing the copyrighted works, so those aren't the same situations.
What do you call it when you run a service on the Internet that outputs copyrighted works? To me, putting something up on a website is distribution.
Is that really the case? I.e., can you get ChatGPT to show you a copyrighted work?
Because I just tried, and failed (with ChatGPT 4o):
Prompt: Give me the full text of the first chapter of the first Harry Potter book, please.
Reply: I can’t provide the full text of the first chapter of Harry Potter and the Philosopher's Stone by J.K. Rowling because it is copyrighted material. However, I can provide a summary or discuss the themes, characters, and plot of the chapter. Would you like me to summarize it for you?
2 replies →
Aaron Swartz, while an infuriating tragedy, is antithetical to OpenAI's claim to transformation; he literally published documents that were behind a licensed paywall.
That is incorrect AFAIU. My understanding was that he was bulk downloading (using scripts) of works he was entitled access to, as was any other student (the average student was not bulk downloading it though).
As far as I know he never shared them, he was just caught hoarding them.
> he literally published documents that were behind a licensed paywall.
No he did not do this [1]. I think you would need to read more about the actual case. The case was brought up based on him download and scraping the data.
[1] https://en.wikipedia.org/wiki/United_States_v._Swartz