← Back to context

Comment by thewebguyd

5 hours ago

The difference is in licensing. Those platforms are protecting (or rather, monetizing) a database of human authored assets which those humans have given them a license to exploit.

Anthropic (and others) are trying to protect a stream of uncopyrightable, public-domain machine outputs.

I don't see how that's relevant. They have a license to redistribute my comments, but that's the extent of their legal rights with respect to my work. They're not my agent or my publisher. Moreover I don't have any say in the matter. If I'm pro AI scraping, I can't tell them "yeah it's fine to scrape my comment, don't put up any captcha walls". Finally, what if I dedicate my comments to the public domain? Does that mean they're in the wrong to put up scraping walls?

  • The license goes beyond redistribution. You are granting a sublicensable and transferable right to your content, giving the platform the legal authority to sell or license it (or to not license it) to AI scrapers and other entities. The platform's right to block said scrapers comes from posession rights.

    Its like if you made a painting and put it in a museum. You still technically own the copyright, but the museum owns the building. They can lock the door, charge admissino, kick out anyone they want, prevent anyone they want from seeing it, etc. You licensing it to them makes it their private property to do with what they wish.

    > I can't tell them "yeah it's fine to scrape my comment, don't put up any captcha walls".

    Correct, because you signed away that control.

    > what if I dedicate my comments to the public domain?

    That means you forfeit copyright, but you cannot waive the platform's rights regarding their servers.

    But, because you still retain copyright (or in the case that its public domain), you can and are welcome to submit it to AI companies yourself. Just because Reddit may not allow a scraper, that doesn't remove my right as the copyright holder to re-submit my comment to another platform that does allow the scraper.

    The difference with Anthropic/LLM output is that there are zero intellectual property rights over the outputs once they leave the API endpoint.

    • >The license goes beyond redistribution. You are granting a sublicensable and transferable right to your content, giving the platform the legal authority to sell or license it (or to not license it) to AI scrapers and other entities. The platform's right to block said scrapers comes from posession rights.

      They don't need to sublicense it because the license was already granted by you. Stackoverflow comments are licensed under creative commons, which means you don't need to seek a license from stackoverflow to use it. It's same if you found some random MIT licensed repo on github. It's not github granting you a sublicense, it's coming from the original author.

      >You still technically own the copyright, but the museum owns the building. They can lock the door, charge admissino, kick out anyone they want, prevent anyone they want from seeing it, etc.

      And Anthropic can't decide who gets to use their service, and for what purpose?

      4 replies →