← Back to context

Comment by Cthulhu_

8 hours ago

Under copyright laws, if HN's T's & C's didn't override it, anything I write and have written on HN is my IP. And the AI data hoarders used it to train their stuff.

Let's meet in the middle: only allow AI data hoarders to train their stuff on your content if the model is open source. I can stand behind that.

Calling a HN comment “intellectual property” is like calling a table saw in your garage “capital”. There are specific regulatory contexts where it might be somewhat accurate, but it’s so different from the normal case that none of our normal intuitions about it apply.

For example, copyright makes it illegal to take an entire book and republish it with minor tweaks. But for something short like an HN comment this doesn’t apply; copyright always permits you to copy someone’s ideas, even when that requires using many of the same words.

  • People seem to either intentionally or unintentionally (large from being taught by the intentional ones), to not know what training an AI involves.

    I think most people think that AI training means copying vast troves of data onto ChatGPT hard drives for the model to actively reference.