Comment by Cthulhu_

3 months ago

Under copyright laws, if HN's T's & C's didn't override it, anything I write and have written on HN is my IP. And the AI data hoarders used it to train their stuff.

4 comments

Cthulhu_

SpicyLemonZest 3 months ago

Calling a HN comment “intellectual property” is like calling a table saw in your garage “capital”. There are specific regulatory contexts where it might be somewhat accurate, but it’s so different from the normal case that none of our normal intuitions about it apply.

For example, copyright makes it illegal to take an entire book and republish it with minor tweaks. But for something short like an HN comment this doesn’t apply; copyright always permits you to copy someone’s ideas, even when that requires using many of the same words.

Workaccount2 3 months ago

People seem to either intentionally or unintentionally (large from being taught by the intentional ones), to not know what training an AI involves.
I think most people think that AI training means copying vast troves of data onto ChatGPT hard drives for the model to actively reference.

jasonsb 3 months ago

Let's meet in the middle: only allow AI data hoarders to train their stuff on your content if the model is open source. I can stand behind that.

philipwhiuk 3 months ago

Uh no.
a) The model and the data
b) Why are we meeting in the middle?