Comment by simoncion
3 days ago
> I don't accept the premise that "training on" and "copying" are the same thing...
Nor do I. Training and copying are clearly different things... and if these tools had never emitted -verbatim- nontrivial chunks of the code they'd ingested, [0] I'd be much less concerned about them. But as it stands now, some-to-many of the companies that build and deploy these machines clearly didn't care to ensure that their machines simply wouldn't plagiarize.
I've a bit more commentary, related to whether or not what these companies are doing should be permitted, here. [1]
[0] Based on what I've seen, when it happens, it is often either with incorrect copyright and/or license notifications, or with none of the verbiage that the license of the copied code requires in non-trivial reproductions of that code.