Comment by wvenable
2 hours ago
> If I use my knowledge to produce code, under a specific license, then you take that code, and reproduce it without the license, you have broken the law.
Correct. But if read your code, produce a detailed specification of that code, and then give that code to another team (that has never seen your code) and they create a similar product then they haven't broken the law.
LLMs reproducing exact content from their training data is symptom of overfitting and is an error that needs correcting. Memorizing specific training data means that it is not generalizing enough.
No comments yet
Contribute on Hacker News ↗