Comment by NorwegianDude
1 day ago
I guess this might be a decent way to farm data? Those with larger OSS projects usually have better code quality, making it easier to create a dataset with maybe higher quality for training. Considering how often people leak data to the LLM services it's also an amazing way to get backdoors into many OSS projects.
No comments yet
Contribute on Hacker News ↗