Comment by TeMPOraL
3 days ago
Or maybe ask yourself why are you doing open source in the first place?
AI training on your code is success if you care about your code being genuinely helpful to others. It's a problem only if you're trying to make money or personal reputation, and abusing open source as a vector for it.
Just to add to this. Open source for money has been a dead end for a long time, except for the (increasingly rare) situations where people accidentally convert their open source _contributions_ into employment (I accidentally did this back in 2015). Open source for recognition/reputation makes a bit more sense, but it is also becoming increasingly rare. LLMs are super-charging the extinction, but this was also observable in 2021, when I wrote this: https://news.ycombinator.com/item?id=29714929 .
Even before LLMs, I have seen people (shamelessly) re-implement code from open source project A into open source project B, without attribution (IIRC, a GPL C++ project [no hate, I use C++ too these days] basically copied the very distinctive AVL Tree implementation of a CDDL C project -- this is a licensing violation _and_ plagiarism, and it effectively writes the C project out of history. When asked about this, various colleagues[1], just shrugged their shoulders, and went on about their lives.). LLMs now make this behavior undetectable _and_ scalable.
If we want strong copyright protections for open source, we may need to start writing _literate_ programs (i.e. the Knuthian paradigm, which I am quite fond of). But that probably will not happen, because most programmers are bad at writing (because they hate it, and would rather outsource it to an LLM). The more likely alternative, is that people will just stop writing open source code (I basically stopped publishing my repos when the phrase "Big Tech" became common in 2018; Amazon in particular would create hosted versions of projects without contributing anything back -- if the authors were lucky they would be given the magnanimous opportunity to labor at Amazon, which is like inventing dynamite and being granted the privilege of laboring in the mines).
The fact is, if we want recognition, we need to sing each others' praises, instead hoping that someone will look at a version control history. We need to be story-tellers, historians, and archivists. Where is my generation's Jargon File?
[1]: Not co-worker, which is someone who shares an employer, but colleague, which is someone who shares a profession.
That's a big reason why FOSS is going to crumble. If AI succeeds and decimates the tech labor industry, people won't have the luxury to "code for fun". Life isn't a bunch of comfy programmers working on stuff in their spare time anymore.
We already see a component of this with art, but art actually needs to be displayed unlike code to show its vslue. So they adapt. Tools to keep the machine from training on their work, or more movements into work that is much harder to train on (a 2d image of a 3d model does the job and the model can be shared off the internet). Programming will follow a similar course; the remaining few become mercenaries and need to protect their IP themselves.
> abusing open source as a vector for it
It seems like you are very against open source not being an altruistic endeavor. Or that you should not make money with an open source project. I would like to challenge you on that.
Would you say that the Linux Foundation is a net positive on the software ecosystem? How about big open source projects like curl or QGIS? How about mattermost or nextcloud? All of these have full-time employees working on them (The Linux Foundation generated almost 300 million USD of gross revenue in 2024).
I would argue that good monetization is paramount to a healthy open source ecosystem.
Both can be true:
- AI training on your code is success
- AI undermining the sustainability of your project by reducing funding is an issue
Also, I see you haven't changed your mind much on the training LLMs being one of the major benefits of open source since the last discussion we had ;) https://news.ycombinator.com/item?id=44155746#44156782
> Would you say that the Linux Foundation is a net positive on the software ecosystem?
On the software ecosystem? Maybe. For society? Now that's a difficult question, and I haven't really made up my mind on that yet.
On the one hand, OSS in general is a great win in terms of innovation. On the other hand, it pretty much destroyed the ability to make money on software directly in a honest way - exchanging money for providing value. This, in turn, became a major driver of turning everything into subscription, and for the surveillance economy.
I'd like to contribute to open source to help and empower people.
Your environmental mission feels moot if you do a lot to help with greenhouse emissions and then proceed to also dump all the waste in the ocean. Your mission is "accomplished" by your hands and you are recognized as a champion. but morally you feel like you took a step back and became the evil you sought to address.
Now apply that mentality to someone in FOSS who sees their work go into a trillion dollar industry seeking to remove labor as a concept from it, and the rest of society. Even of you are independently wealthy and never needed to make money to get by, you feel like your mission has failed. Even if people give you a pat on your back for the software you made.
>Or maybe ask yourself why are you doing open source in the first place?
I, like everyone started work on OSS because it's fun. The problem comes when your project gets popular - either you try to make it your job or you abandon the project, because at a certain point it becomes like an unpaid job with really demanding customers.
That makes sense but doesn't answer "why do open source" though. In fact, it only shows that there is little incentive to pursue a serious open-source project and just stick to hobby projects while ackowledging it'll never go anywhere. I struggle to answer that myself.
Lol, I never in a million years expected my project to get 100 users never mind the tens of thousands it now has. Sometimes others make the decision for you ;) it's still your baby though.
This is fair, but it restricts the number of open source contributors massively if that's the criteria.
Let's say I'm a company and I have this library I've developed at enormous expense. The company is happy to share it so long as competitor X a big multi-national corp doesn't get it for free. Is it better that it gets open sourced as GPL3 with commercial use on application, or better it stays closed source?
Let's say I'm a developer trying to get a job, I pour months of my time into a new project that's open source, of course I want that attached to my reputation, because that's a part of how I get my new job.
The number of people who can code for free and are happy to not attach thier name and to watch as big AI labs profit off their work while they can't afford rent is super close to 0.