Comment by larodi
14 hours ago
after 25 years wikipedia showed what it truly was created for, by selling the content for training. otherwise - okay, this was a cool project, perhaps we need better. like federated, crypto-signed articles that once collected together, @atproto style, produce the article with notable changes to it.
Their enterprise offering is more for fresh retrieval than training. For training, you can just download the free database dump — one you would inadvertently end up recreating if you were to use their enterprise APIs in a (pre-)training pipeline.
Context: https://arstechnica.com/ai/2026/01/wikipedia-will-share-cont...
tl;dr: Wikipedia is CC and has public APIs, but AI companies have recently started paying for "enterprise" high-speed access.
Notably, the enterprise program started in 2021 and Google has been paying since 2022.
You’re saying Wikipedia was created 25 years ago to sell its content to train LLMs that didn’t even exist?! I doubt it…
“Jimmy Wales is even more of a visionary than we thought”