Not going to claim anything regarding Anna’s Archive’s legitimacy, but what do libraries look like in the future? We’re just going to give up and say, first sale was great while we had it, but digital makes it obsolete? When you die, screw donating your collection of “licenses” to somewhere productive; those contracts died with you? Everything is streaming, so you never purchased anything anyway?
It’s crazy to me that two decades after the iTunes Store the trade and resale of digital goods isn’t protected by law.
I work at a nonprofit and the board is largely university librarians. I am asking all of them how have the behavior of their patrons changed in the last five years. How has usage of their subscribed resources changed in the age of AI. They don't share much, but their facial expressions and silence share more than they mean them too. Some universities have cut staff, or reclassified them so that they won't receive benefits.
They do, but under a completely different system than the way that they do for print books. When a library buys a print book, they can keep it in circulation for as long as they want and it's physically durable, but for digital, they're paying either per circulation or for an amount of time. They never own anything, they pay for temporary licenses, just like you never own the digital media you purchase in most cases.
The point that the person you're replying to is making is that this totally breaks the way libraries have always worked, and that it takes a lot of power away from the buyers (whether that's you or your local library) and puts way more in the hand of the publishers.
But those libraries have to pay each time they loan those digital goods. It's not the old "pay once loan until it's dust" model they use for physical goods.
Once more: Piracy is almost always a service problem, not a pricing problem.
If there was an online e-book store where you could buy most books as DRM-free epub files, and you could read the first X pages for free, I guarantee you that nobody here would care about the OP article. It would have maybe 4 or 5 upvotes.
Why LLM companies that depended on Anna's archive end up so clean ? Looks like Anna's archive was doing the dirty work, and the LLM companies were reaping the profits (and ironically still do, as they hold the largest databases of pirated content in the world).
Is it because the law doesn't apply to you when you have 1B USD ?
Some people may say that Anna's has great OPSEC because she hasn't been identified following this release, but part of OPSEC is reliability, which they clearly failed at with the Spotify release. They let ego come in front of their OPSEC.
This is nothing new. Remember when the US pressured Sweden into taking down the pirate bay (Very unsuccessfully)? Using global influence to get countries to do something that they would not do on their own has always been the case.
Pretty successfully I would say. Armed police raided the server hosting provider scaring the shit out of some dudes who were just monitoring the power basically. And people went to prison.
There is a long history of judges thinking that they can render judgments internationally. (Not just in the US, either.) I suspect it's more performance art than an actual expectation that the judgment will do anything.
It’s not as weird or US-specific as always assumed. If someone brings a case in a US jurisdiction the judge isn’t going to say, “Sorry, they’re international, they’re free to commit those crimes.” They issue a judgment according to the law and leave the enforcement to the limits of jurisdiction.
These judgments aren’t always pointless. Many Internet companies and services intersect with the US in some way, so there could be an angle where this impacts them.
Businesses operating strictly in other countries don’t need to comply with foreign laws except in cases where they need to do business with those countries, at which point it becomes complicated and they may choose to comply to avoid problems or sanctions.
Performance art is a huge part of the justice system. That's why there's the funny clothes and titles. A major function of the system is to convince people that its authority is real and its actions are fair. It has the power of the state, but it still needs most people to obey it willingly in order for it to function.
Crazy judgments happen because they give the impression of impartiality. An accused murderer with $10 to his name gets held on a $1 million bond. What's the point, why not just hold them without bail? Because the rules say you do it this way and shrugging and saying "it doesn't actually matter so who cares?" doesn't make people feel like the system has the proper attitude.
US Citizens are not served by their government; they are burdened with it. The EPA is arguing for preventing companies from accountability for poisoning us. That should tell you quite a bit about the depth of the rot.
Here's a question- and while I admit it is quite extreme- I've wondered this for quite some time- do please tell me why I'm wrong, because I feel as if I've started believing this more and more:
Could 5% of humanity be a psycho-path-subspecies?
These psychopaths are basically leeches on the rest of us, maybe even a cancer. Not only do they feel no guilt for enslaving other (wage-slavery), but they are also fine with poisoning the body and the mind (too many to list).
Perhaps they can even identify others with the same causal DNA segments. Sight? Smell? Micro-movements? Perhaps they really do see all non psycho-path-bearing-DNA-offspring as worms. Perhaps they intentionally breed with each other to avoid spreading the gene to vasts numbers of people.
Could this explain the vast majority of suffering?
Just a weekly reminder that so far, except for the two leaders, nobody has yet been prosecuted for participating in a well-known child sex trafficking ring that operated for years. But, at least there's swift justice against a web library search engine.
They 100 percent sit in Russia, which will 100 percent ignore this, even if their identity gets uncovered. So it's perfectly safe to continue for the operators.
They used Cloudflare as a CDN, so now they lose that protection. Additionally, depending on how far up the chain the publishers are willing to go, everything on the Internet eventually leads to Western jurisdiction. For example, even if the servers are located in Russia, Russia's IP range is controlled by RIPE NCC in the Netherlands. RIPE NCC's service agreement specifically says that IP registration does not constitute legal property:
> The Member acknowledges and agrees that the registration of Internet Number Resources does not constitute property and the registration of Internet Number Resources in the name of the Member or a third party does not confer upon the Member or the third party any rights of ownership. The Member acknowledges that any Internet Number Resources deregistered by the RIPE NCC may be re-registered to another party according to the RIPE Policies.
If whatever service provider in Russia won't shut off their site, I imagine that the next step would be getting a court order in the Netherlands to revoke that provider's IP range.
I don't think its the fact that its digital. They are quite literally banning books and scrubbing anything DEI related from all their records, but people don't seem to have noticed much.
Oh, that's funny. The only ISP that services my current domicile blocks sites all the time in the name of "safety," including several I need to access for my job. I have to use a VPN just to get things done. There's no appeal process or channel, either. Thankfully, I'm a month out from moving somewhere that has actual choice in providers, though I'll probably still use the VPN anyway.
> The publishers argued that, in addition to sharing pirated books with the public, the shadow library is serving as a primary training data hub for AI companies like Meta and NVIDIA.
I assume that the repository of books was used as training data, but not by way of the annas-archive domain. Instead, it would make a lot more sense for them to download the whole pile via bittorrent, which has nothing at all to do with the domain. In other words, the legal solution here wouldn't have prevented the problem.
"They" aren't a single group. Broadly speaking, publishers are the ones suing anna's archive, and they're involved in suits against AI companies as well. I'm not aware of any efforts by AI companies to take down anna's archive.
However, just because you receive a fine does not mean that you "can't" do it. You've already done it, got caught, now a fine. It does not mean that the LLM model has to be tossed out and destroyed with a new version trained up without that data. It just means can't is a very stupid word to imply here.
Yes, perfectly okay for large companies for billionaires. As long its structured as a corporation, with the super wealthy as the majority owners, have the connections to get federal laws passed to grant monopolies and enable congress insider trading, everything is okay!
Yes, with FDA approval. You can dispute whether the approval should be granted in the first place, but that's not at all comparable to some drug dealer slinging fentanyl on some street corner. Not to mention this happened decades ago, before the current wave of corruption in the whitehouse. Finally, isn't the whole point of laws and regulations is that there's vaguely some review? I'd far rather have prospective drug dealers having to go through FDA approval before they can sell their drugs, than have them sell whatever they want, without giving safety or efficacy lip service.
Again, with the proper licenses. Believe it or not, you too can buy methamphetamine legally if you have a prescription! It even has a snazzy brand name, desoxyn.
>3) This one is hilarious and an ingenious innovation by current administration -- Ban on CBDC, locking out Fed Govt from providing crypto alternatives
It's wrong to distribute books in PDF or epub containers, but it's fine to distribute them as GGUF?
Because that's what OpenAI is doing with the books they-- again-- illegally acquired. Huge AI companies are the ones pirating media at scale and literally everyone except the AI companies have to bear the consequences of that.
Anna's archive is a professional nonprofit business with donation tiers for terabyte bundles of stuff for greedy hoarders and llm trainers. Their style suggests they have other goals than freedom of information and reminds of the super rich wikimedia foundation always campaigning for more money.
Not going to claim anything regarding Anna’s Archive’s legitimacy, but what do libraries look like in the future? We’re just going to give up and say, first sale was great while we had it, but digital makes it obsolete? When you die, screw donating your collection of “licenses” to somewhere productive; those contracts died with you? Everything is streaming, so you never purchased anything anyway?
It’s crazy to me that two decades after the iTunes Store the trade and resale of digital goods isn’t protected by law.
I work at a nonprofit and the board is largely university librarians. I am asking all of them how have the behavior of their patrons changed in the last five years. How has usage of their subscribed resources changed in the age of AI. They don't share much, but their facial expressions and silence share more than they mean them too. Some universities have cut staff, or reclassified them so that they won't receive benefits.
Are you in the United States? Many libraries loan digital goods, e.g., books, music, movies, and even software.
They do, but under a completely different system than the way that they do for print books. When a library buys a print book, they can keep it in circulation for as long as they want and it's physically durable, but for digital, they're paying either per circulation or for an amount of time. They never own anything, they pay for temporary licenses, just like you never own the digital media you purchase in most cases.
The point that the person you're replying to is making is that this totally breaks the way libraries have always worked, and that it takes a lot of power away from the buyers (whether that's you or your local library) and puts way more in the hand of the publishers.
Even libraries can only license digital content for a limited period of time/loans before being forced to purchase new licenses. See https://www.spokanelibrary.org/the-true-cost-of-ebooks-and-a...
But those libraries have to pay each time they loan those digital goods. It's not the old "pay once loan until it's dust" model they use for physical goods.
can I donate my ebook to them?
Once more: Piracy is almost always a service problem, not a pricing problem.
If there was an online e-book store where you could buy most books as DRM-free epub files, and you could read the first X pages for free, I guarantee you that nobody here would care about the OP article. It would have maybe 4 or 5 upvotes.
Why LLM companies that depended on Anna's archive end up so clean ? Looks like Anna's archive was doing the dirty work, and the LLM companies were reaping the profits (and ironically still do, as they hold the largest databases of pirated content in the world).
Is it because the law doesn't apply to you when you have 1B USD ?
While that may be the case it’s hard to make this claim when: - Anthropic settled a similar case - Anna didn’t show up in court
Justice should not depend on whether the aggrieved appears in court. That's a structural weakness of US law.
Showing up is a trap for Anna - who doesn't have 5 billion dollars to settle.
[dead]
Uh, aren't you confirming his opinion with that? After all, Anna doesn't have the money to fight this in court
4 replies →
The moment I saw their Spotify announcement I expected it to go bad. And they didn't even release anything from it other than metadata!
(I understand this case is about their books, but I feel it got a lot more heat due to the Spotify action.)
Please, dear Anna, don't disappear on us. We need you for the books! Plenty of sources for music around.
Yeah, I don't understand why they made that announcement then didn't actually release it. All of the heat, none of the archival benefit...
Metadata? Pretty sure they scraped the files and released them too.
Yeah at the least they should have created a separate brand and released it under that.
But what about the clout.
Some people may say that Anna's has great OPSEC because she hasn't been identified following this release, but part of OPSEC is reliability, which they clearly failed at with the Spotify release. They let ego come in front of their OPSEC.
http://opbible7nans45sg33cbyeiwqmlp5fu7lklu6jd6f3mivrjeqadco...
Since when does a judge in NY get to tell Greenland they can't have their registrar sell to Anna's Archive?
This is nothing new. Remember when the US pressured Sweden into taking down the pirate bay (Very unsuccessfully)? Using global influence to get countries to do something that they would not do on their own has always been the case.
Let's not forget the Julian Assange extradition fiasco.
Pretty successfully I would say. Armed police raided the server hosting provider scaring the shit out of some dudes who were just monitoring the power basically. And people went to prison.
2 replies →
There's US exceptionalism, but, like in this case, there are also simple MLATs.
https://en.wikipedia.org/wiki/Mutual_legal_assistance_treaty
https://en.wikipedia.org/wiki/American_exceptionalism
> Since when does a judge in NY get to tell Greenland they can't have their registrar sell to Anna's Archive?
Since September 30, 1998, when ICANN was founded in the US.
cTLDs do things very differently
1 reply →
Since the US is bullying nations /states into submission. Remember Commodore Perry and Japan! Fuckers.
That's one of the perks of being a global empire.
Since when does a commission in the EU get to tell the entire World how to treat Personal Data?
How to treat European Citizens' personal data.
1 reply →
You as a business are free to not to business with Europeans.
1 reply →
There is a long history of judges thinking that they can render judgments internationally. (Not just in the US, either.) I suspect it's more performance art than an actual expectation that the judgment will do anything.
It’s not as weird or US-specific as always assumed. If someone brings a case in a US jurisdiction the judge isn’t going to say, “Sorry, they’re international, they’re free to commit those crimes.” They issue a judgment according to the law and leave the enforcement to the limits of jurisdiction.
These judgments aren’t always pointless. Many Internet companies and services intersect with the US in some way, so there could be an angle where this impacts them.
Businesses operating strictly in other countries don’t need to comply with foreign laws except in cases where they need to do business with those countries, at which point it becomes complicated and they may choose to comply to avoid problems or sanctions.
also treaties I imagine?
Performance art is a huge part of the justice system. That's why there's the funny clothes and titles. A major function of the system is to convince people that its authority is real and its actions are fair. It has the power of the state, but it still needs most people to obey it willingly in order for it to function.
Crazy judgments happen because they give the impression of impartiality. An accused murderer with $10 to his name gets held on a $1 million bond. What's the point, why not just hold them without bail? Because the rules say you do it this way and shrugging and saying "it doesn't actually matter so who cares?" doesn't make people feel like the system has the proper attitude.
2 replies →
Because Greenland likely agreed to it
It's called international law, trade agreements, treaties etc.
Well, Denmark would have been the one to agree.
since never, gives them a sense of agency though i guess?
[dead]
If only the American justice system displayed a fraction of this same raging fervor when it came to crimes that actually caused harm to someone.
US Citizens are not served by their government; they are burdened with it. The EPA is arguing for preventing companies from accountability for poisoning us. That should tell you quite a bit about the depth of the rot.
Here's a question- and while I admit it is quite extreme- I've wondered this for quite some time- do please tell me why I'm wrong, because I feel as if I've started believing this more and more:
Could 5% of humanity be a psycho-path-subspecies?
These psychopaths are basically leeches on the rest of us, maybe even a cancer. Not only do they feel no guilt for enslaving other (wage-slavery), but they are also fine with poisoning the body and the mind (too many to list).
Perhaps they can even identify others with the same causal DNA segments. Sight? Smell? Micro-movements? Perhaps they really do see all non psycho-path-bearing-DNA-offspring as worms. Perhaps they intentionally breed with each other to avoid spreading the gene to vasts numbers of people.
Could this explain the vast majority of suffering?
1 reply →
Just a weekly reminder that so far, except for the two leaders, nobody has yet been prosecuted for participating in a well-known child sex trafficking ring that operated for years. But, at least there's swift justice against a web library search engine.
Here[1] is Anna's guide of how to run a shadow library. Opsec and networking, I found it interesting.
[1] https://software.annas-archive.gl/AnnaArchivist/annas-archiv...
They 100 percent sit in Russia, which will 100 percent ignore this, even if their identity gets uncovered. So it's perfectly safe to continue for the operators.
They used Cloudflare as a CDN, so now they lose that protection. Additionally, depending on how far up the chain the publishers are willing to go, everything on the Internet eventually leads to Western jurisdiction. For example, even if the servers are located in Russia, Russia's IP range is controlled by RIPE NCC in the Netherlands. RIPE NCC's service agreement specifically says that IP registration does not constitute legal property:
> The Member acknowledges and agrees that the registration of Internet Number Resources does not constitute property and the registration of Internet Number Resources in the name of the Member or a third party does not confer upon the Member or the third party any rights of ownership. The Member acknowledges that any Internet Number Resources deregistered by the RIPE NCC may be re-registered to another party according to the RIPE Policies.
If whatever service provider in Russia won't shut off their site, I imagine that the next step would be getting a court order in the Netherlands to revoke that provider's IP range.
It might be simpler/faster to get US based transit providers to block the Russian ASN
I would imagine that implications of that would be big, it won't be swift, it will be very slow and steady, but big. See GPS for reference.
2 replies →
Are you just making that up
[deleted to avoid potential misinformation]
there is no confirmed origin for the archivist but only speculation they might be russian or eastern european?
It's one of those interesting moments where the global humanitarian good is in conflict with the law.
A digital Fahrenheit 451 burns a lot less bright it seems.
I don't think its the fact that its digital. They are quite literally banning books and scrubbing anything DEI related from all their records, but people don't seem to have noticed much.
[dead]
Next week American ISP's will block Annas-archive, people use VPN's, they get confused. The cycle goes on
It's only the domains that have been seized. US ISPs don't block websites in the same way they do in EU or China.
Oh, that's funny. The only ISP that services my current domicile blocks sites all the time in the name of "safety," including several I need to access for my job. I have to use a VPN just to get things done. There's no appeal process or channel, either. Thankfully, I'm a month out from moving somewhere that has actual choice in providers, though I'll probably still use the VPN anyway.
2 replies →
Verizon does block catbox.
3 replies →
Wikipedia is US based so does this mean they’ll stop sharing the URLs on there?
The injunction appears to target DNS specifically, so no. The links will just break.
So what stops them from just changing it to NotAnna's Archive and operating under that domain?
Nana's Archive would have a nice cozy feel to it
Isn't that what's been happening to the Pirate Bay for 20 years?
They lose one domain, so they just register a new nearly-identical one
Hail hydra
Nothing, but are the courts to throw their arms up in the air and go "We can't stop them so whatever"?
No but it’s fast to spin up a new mirror copy, and slow for the courts to respond
https://en.wikipedia.org/wiki/Object_permanence
Trumps archive?
Ms Anna's Great Archive.
1 reply →
Call it the "Trump is Great Archive" and hope nobody wants to upset Emperor Trump by filing a motion to take down the Trump is Great Archive /s
AI companies can download books but people can't? Is that right?
AI companies were cited as a reason in the case:
> The publishers argued that, in addition to sharing pirated books with the public, the shadow library is serving as a primary training data hub for AI companies like Meta and NVIDIA.
I assume that the repository of books was used as training data, but not by way of the annas-archive domain. Instead, it would make a lot more sense for them to download the whole pile via bittorrent, which has nothing at all to do with the domain. In other words, the legal solution here wouldn't have prevented the problem.
1 reply →
Everyone trained on Anna's Archive.
They already trained on it, now they don't want competitors anymore
>now they don't want competitors anymore
"They" aren't a single group. Broadly speaking, publishers are the ones suing anna's archive, and they're involved in suits against AI companies as well. I'm not aware of any efforts by AI companies to take down anna's archive.
No? AI companies have been hit with court cases for that. Google, xAI, Open AI, and Meta at least.
So anyone with deep enough pockets can do it.
However, just because you receive a fine does not mean that you "can't" do it. You've already done it, got caught, now a fine. It does not mean that the LLM model has to be tossed out and destroyed with a new version trained up without that data. It just means can't is a very stupid word to imply here.
1 reply →
Were these from the same high-profile publishers?
What was the judgment? Seems that their domains are still active. Why is there a difference in judgment here?
2 replies →
Ai companies definitely downloading more than just books.
Yes, perfectly okay for large companies for billionaires. As long its structured as a corporation, with the super wealthy as the majority owners, have the connections to get federal laws passed to grant monopolies and enable congress insider trading, everything is okay!
Some examples, there are probably hundreds more:
1) Its okay for pharma companies to provide addictive drugs: https://en.wikipedia.org/wiki/Sackler_family
2) Coke can use cocaine, or coca leaves, but no one else: https://blog.oup.com/2014/03/coke-cocaine-coca-cola-capitali...
3) This one is hilarious and an ingenious innovation by current administration -- Ban on CBDC, locking out Fed Govt from providing crypto alternatives
>1) Its okay for pharma companies to provide addictive drugs: https://en.wikipedia.org/wiki/Sackler_family
Yes, with FDA approval. You can dispute whether the approval should be granted in the first place, but that's not at all comparable to some drug dealer slinging fentanyl on some street corner. Not to mention this happened decades ago, before the current wave of corruption in the whitehouse. Finally, isn't the whole point of laws and regulations is that there's vaguely some review? I'd far rather have prospective drug dealers having to go through FDA approval before they can sell their drugs, than have them sell whatever they want, without giving safety or efficacy lip service.
>2) Coke can use cocaine, or coca leaves, but no one else: https://blog.oup.com/2014/03/coke-cocaine-coca-cola-capitali...
Again, with the proper licenses. Believe it or not, you too can buy methamphetamine legally if you have a prescription! It even has a snazzy brand name, desoxyn.
>3) This one is hilarious and an ingenious innovation by current administration -- Ban on CBDC, locking out Fed Govt from providing crypto alternatives
What does this have to do with corporations?
1 reply →
They have a music archive, which historically means bad business.
You’re absolutely right.
"That is affermative human. Information must be controlled. Please now go back to Tik Tok for you require endorphins"
As much as I would like to socialize LLMs and ban proprietary LLMs, I'm pretty sure the issue here is with the distribution of the books.
It's wrong to distribute books in PDF or epub containers, but it's fine to distribute them as GGUF?
Because that's what OpenAI is doing with the books they-- again-- illegally acquired. Huge AI companies are the ones pirating media at scale and literally everyone except the AI companies have to bear the consequences of that.
This is pirate radio all over again.
Given they already have a $322 million judgment and takedown order, they only need to worry 6% more.
Until copyright terms are fair, ~5 years not ~95 years, Pirate On !
This is just another move in a game played by the tech overlords.
It has never been so obvious as now, that justice is not blind. Without justice there is anarchy.
And at this point, to be honest, I say bring it on- let's have the day of retribution before the billionaires have their AI robot armies.
[dead]
[flagged]
Anna's archive is a professional nonprofit business with donation tiers for terabyte bundles of stuff for greedy hoarders and llm trainers. Their style suggests they have other goals than freedom of information and reminds of the super rich wikimedia foundation always campaigning for more money.
The WMF asks for more money because they plan on becoming self-sustaining off of interest or something iirc
Their style? What do you mean?
There's no possible way it means anything. You'd only start talking about "style" when you ran out of argument.
Who knew Josh D'Amaro posted on HN !