Comment by rvnx
3 hours ago
Why do LLM companies that depended on Anna's Archive end up so clean? It looks like Anna's Archive was doing the dirty work while the LLM companies were reaping the profits (and, ironically, still do, as they hold the largest databases of pirated content in the world).
Is it because the law doesn't apply to you when you have 1B USD?
While that may be the case, it's hard to make this claim when:
- Anthropic settled a similar case
- Anna didn't show up in court
Showing up is a trap for Anna - who doesn't have 5 billion dollars to settle.
Justice should not depend on whether the aggrieved appears in court. That's a structural weakness of US law.
Is there a country where you don't lose by default if you don't show up to court?
Uh, aren't you confirming his opinion with that? After all, Anna doesn't have the money to fight this in court
No. Anthropic fought and paid $1.5 billion in settlement and agreed to delete all the copyrighted material.
Anthropic knows they could just pay off the aggrieved party.
The operators of Anna's know they will go to prison.
You can make an argument that training an LLM on something is not the same as copying it in the same way that your brain is not in breach of copyright for having watched a Disney movie. I'm not sure of the rights and wrongs of that but it complicates legal action.
Can I download an archive of movies so a human animator can study the techniques there?
Surely you have to make the copy to feed it into the LLM for training, so
I think some of the LLM companies have used legally purchased materials.
Distribution. Anna's archive actively distributes the pirated material. LLM companies don't.
Fruit of the poisonous tree.