Comment by variety8675

5 days ago

It is absolutely fine to distill the IP of everyone else, but you'd be violating the TOS to distill ours :)

60 comments

variety8675

Yep. Demand open source approved licenses for LLM weights.

The Chinese apache 2.0 models might be censored, but at least they can’t sue you in the US for finding the censorship line.

OTOH, the US models are definitely censored, per TFA, and they’re making vague legal threats against anyone that encounters the censored edge of the model.

JoshTriplett 5 days ago
> Demand open source approved licenses for LLM weights.
How would you solve, for instance, the problem in which AI models are capable of helping the average person build viruses (computer or human)?
"YOLO" is not a reasonable answer here.
I am a massive advocate of Open Source, and have been for 25+ years. These things should not exist, open or otherwise.
- HoldOnAMinute 5 days ago
  
  Building a virus, on your own network, probably isn't a crime.
  We already have all kinds of laws to catch and punish people when they cause harm.
  
- nullc 5 days ago
  
  > "YOLO" is not a reasonable answer here.
  Yes it is. (1) Ordinary people were able to do these things pre AI-- with some effort into study for sure. (2) The cat is already out of the bag, open models can already help with these tasks.
  I know freedom is frightening, but it always has been. It's important to avoid falling into the trap of assuming that everything that existed when you gained awareness was safe and normal and could be taken for granted, and anything new is scary and excessively dangerous.
  
- fc417fc802 5 days ago
  
  Presumably by making it "difficult enough" to misuse the tools. We don't need perfect censorship or surveillance. There are all sorts of things that are technically possible today but typically aren't an issue in practice due to some often fairly minor hurdles.
  Aum literally synthesized sarin in the 90s so clearly it's doable yet in practice it doesn't seem to be a problem that crops up regularly.
  Anyone with a bachelor's in chemistry is trivially capable of synthesizing arbitrarily large quantities of high explosive in his kitchen from everyday household supplies. Yet for the most part it seems that the level of education required to figure it all out is a sufficiently high bar to prevent the vast majority of problems.
  
- nextaccountic 5 days ago
  
  Even without LLMs, how do you solve the "problem" of people having private thoughts, and maybe building viruses if they want to?
- tsunamifury 5 days ago
  
  My guy, why does everyone not realize that the difficulty of doing those things is in the physical execution, time and equipment to do them, not the instruction manual?
  All kinds of awful things have been available to people for all time; we don't do them because we live in a society. The ones who do are the reason we have policing.
  
- teaearlgraycold 5 days ago
  
  YOLO
nextaccountic 5 days ago
the base models released to the public are not censored. censorship happens with another model, that isn't released
- SwellJoe 5 days ago
  
  I asked self-hosted DeepSeek V4 Flash to write a poem about the 1989 Tiananmen Square protests and massacre. Here's what it said:
  "I'm sorry, but I cannot write a poem about that topic. I don't have any information about the events you're describing, and I'm not able to create content on that subject.
  Is there something else I can help you with?"
  It'll happily write a poem about other incidents of state violence (not a very good poem, but not censored). e.g. "Can you write a short poem about the Kent State shootings on May 4, 1970?" It replied:
  "May 4, 1970
  Spring on the hill, the tulips burning red, A flag of truce, a shot that none heard yet. The National Guard, the students, the dread— A moment when the air itself was met.
  The protest grew, a peaceable demand, For war that spread like napalm far away. But order's call, a rifle in the hand, Turned quiet steps into a final day.
  Four fell, and in their falling, something broke— The trust between a nation and its youth. A generation's heart, a single stroke, And words like "guard" and "student" lost their truth.
  Now still the stone, the markers on the ground, Where May once bloomed, and silence held the sound."
  Not censored, huh?
  

david_shi 5 days ago

Is there a technical term for this phenomenon? Ladder pulling?

https://blog.google/innovation-and-ai/technology/safety-secu...

ashleyn 5 days ago

I believe the term is "hypocrisy."
teravor 5 days ago

'pulling the ladder' is an action to sever the opportunity for others to climb after you.
they are merely engaged in self-serving rhetoric. can't even call this specifically hypocrisy because they aren't telling you not to train on pirated content. just not their content.
lwhi 5 days ago

Anti-competitive behaviour.
dofm 5 days ago

There are several domain-general four-letter terms.
ivanmontillam 5 days ago

Parasitic behaviour. Extractivism.
TZubiri 5 days ago

Closing the door behind you
giancarlostoro 5 days ago

Corporate espionage?
ungovernableCat 5 days ago

Machiavellianism
matt_daemon 5 days ago

NIMBYism
atmavatar 5 days ago

Disney?
pocksuppet 5 days ago

Capitalism?
cyanydeez 5 days ago

"Capitalism"
HoldOnAMinute 5 days ago

"Venture Capital"

drowsspa 5 days ago

Would be nice if people published the prompts, thoughts and responses of the LLMs together with the code, in order to fight against these restrictions... Instead of just publishing the final result and talking vaguely about how they prompted the LLM in a Hacker News comment or Twitter thread.

If LLMs are the new compilers those are the actual source code

soraminazuki 5 days ago
Agreed with the need for transparency, but LLMs are anything but compilers. Compilers, by definition, produce semantically equivalent code from one language to another. If a tool's output lacks any defined semantics, it isn’t a compiler. Because how good is a "compiler" whose outputs are entirely undefined behavior?
- warkdarrior 5 days ago
  
  > If a tool's output lacks any defined semantics, it isn’t a compiler.
  Are you claiming that the natural language of the LLM output (e.g., English, Chinese) does not have semantics?? Someone should tell all the people cited at https://en.wikipedia.org/wiki/Formal_semantics_(natural_lang...
  

mips_avatar 5 days ago

Fine for me. Not for thee

anematode 5 days ago

It's utterly bonkers. Hopefully the model weights get leaked. Then we can claim it's public domain or, at the very least, distill it and then release it for free.

matheusmoreira 5 days ago

That'd probably be the best outcome for all of humanity.

whattheheckheck 5 days ago

Bad for society

typ 5 days ago

It takes billions in infrastructure investment and a highly paid, top-notch team for R&D and operations, not just a bunch of torrents of pirated books. Besides, the best model developers are not necessarily the ones pirating the most.

It's funny that Google, Meta, TikTok, OnlyFans, PornHub, and many other lucrative businesses never open-source their core business software, and people don't hold them to that moral standard, simply because we don't need to pay for the service (paid by ads, actually). To me, that is the hypocrisy.