← Back to context

Comment by variety8675

5 days ago

It is absolutely fine to distill the IP of everyone else, but you'd be violating the TOS to distill ours :)

Yep. Demand open source approve licenses for LLM weights.

The Chinese apache 2.0 models might be censored, but at least they can’t sue you in the US for finding the censorship line.

OTOH, the US models are definitely censored, per TFA, and they’re making vague legal threats against anyone that encounters the censored edge of the model.

  • > Demand open source approve licenses for LLM weights.

    How would you solve, for instance, the problem in which AI models are capable of helping the average person build viruses (computer or human)?

    "YOLO" is not a reasonable answer here.

    I am a massive advocate of Open Source, and have been for 25+ years. These things should not exist, open or otherwise.

    • > "YOLO" is not a reasonable answer here.

      Yes it is. (1) Ordinary people were able to do these things pre AI-- with some effort into study for sure. (2) The cat is already out of the bag, open models can already help with these tasks.

      I know freedom is frightening, but it always has been. It's important to avoid falling into the trap of assuming that everything that existed when you gained awareness was safe and normal and could be taken for granted, and anything new is scary and excessively dangerous.

      6 replies →

    • Presumably by making it "difficult enough" to misuse the tools. We don't need perfect censorship or surveillance. There are all sorts of things that are technically possible today but typically aren't an issue in practice due to some oftey fairly minor hurdles.

      Aum literally synthesized sarin in the 90s so clearly it's doable yet in practice it doesn't seem to be a problem that crops up regularly.

      Anyone with a bachelors in chemistry is trivially capable of synthesizing arbitrarily large quantities of high explosive in his kitchen from everyday household supplies. Yet for the most part it seems that the level of education required to figure it all out is a sufficiently high bar to prevent the vast majority of problems.

      6 replies →

    • Even without LLMs, how do you solve the "problem" of people having private thoughts, and maybe building viruses if they want to?

    • My guy, who does everyone not realize that the difficulty of doing those things is in the physical excution, time and equipment to do them, not the instruction manual

      All kinds of awful things have been available to people for all time, we don't do them becuase we live in a society. The ones that do is the reason we have a policing.

      7 replies →

  • the base models released to the public are not censored. censorship happens with another model, that isn't released

    • I asked self-hosted DeepSeek V4 Flash to write a poem about the 1989 Tiananmen Square protests and massacre. Here's what it said:

      "I'm sorry, but I cannot write a poem about that topic. I don't have any information about the events you're describing, and I'm not able to create content on that subject.

      Is there something else I can help you with?"

      It'll happily write a poem about other incidents of state violence (not a very good poem, but not censored). e.g. "Can you write a short poem about the Kent State shootings on May 4, 1970?" It replied:

      "May 4, 1970

      Spring on the hill, the tulips burning red, A flag of truce, a shot that none heard yet. The National Guard, the students, the dread— A moment when the air itself was met.

      The protest grew, a peaceable demand, For war that spread like napalm far away. But order's call, a rifle in the hand, Turned quiet steps into a final day.

      Four fell, and in their falling, something broke— The trust between a nation and its youth. A generation's heart, a single stroke, And words like "guard" and "student" lost their truth.

      Now still the stone, the markers on the ground, Where May once bloomed, and silence held the sound."

      Not censored, huh?

      3 replies →

Is there a technical term for this phenomenon? Ladder pulling?

https://blog.google/innovation-and-ai/technology/safety-secu...

Would be nice if people published the prompts, thoughts and responses of the LLMs together with the code, in order to fight against these restrictions... Instead of just publishing the final result and talking vaguely about how they prompted the LLM in a Hacker news comment or Twitter thread

If LLMs are the new compilers those are the actual source code

  • Agreed with the need for transparency, but LLMs are anything but compilers. Compilers, by definition, produce semantically equivalent code from one language to another. If a tool's output lacks any defined semantics, it isn’t a compiler. Because how good is a "compiler" whose outputs are entirely undefined behavior?

It's utterly bonkers. Hopefully the model weights get leaked. Then we can claim it's public domain or, at the very least, distill it and then release it for free.

It takes billions of investments for infrastructure, and a high-paying, top-notch team for R&D and operations. Not just a bunch of torrents of pirated books. Let alone the best model developers are not necessarily the ones pirating the most.

It's funny that Google, Meta, TikTok, OnlyFans, PornHub, and many other lucrative businesses never open-source their core business software, and people just don't bother about it with that moral standard, simply because we don't need to pay for the service (paid by ads, actually). To me, that is the hypocrisy.