Comment by fblp

10 hours ago

There's something heartwarming about the developer docs being released before the flashy press release.

Their audience is people who build stuff; tech's audience is enterprise CEOs, politicians, and anyone else happy to hype up all the questionably timed releases, warnings of danger, white-collar irrelevance, or promises of utopian paradise right before a funding round.

Where are the training data and training scripts, since you are calling this open source?

Edit: it seems "open source" was edited out of the parent comment.

  • doesn't it get tiring after a while? using the same (perceived) gotcha, over and over again, for three years now?

    no one is ever going to release their training data because it contains every copyrighted work in existence. everyone, even the hecking-wholesome safety-first Anthropic, is using copyrighted data without permission to train their models. there you go.

    • There is an easy fix already in widespread use: "open weights".

      It is already a very valuable thing; no need to taint it with a false promise.

      Though I disagree that it wouldn't be used if it were indeed open source: I might not do it inside my home lab today, but at least Qwen and DeepSeek would use and build on what e.g. Facebook was doing with Llama, and they might be pushing the open-weights model frontier forward faster.

      2 replies →

  • They are exactly open source. The training data is the internet. Don't say it's on the internet. It IS the internet.

    The training scripts are in Megatron and vLLM.

  • Weights are the source, training data is the compiler.

    • You got it the wrong way round. It's more akin to:

      1. Training data is the source.

      2. Training is compilation/compression.

      3. Weights are the compiled output, akin to optimized assembly.

      However, it's an imperfect analogy on many levels. Nitpick away.
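      A minimal sketch of the analogy (a hypothetical toy, not any real model's pipeline): gradient descent "compiles" a tiny dataset into a single weight, and only the weight is shipped — the data itself is discarded, which is exactly the "open weights without open source" situation.

      ```python
      # Hypothetical illustration of "training data is the source,
      # training is compilation, weights are the compiled output".

      def train(data, lr=0.01, steps=1000):
          """'Compile' (x, y) pairs into one weight for y ~ w * x."""
          w = 0.0
          for _ in range(steps):
              # Gradient of mean squared error for y ~ w * x
              grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
              w -= lr * grad
          return w

      corpus = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # the "source"
      weight = train(corpus)                          # the "compiled" artifact
      print(round(weight, 3))  # close to 2.0; releasing only `weight` = open weights
      ```

      Like a stripped binary, the weight lets you run the function, but it doesn't let you reconstruct or audit the corpus it was trained on.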

      1 reply →