
Comment by Gigachad

4 years ago

It does ruin the illusion that you need crazy million-dollar servers to run the model and that the world would fall into chaos if the public got their hands on these models.

To be fair, the world only just got its hands on this (and by world I mean people with decent hardware), so it's too soon to say what the ramifications will be.

  • Also to be fair, the job "AI ethicist" probably didn't exist as a real thing until a few years ago. So the people in those roles over at OpenAI likely have no idea what they're doing.

We wouldn't be able to run it ourselves if they hadn't trained it on 4000 GPUs for a month.

  • The cost of training is actually quite a bit less than that. Emad, the creator of SD, stated this on Twitter:

    "We actually used 256 A100s for this per the model card, 150k hours in total so at market price $600k"

    • Even if it was hard to train, you could make your own by fine-tuning a larger model for much cheaper.

      Those are called "base models" (or "foundation models", if you're Stanford trying to co-opt the term).
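For anyone who wants to sanity-check the numbers in this thread, here is a rough back-of-the-envelope sketch. It only uses the figures quoted above (256 A100s, 150k GPU-hours, $600k, and the "4000 GPUs for a month" claim). The 30-day month and the function names are assumptions made for illustration; nothing here is measured.

```python
# Figures as quoted in the thread; none of these are independently verified.
NUM_GPUS = 256            # A100s, per the quoted tweet
GPU_HOURS = 150_000       # total GPU-hours, per the quoted tweet
TOTAL_COST_USD = 600_000  # "market price", per the quoted tweet

# The earlier claim in the thread: 4000 GPUs for a month.
CLAIMED_GPUS = 4000
CLAIMED_DAYS = 30         # assumption: "a month" taken as 30 days


def implied_hourly_rate(total_cost_usd: float, gpu_hours: float) -> float:
    """Price per GPU-hour implied by the quoted total cost."""
    return total_cost_usd / gpu_hours


def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Days of wall-clock time if all GPUs ran in parallel the whole time."""
    return gpu_hours / num_gpus / 24


rate = implied_hourly_rate(TOTAL_COST_USD, GPU_HOURS)
days = wall_clock_days(GPU_HOURS, NUM_GPUS)
claimed_gpu_hours = CLAIMED_GPUS * CLAIMED_DAYS * 24
ratio = claimed_gpu_hours / GPU_HOURS

print(f"Implied rate: ${rate:.2f} per GPU-hour")
print(f"Wall-clock time on {NUM_GPUS} GPUs: about {days:.1f} days")
print(f"'4000 GPUs for a month' is {claimed_gpu_hours:,} GPU-hours, "
      f"about {ratio:.1f}x the quoted figure")
```

So the "for a month" part of the earlier claim is roughly consistent with the quoted figures, while the GPU count is off by more than an order of magnitude.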
