← Back to context

Comment by DoctorOetker

2 hours ago

it is instructive to calculate the size and requirements for a system that can pretrain a 405B parameter transformer in ~ 17 days.

a different question is the expected payback time, unless someone can demonstrate a reasonable calculation that shows a sufficiently short payback period, if no one here can we still can't exclude big tech seeing something we don't have access to (the launch costs charged to third parties may be different than the launch costs charged for themselves for example).

suppose the payback time is in fact sufficiently short or commercial life sufficiently long to make sense, then the scale didn't really matter, it just means sending up the system described above repeatedly.

I mean yeah if you consider the "scale" to not be a problem there are no problems indeed. I argue that the scale actually is the biggest problem here... which is the case with most of our issues (energy, pollution, cooling, heating, &c.)