Comment by keeda

4 months ago

> These things are so hideously inefficient.

Quite the opposite, really. I did some napkin math for energy and water consumption, and compared to humans these things are very resource efficient.

If LLMs improve productivity by even 5% (studies actually peg productivity gains across various professions at 15 - 30%, and these are from 2024!) the resource savings by accelerating all knowledge workers are significant.

Simplistically, during 8 hours of work a human would consume 10 kWH of electricity + 27 gallons of water. Sped up by 5%, that drops by 0.5kWH and 1.35 gallons. Even assuming a higher end of resources used by LLMs, a 100 large prompts (~1 every 5 minutes) would only consume 0.25 kWH + 0.3 gallons. So we're still saving ~0.25 kWH + 1 gallon overall per day!

That is, humans + LLMs are way more efficient than humans alone. As such, the more knowledge workers adopt LLMs, the more efficiently they can achieve the same work output!

If we assume a conservative 10% productivity speed up, adoption across all ~100M knowledge work in the US will recoup the resource cost of a full training run in a few business days, even after accounting for the inference costs!

Additional reading with more useful numbers (independent of my napkin math):

https://www.nature.com/articles/s41598-024-76682-6

https://cacm.acm.org/blogcacm/the-energy-footprint-of-humans...

17 comments

keeda

wlesieutre 4 months ago

So with the AI is doing more of the work and you need less humans, what are you doing with the extra humans to eliminate their no-longer-productive resource consumption?

Saying “we can do the same work with less resource use” doesn’t mean resource consumption is reduced. You’ve just gone from humans using resources to humans using the same resources and doing less work, plus AI using more resources.

pmontra 4 months ago

Resource consumption often goes up. It's a time vs energy tradeoff and it's not free.
Your question is a variant of what do we do with all those humans now that they don't have to walk miles to the well every day because we invented aqueducts? The point is that they didn't want to walk to the well but they had to (and in some places they still have to) and very few people want to work, even now and even us, but they have to.
We will see what happens this time when we won't have to walk to that well.
danans 4 months ago

> So with the AI is doing more of the work and you need less humans, what are you doing with the extra humans to eliminate their no-longer-productive resource consumption?
Soon enough, we won't be able to avoid this question.
whatshisface 4 months ago

You put them to work doing more things than were possible in a month before.
keeda 4 months ago
The thing is, there are many interplaying dynamics here that are impossible to unravel. This is why I called it "napkin math", because figuring out the full ramifications of this change is a pretty large economic problem that nobody has figured out!
For instance, I think operating at this level of productivity is unsustainable (https://news.ycombinator.com/item?id=46896066)
There are many more dynamics at play of course, but I think an equilibrium will be found purely because everyone is incentivized to find a solution (UBI?) that keeps both the elites and the plebes living long and prospering. I expect some turmoil, but luckily, the severe resource crunch of GPUs gives us time to figure things out.
- keybored 4 months ago
  
  What I gather from your analysis and gleeful exclamation marks is that I should start rioting now rather than wait.
  
  2 replies →
_kb 4 months ago

Turn them into biogas to create more energy for DCs.

danielbln 4 months ago

Do keep in mind that 1 large prompt every 5 minutes is not how e.g. coding agents are used. There it's 1 large prompt every couple of seconds.

keeda 4 months ago

True, but I think in these scenarios they rely on prompt caching, which is much cheaper: https://ngrok.com/blog/prompt-caching/
I have no expertise here, but a couple years ago I had a prototype using locally deployed Llama 2 that cached the context (now deprecated https://github.com/ollama/ollama/issues/10576) from previous inference calls, and reused it for subsequent calls. The subsequent calls were much much faster. I suspect prompt caching works similarly, especially given changed code is very small compered to the rest of the codebase.

gosub100 4 months ago

Are you excluding the cost of training the AI from the calculation?

keeda 4 months ago

In the initial analysis of a single worker, yes, but when scaling up per-human savings to use by the wider population, the aggregate resource savings compensate for training resource usage within a few days, weeks at most.

what 4 months ago

How is a human consuming 27 gallons of water in an 8 hour work shift?

keeda 4 months ago

This includes things like drinking, sanitation, etc. Derived the number from here: https://www.epa.gov/watersense/statistics-and-facts
Mostly lines up with this reference too, which focuses only on water usage at work: https://quench.culligan.com/blog/average-water-usage-per-per...
DoctorOetker 4 months ago

Since their example scales the water consumption with their electricity consumption one may conclude it was the fresh water consumed (evaporated) during production of the electricity. Gaseous H2O is an even more potent GHG than CO2.
missingdays 4 months ago

One burger for lunch is 660 gallons of water. So 27 is actually a very huge underestimation