Comment by tyingq

5 days ago

The abrupt swing in many non-technology company IT departments from "hey developer, you aren't using enough tokens" to this is just too funny.

And I'm seeing almost no self-awareness from leaders. They are making decisions about things that they just don't understand. And are completely unworried about it. Just blindly following whatever the news cycle is about AI.

82 comments

tyingq

datakan 5 days ago

The closer people live to the consequences of their decisions the more rational they become. Until leaders(and I use that term loosely) are held accountable, the insanity will continue.

greesil 5 days ago
Their only accountability is to the stock price. The insanity will continue.
- dfedbeef 5 days ago
  
  As long as our stock price continues to... Continues to rise... Which... Hmm... I'm just now reading our balance sheet. Is this number right? Great, thanks.
  As I was saying, you're all fired.
- Henchman21 5 days ago
  
  I’m willing to bet that most of us here are capable of acquiring pitchforks and torches.
  I predict that will be their comeuppance; it will begin a new era in history.
  
  2 replies →
mrandish 5 days ago

In addition to being true, this observation is profound. When designing any multi-step system that relies on humans making decisions, whether in governance, organizations or economies, placing root causes as close to end effects as possible is almost always better.
oofbey 5 days ago
I’m sorry you are used to working with out of touch leadership. Not all companies are like that. Even big ones can have smart, empathetic leaders. Although very often money gets in the way of empathy.
- rf15 5 days ago
  
  Money alao has the problematic tendency to warp the people around you, it's its own kind of gravity. The more powerful you are the more you attract yesmannerism and the more you lose touch with what's going on.
- therealdrag0 5 days ago
  
  Also notably these attributes don’t make one infallible. I see a lot of engineers judging from the sidelines without any sense of how to run large orgs and how you have to make tough calls with imperfect info all the time.
- OutOfHere 4 days ago
  
  Being out of touch is the default state for leadership. They mostly just parrot the news with a multi-month lag.
- pdimitar 5 days ago
  
  You hiring?

bunderbunder 5 days ago

I've been enjoying journalist Ed Zitron's recent diatribes about how impossible it is to find a business leader who had a plan for measuring their ROI from adopting AI coding.

What he says he's consistently hearing from them mirrors what I saw at my own employer: they thought they had ROI metrics, but they actually only had usage metrics such as "lines of code committed" or "number of pull requests". The only way those could possibly work as an ROI measure is if your business charges customers by the line of code.

conception 5 days ago
What they really means is they previously had no valid metric to measure productivity of developers before either. AI or not.
- bunderbunder 5 days ago
  
  Measuring productivity of developers isn’t really in line with what needs to happen, either. A team can be incredibly productive and still generate negative 100% ROI if what they are building so industriously is stuff that nobody wants to buy.
  Which reflects another thing I’ve seen at work. A lot of what AI coding has enabled is diving headfirst into quagmires. Our costs have spiked - not just because of the token spend, also because we gotta pay the cloud platform to run all these new services, operators to operate them, marketers to market them, etc. - but revenue hasn’t budged.
- no-name-here 5 days ago
  
  But at least pre AI, most managers presumably subjectively measured devs on relevant performance. Using systems where employees who burn the most tokens ($) per week ‘win’ is crazy - just ask the AI to spin up a subagents to implement every conceivable approach to a task, then spin up n agent judge to pick the winner, and repeat. You've immediately got 50x or whatever your previous usage from that alone.
  
  1 reply →

vasco 5 days ago

During ZIRP they discovered that the way to lead companies nowadays is to become a maxxer of whatever current fad is, and the more you maxx the better. And then when things change and you're wrong, you'll be a strong leader and, in ZIRPs case fire everyone you over-hired, with AI will be similar.

Why be a normal guy that waits to see what happens and is measured and pragmatic when you can get attention basically through the whole cycle by being the earliest adopter, adopt it to the maxx, then also be the loudest big brain when the tide changes and be praised for "taking hard decisions" when you revert everything you said so far?

The fakemaxxing economy.

janussunaj 5 days ago

A special case of the more general cringe economy we're in. The dumbest, most outrageous ideas win, amplified by social media. Say stupid sh*t loudly, be wrong, profit.

sdeframond 5 days ago

Groups resist to change - the bigger the group, the most resistance there is.

As a leader, pushing for rapid change cannot really be nuanced lest the push dissipates into the organization's entropy.

HarHarVeryFunny 5 days ago
Perhaps, but the change you get (if any) is most likely to be what you push for and reward/punish.
It's irrational to push for tokenmaxxing (literally "please increase our AI spending") and not expect that this is the result you are going to get. You won't get productivity increase, since that is not what you are pushing for - you will get token usage maximization (engineers running inane agentic tasks against your code base to increase usage, using company paid AI for their side projects, etc, etc).
- lanstin 5 days ago
  
  The evidence suggests that many tech leaders do not realize that an immediate result of heavy handed uninformed top down decision making is transforming the “work together, succeed together, giving quality” ethos into a cynical game theory minimax effort to game whatever stupid arbitrary metrics are used to implement the top down fad of the quarter; do it consistently and you get a work force that can be given a metric and immediately, instinctively, tell you how the work flow will be adjusted for the new metric, and where the difficult problems will be shunted to.
- SpicyLemonZest 5 days ago
  
  I'm not sure the leaders would disagree with what you're saying. They tokenmaxxed to understand what it looks like when AI gets into every corner of the business; now they feel they've gotten enough info (or at least that more info wouldn't be worth the cost), so they're adding in cost controls. As the article says, this is not great for AI model providers trying to predict what their future revenue is going to be, but it's not obvious that there's any mistake here for AI users.
  
  6 replies →

qoez 5 days ago

I feel like most successful businesses have such a moat of required capital to compete with them that even tho in theory poor decisions like this is supposed to give opportunities for entreprenuers to hit when the big dogs make a wrong move, it doesn't end up happening.

morgan814 5 days ago

> leaders

Don’t play their game and call them leaders. They are management, bosses, executives.

> They are making decisions about things that they just don't understand. And are completely unworried about it.

Clowns, even.

> Just blindly following whatever the news cycle is about AI.

But followers might be most apt.

——

This is such a huge pet peeve of mine. Describing management goofs using their language that makes them sound all-so-brilliant. We constantly watch these people do the dumbest shit and then they go around describing themselves as “thought leaders” and “servant leaders”. When, really, most are just clowns with fragile egos.

And, while I’m rambling, they’ve tried to take away the fact we are workers by calling us individual contributors. Using language to attempt and hide the hierarchy and power dynamic at play. It just…bothers me so much.

joquarky 5 days ago

I don't hear them refer to themselves as "job creators" much these days.
And many of them still claim they are "risk takers", but have effectively insulated themselves from risk by socializing losses.
danaris 4 days ago

> Don’t play their game and call them leaders. They are management, bosses, executives.
You're falling into a common trap here: the ambiguity of the English language.
"Leader" means multiple different things. Yes, it means someone who has leadership qualities—who genuinely inspires those around them to do better, or who boldly marches into the unknown and gets people to follow them.
But it also means "someone in charge of a thing."
Now it's certainly true that many people in charge of things who are also really bad at actually inspiring or getting people to follow them (aside from with threats of destitution) also play on that ambiguity to try to convince people that because they're in charge of things, they must also be Good Leaders, and that's crappy...but yelling at others for using the term casually is very much an "old man yells at cloud" situation.

schonfinkel 3 days ago

The worst part is that techies can still work around the insanity if they keep their opinions private. For the serious average Joe the AI mandates must be feeling like hell on earth.

I once worked in a company that had soviet-level efforts to push LLMs into everything, someone eventually made the classic "Natural Language -> SQL Query -> Magic Result in webpage" and got promoted, the tool got mandated for every non-tech employee as part of an AI-boosting effort (people pushing metrics up).

One day I wake up with a product person in despair because the tool couldn't handle what looked like a very simple aggregation, I stopped what I was doing, crafted a 30-line SQL query over HORRIBLE TABLES, a couple CTEs and window functions here and there got him what he wanted. I found out later that single query that took 30 minutes to make saved him from inheriting a 6-month effort to create a microservice dedicated to patching said tool.

surgical_fire 5 days ago

I've never seen self-awareness from leaders. They always lead on vibes.

Understanding this was one of the most important things in my career.

steve1977 5 days ago

That's nothing new though. It's just very obvious this time.

im3w1l 5 days ago

Having studied control theory I think it makes perfect sense. When trying to make a system target a new level it's quite natural for there to be overshoot that needs to be reigned in. It's also natural for the correction to go too far and need to be corrected in turn. This is not indicative of stupidity it's completely normal.

It would only be laughable if they waited way too long to reverse course, but I don't think that's the case.

RJIb8RBYxzAMX9u 5 days ago

Suppose I'm driving at 20 kph, and I set my cruise control to 40 kph. My car then goes WOT, overshoots my target speed and hits 120 kph, at which point it slams on the brakes[0], dropping my speed to 15 kph. It repeats until it finally settles at my target speed. (Rhetorical question) would that be considered "completely normal"?
Over/undershoots and corrections are of course unavoidable and normal; the absurdity is at the magnitude and rate of change. Furthermore, this is giving it the benefit of the doubt, that measuring AI spend is a good indicator; that's arguably also in dispute. To stretch my car analogy a bit more: it would be like the cruse control system has to hit the target speed, but it only has data from the O2 sensors.
[0] I know that the "classic" cruise control system cannot apply the brakes, but hey no analogy's perfect.
adammarples 5 days ago
It's not like they accidentally overshot, they were telling people to tokenmax, they didn't even know you could overshoot they thought it was exponential gains all the way. Subtle ideas like balance were not on their minds.
- im3w1l 5 days ago
  
  Intentionally overshooting can be a legitimate strategy.

onlyrealcuzzo 5 days ago

The actual cost is going to drop 99% in ~4 years.

How much that makes it into enterprise pricing is TBD, since none of the hyper scalers are making money yet of selling AI inference.

Almost all businesses are ahead of the gun. For most of their use cases, AI is either not yet good enough on its own, or good enough but too expensive.

No one wants to get left behind, so everyone's trying to get onto it now, even though it's not ready for what most enterprises want to do with it.

It's easy for them to look at a small startup without billions of lines of legacy business logic debt and see them having success and wonder why they can't have just as much - or more - why they're bigger so they should have better and more success, right???

Wrong...

But when it gets ~99% cheaper for local inference over the next 4 years, at the same time the price per watt improve 4x -> a lot of those cases will start to pencil out.

BearOso 5 days ago
Going from Opus 4.5 to 4.7 secretly required 6x more compute to run. 4.8 is apparently 30% more on top. I haven't seen any optimizations lately aside from distillation. Nobody's optimizing, they're just scaling up.
- rescbr 5 days ago
  
  > Nobody's optimizing
  The Chinese, since they lack computing hardware due to US export controls, are.
  
  5 replies →
- trollbridge 5 days ago
  
  DeepSeek and Alibaba would like to have a word.
  
  2 replies →
- new_account_102 5 days ago
  
  [dead]
krona 5 days ago
> The actual cost is going to drop 99%
Do you mean the marginal cost by the producer, or the cost on the consumer? I can't see the price of electricity falling much, and the demand curve is apparently exponential if the hype is to be believed.
- trollbridge 5 days ago
  
  DeepSeep V4 Pro is 99% cheaper than similarly performing models were 2 years ago (if such a model even existed).
  Computing has always been about how to wring out more efficiency. The ENIAC was 150,000 watts, with 3 phase 240 volt power, and cost about $500,000.
  My day to day laptop (a year old) is 35 watts, with 1 phase 20 volt power, and cost $1,000, so that's 99.98% less power consumption, 99.8% cheaper, and it has about 10 orders of magnitude more computing power, all on a time span of 80 years.
  
  3 replies →
packetlost 5 days ago
I don't see how this is even remotely true. Unless there's some super breakthrough into a fundamentally different architecture, there's not really a path to a 50% reduction in price, much less a 99% reduction.
- kilroy123 5 days ago
  
  In fairness, I think _current_ capabilities will be cheaper. So the models of today will be run drastically cheaper in 4 years.
- onlyrealcuzzo 5 days ago
  
  And yet 90% drops for the same level of quality every 18 months have happened like clockwork...
  And the technology already exists on the algorithmic front TODAY to lock in another 10x gain -> when, typically, algorithmic gains only account for ~30% of that drop and the other ~70% comes from better data (often synthetic) and knowledge distilation from frontier models.
  Just look at DeepSeek's pricing...
datakan 5 days ago
What makes you think prices will drop? Everyone I’ve spoken to believes they will only skyrocket. Genuinely curious
- onlyrealcuzzo 5 days ago
  
  The technology already exists now on the algorithmic front for the next 10x drop between everyone adopting DeepSeek's MLA, MoE (mostly already done), Medusa (a better version of Google's speculative decoding), Kimi's Attn Residuals, and Mimo's Sliding Window Attn, and (possibly) Microsoft's 1.58b (this may be a nothing burger).
  Historic trends, every 18 months, performance for the same level of quality has gone down 90%.
  See: https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co...
  And Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart...
  And here: https://epoch.ai/data-insights/llm-inference-price-trends
  Historically, algorithmic gains are only ~30% of the pie, but there's enough out there to get to 10x, with just what's available already. The other ~70% of the pie is better training data (often synthetic) and distilling frontier knowledge. There's no sign we are tapped out on that front.
  Additionally, GRAM (from ~10 days ago) is likely to be a 5-10x on its own (if not substantially more for smaller models). It's unlikely within 4 years LeCun's JEPA ideas and similar ideas like GRAM applied to LLMs have ZERO impact. The preliminary results are absolutely astounding (5000x better reasoning - this is not peanuts).
  Further, that's not even counting that cost per watt is still dropping ~2x every 2 years on its own on the hardware front.
  If you look at the "cost" of inference. People think it's electricity - but it's currently almost ~80% hardware amortization. The memory shortage is not going to last, nor are Nvidia's ~80-90% margins.
  The human brain is still 8-10 orders of magnitude more efficient than the best LLMs of today. With ~1/10th of global capex riding on AI, if you don't think they're going to knock of 2 orders of magnitude more, when it's this obvious and easy... I don't know what to tell you...
  Sure, it might take 6 years instead of 4. My crystal ball isn't perfect.
  
  5 replies →
bakugo 5 days ago
Prices have been very obviously trending up, not down. Even open weights models are becoming more expensive with every release. Computer hardware is ballooning in price.
- onlyrealcuzzo 5 days ago
  
  Prices are going up for BETTER quality -> not for the SAME level of quality.
  People are willing to pay more for BETTER quality.
  You obviously haven't seen DeepSeek v4 Pro's pricing if you think pricing only goes up...
  
  1 reply →
- abalashov 5 days ago
  
  Just wait for the next model and the next model architecture. Just wait for it, bro.
  
  4 replies →
- trollbridge 5 days ago
  
  Grab a 5090 and run Qwen 3.6 35b on it (6 parameter seems to work best for me).
  Then buy $10 (or $2, if you're cheap, and they take PayPal) of DeepSeek credits.
  Whilst you're at it spring for a Claude subscription too and GPT.
  Switch models between Qwen, DeepSeek Flash, DeepSeek Pro, and you can meet 99% of your code generation needs.
  Hop over to Opus 4.7 (or 4.8, but I haven't really used it yet) and GPT-5.5 when doing very complex architecture/design or troubleshooting something where DeepSeek Pro is getting stuck.
  It is ridiculous how cheap this stuff is now. It's affordable at third world prices.
  
  2 replies →
AllegedAlec 5 days ago
> The actual cost is going to drop 99% in ~4 years.
And fusion power is just 2 decades into the future!
- jjav 5 days ago
  
  Full self driving guaranteed here before the end of the year (every year).
mrandish 5 days ago

> The actual cost is going to drop 99% in ~4 years.
We have little visibility into current frontier model costs at mass scale. As a broad historical trend, tech costs tend to fall over longer time periods but your claim far exceeds Moore's Law rates in its heyday - and that heyday is long gone.
In 2021 TSMC announced it was increasing it's price per gate for new nodes for the first time in its history. In the past five years cutting edge nodes have delivered ~8-15% real-world performance gains on average at costs at least 10-20% more than the last node. If you're positing a string of unprecedented efficiency breakthroughs in LLM algorithms - such extraordinary claims require extraordinary evidence.