Comment by tyingq

5 days ago

The abrupt swing in many non-technology company IT departments from "hey developer, you aren't using enough tokens" to this is just too funny.

And I'm seeing almost no self-awareness from leaders. They are making decisions about things that they just don't understand. And are completely unworried about it. Just blindly following whatever the news cycle is about AI.

The closer people live to the consequences of their decisions the more rational they become. Until leaders(and I use that term loosely) are held accountable, the insanity will continue.

  • Their only accountability is to the stock price. The insanity will continue.

    • As long as our stock price continues to... Continues to rise... Which... Hmm... I'm just now reading our balance sheet. Is this number right? Great, thanks.

      As I was saying, you're all fired.

    • I’m willing to bet that most of us here are capable of acquiring pitchforks and torches.

      I predict that will be their comeuppance; it will begin a new era in history.

      2 replies →

  • In addition to being true, this observation is profound. When designing any multi-step system that relies on humans making decisions, whether in governance, organizations or economies, placing root causes as close to end effects as possible is almost always better.

  • I’m sorry you are used to working with out of touch leadership. Not all companies are like that. Even big ones can have smart, empathetic leaders. Although very often money gets in the way of empathy.

    • Money alao has the problematic tendency to warp the people around you, it's its own kind of gravity. The more powerful you are the more you attract yesmannerism and the more you lose touch with what's going on.

    • Also notably these attributes don’t make one infallible. I see a lot of engineers judging from the sidelines without any sense of how to run large orgs and how you have to make tough calls with imperfect info all the time.

    • Being out of touch is the default state for leadership. They mostly just parrot the news with a multi-month lag.

I've been enjoying journalist Ed Zitron's recent diatribes about how impossible it is to find a business leader who had a plan for measuring their ROI from adopting AI coding.

What he says he's consistently hearing from them mirrors what I saw at my own employer: they thought they had ROI metrics, but they actually only had usage metrics such as "lines of code committed" or "number of pull requests". The only way those could possibly work as an ROI measure is if your business charges customers by the line of code.

  • What they really means is they previously had no valid metric to measure productivity of developers before either. AI or not.

    • Measuring productivity of developers isn’t really in line with what needs to happen, either. A team can be incredibly productive and still generate negative 100% ROI if what they are building so industriously is stuff that nobody wants to buy.

      Which reflects another thing I’ve seen at work. A lot of what AI coding has enabled is diving headfirst into quagmires. Our costs have spiked - not just because of the token spend, also because we gotta pay the cloud platform to run all these new services, operators to operate them, marketers to market them, etc. - but revenue hasn’t budged.

    • But at least pre AI, most managers presumably subjectively measured devs on relevant performance. Using systems where employees who burn the most tokens ($) per week ‘win’ is crazy - just ask the AI to spin up a subagents to implement every conceivable approach to a task, then spin up n agent judge to pick the winner, and repeat. You've immediately got 50x or whatever your previous usage from that alone.

      1 reply →

During ZIRP they discovered that the way to lead companies nowadays is to become a maxxer of whatever current fad is, and the more you maxx the better. And then when things change and you're wrong, you'll be a strong leader and, in ZIRPs case fire everyone you over-hired, with AI will be similar.

Why be a normal guy that waits to see what happens and is measured and pragmatic when you can get attention basically through the whole cycle by being the earliest adopter, adopt it to the maxx, then also be the loudest big brain when the tide changes and be praised for "taking hard decisions" when you revert everything you said so far?

The fakemaxxing economy.

  • A special case of the more general cringe economy we're in. The dumbest, most outrageous ideas win, amplified by social media. Say stupid sh*t loudly, be wrong, profit.

Groups resist to change - the bigger the group, the most resistance there is.

As a leader, pushing for rapid change cannot really be nuanced lest the push dissipates into the organization's entropy.

  • Perhaps, but the change you get (if any) is most likely to be what you push for and reward/punish.

    It's irrational to push for tokenmaxxing (literally "please increase our AI spending") and not expect that this is the result you are going to get. You won't get productivity increase, since that is not what you are pushing for - you will get token usage maximization (engineers running inane agentic tasks against your code base to increase usage, using company paid AI for their side projects, etc, etc).

    • The evidence suggests that many tech leaders do not realize that an immediate result of heavy handed uninformed top down decision making is transforming the “work together, succeed together, giving quality” ethos into a cynical game theory minimax effort to game whatever stupid arbitrary metrics are used to implement the top down fad of the quarter; do it consistently and you get a work force that can be given a metric and immediately, instinctively, tell you how the work flow will be adjusted for the new metric, and where the difficult problems will be shunted to.

    • I'm not sure the leaders would disagree with what you're saying. They tokenmaxxed to understand what it looks like when AI gets into every corner of the business; now they feel they've gotten enough info (or at least that more info wouldn't be worth the cost), so they're adding in cost controls. As the article says, this is not great for AI model providers trying to predict what their future revenue is going to be, but it's not obvious that there's any mistake here for AI users.

      6 replies →

I feel like most successful businesses have such a moat of required capital to compete with them that even tho in theory poor decisions like this is supposed to give opportunities for entreprenuers to hit when the big dogs make a wrong move, it doesn't end up happening.

> leaders

Don’t play their game and call them leaders. They are management, bosses, executives.

> They are making decisions about things that they just don't understand. And are completely unworried about it.

Clowns, even.

> Just blindly following whatever the news cycle is about AI.

But followers might be most apt.

——

This is such a huge pet peeve of mine. Describing management goofs using their language that makes them sound all-so-brilliant. We constantly watch these people do the dumbest shit and then they go around describing themselves as “thought leaders” and “servant leaders”. When, really, most are just clowns with fragile egos.

And, while I’m rambling, they’ve tried to take away the fact we are workers by calling us individual contributors. Using language to attempt and hide the hierarchy and power dynamic at play. It just…bothers me so much.

  • I don't hear them refer to themselves as "job creators" much these days.

    And many of them still claim they are "risk takers", but have effectively insulated themselves from risk by socializing losses.

  • > Don’t play their game and call them leaders. They are management, bosses, executives.

    You're falling into a common trap here: the ambiguity of the English language.

    "Leader" means multiple different things. Yes, it means someone who has leadership qualities—who genuinely inspires those around them to do better, or who boldly marches into the unknown and gets people to follow them.

    But it also means "someone in charge of a thing."

    Now it's certainly true that many people in charge of things who are also really bad at actually inspiring or getting people to follow them (aside from with threats of destitution) also play on that ambiguity to try to convince people that because they're in charge of things, they must also be Good Leaders, and that's crappy...but yelling at others for using the term casually is very much an "old man yells at cloud" situation.

The worst part is that techies can still work around the insanity if they keep their opinions private. For the serious average Joe the AI mandates must be feeling like hell on earth.

I once worked in a company that had soviet-level efforts to push LLMs into everything, someone eventually made the classic "Natural Language -> SQL Query -> Magic Result in webpage" and got promoted, the tool got mandated for every non-tech employee as part of an AI-boosting effort (people pushing metrics up).

One day I wake up with a product person in despair because the tool couldn't handle what looked like a very simple aggregation, I stopped what I was doing, crafted a 30-line SQL query over HORRIBLE TABLES, a couple CTEs and window functions here and there got him what he wanted. I found out later that single query that took 30 minutes to make saved him from inheriting a 6-month effort to create a microservice dedicated to patching said tool.

I've never seen self-awareness from leaders. They always lead on vibes.

Understanding this was one of the most important things in my career.

Having studied control theory I think it makes perfect sense. When trying to make a system target a new level it's quite natural for there to be overshoot that needs to be reigned in. It's also natural for the correction to go too far and need to be corrected in turn. This is not indicative of stupidity it's completely normal.

It would only be laughable if they waited way too long to reverse course, but I don't think that's the case.

  • Suppose I'm driving at 20 kph, and I set my cruise control to 40 kph. My car then goes WOT, overshoots my target speed and hits 120 kph, at which point it slams on the brakes[0], dropping my speed to 15 kph. It repeats until it finally settles at my target speed. (Rhetorical question) would that be considered "completely normal"?

    Over/undershoots and corrections are of course unavoidable and normal; the absurdity is at the magnitude and rate of change. Furthermore, this is giving it the benefit of the doubt, that measuring AI spend is a good indicator; that's arguably also in dispute. To stretch my car analogy a bit more: it would be like the cruse control system has to hit the target speed, but it only has data from the O2 sensors.

    [0] I know that the "classic" cruise control system cannot apply the brakes, but hey no analogy's perfect.

  • It's not like they accidentally overshot, they were telling people to tokenmax, they didn't even know you could overshoot they thought it was exponential gains all the way. Subtle ideas like balance were not on their minds.

The actual cost is going to drop 99% in ~4 years.

How much that makes it into enterprise pricing is TBD, since none of the hyper scalers are making money yet of selling AI inference.

Almost all businesses are ahead of the gun. For most of their use cases, AI is either not yet good enough on its own, or good enough but too expensive.

No one wants to get left behind, so everyone's trying to get onto it now, even though it's not ready for what most enterprises want to do with it.

It's easy for them to look at a small startup without billions of lines of legacy business logic debt and see them having success and wonder why they can't have just as much - or more - why they're bigger so they should have better and more success, right???

Wrong...

But when it gets ~99% cheaper for local inference over the next 4 years, at the same time the price per watt improve 4x -> a lot of those cases will start to pencil out.

  • > The actual cost is going to drop 99%

    Do you mean the marginal cost by the producer, or the cost on the consumer? I can't see the price of electricity falling much, and the demand curve is apparently exponential if the hype is to be believed.

    • DeepSeep V4 Pro is 99% cheaper than similarly performing models were 2 years ago (if such a model even existed).

      Computing has always been about how to wring out more efficiency. The ENIAC was 150,000 watts, with 3 phase 240 volt power, and cost about $500,000.

      My day to day laptop (a year old) is 35 watts, with 1 phase 20 volt power, and cost $1,000, so that's 99.98% less power consumption, 99.8% cheaper, and it has about 10 orders of magnitude more computing power, all on a time span of 80 years.

      3 replies →

  • I don't see how this is even remotely true. Unless there's some super breakthrough into a fundamentally different architecture, there's not really a path to a 50% reduction in price, much less a 99% reduction.

    • In fairness, I think _current_ capabilities will be cheaper. So the models of today will be run drastically cheaper in 4 years.

    • And yet 90% drops for the same level of quality every 18 months have happened like clockwork...

      And the technology already exists on the algorithmic front TODAY to lock in another 10x gain -> when, typically, algorithmic gains only account for ~30% of that drop and the other ~70% comes from better data (often synthetic) and knowledge distilation from frontier models.

      Just look at DeepSeek's pricing...

  • What makes you think prices will drop? Everyone I’ve spoken to believes they will only skyrocket. Genuinely curious

    • The technology already exists now on the algorithmic front for the next 10x drop between everyone adopting DeepSeek's MLA, MoE (mostly already done), Medusa (a better version of Google's speculative decoding), Kimi's Attn Residuals, and Mimo's Sliding Window Attn, and (possibly) Microsoft's 1.58b (this may be a nothing burger).

      Historic trends, every 18 months, performance for the same level of quality has gone down 90%.

      See: https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co...

      And Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart...

      And here: https://epoch.ai/data-insights/llm-inference-price-trends

      Historically, algorithmic gains are only ~30% of the pie, but there's enough out there to get to 10x, with just what's available already. The other ~70% of the pie is better training data (often synthetic) and distilling frontier knowledge. There's no sign we are tapped out on that front.

      Additionally, GRAM (from ~10 days ago) is likely to be a 5-10x on its own (if not substantially more for smaller models). It's unlikely within 4 years LeCun's JEPA ideas and similar ideas like GRAM applied to LLMs have ZERO impact. The preliminary results are absolutely astounding (5000x better reasoning - this is not peanuts).

      Further, that's not even counting that cost per watt is still dropping ~2x every 2 years on its own on the hardware front.

      If you look at the "cost" of inference. People think it's electricity - but it's currently almost ~80% hardware amortization. The memory shortage is not going to last, nor are Nvidia's ~80-90% margins.

      The human brain is still 8-10 orders of magnitude more efficient than the best LLMs of today. With ~1/10th of global capex riding on AI, if you don't think they're going to knock of 2 orders of magnitude more, when it's this obvious and easy... I don't know what to tell you...

      Sure, it might take 6 years instead of 4. My crystal ball isn't perfect.

      5 replies →

  • Prices have been very obviously trending up, not down. Even open weights models are becoming more expensive with every release. Computer hardware is ballooning in price.

    • Prices are going up for BETTER quality -> not for the SAME level of quality.

      People are willing to pay more for BETTER quality.

      You obviously haven't seen DeepSeek v4 Pro's pricing if you think pricing only goes up...

      1 reply →

    • Grab a 5090 and run Qwen 3.6 35b on it (6 parameter seems to work best for me).

      Then buy $10 (or $2, if you're cheap, and they take PayPal) of DeepSeek credits.

      Whilst you're at it spring for a Claude subscription too and GPT.

      Switch models between Qwen, DeepSeek Flash, DeepSeek Pro, and you can meet 99% of your code generation needs.

      Hop over to Opus 4.7 (or 4.8, but I haven't really used it yet) and GPT-5.5 when doing very complex architecture/design or troubleshooting something where DeepSeek Pro is getting stuck.

      It is ridiculous how cheap this stuff is now. It's affordable at third world prices.

      2 replies →

  • > The actual cost is going to drop 99% in ~4 years.

    And fusion power is just 2 decades into the future!

    • Full self driving guaranteed here before the end of the year (every year).

  • > The actual cost is going to drop 99% in ~4 years.

    We have little visibility into current frontier model costs at mass scale. As a broad historical trend, tech costs tend to fall over longer time periods but your claim far exceeds Moore's Law rates in its heyday - and that heyday is long gone.

    In 2021 TSMC announced it was increasing it's price per gate for new nodes for the first time in its history. In the past five years cutting edge nodes have delivered ~8-15% real-world performance gains on average at costs at least 10-20% more than the last node. If you're positing a string of unprecedented efficiency breakthroughs in LLM algorithms - such extraordinary claims require extraordinary evidence.