Comment by daxfohl
4 hours ago
I still find in these instances there's at least a 50% chance it has taken a shortcut somewhere: created a new, bigger bug in something that just happened not to have a unit test covering it, or broken an "implicit" requirement so obvious to any reasonable human that nobody thought to document it. These can be subtle precisely because you're not looking for them; no human would ever think to do such a thing.
Then even if you do catch it, AI: "ah, now I see exactly the problem. just insert a few more coins and I'll fix it for real this time, I promise!"
The value-extortion plan writes itself. How long before someone pitches the idea of models explicitly almost solving your problem, just to keep you spending? Would you even know?
That’s far-fetched. It’s in the interest of the model builders to solve your problem as efficiently as possible token-wise. High value to user + lower compute costs = better pricing power and better margins overall.
> far-fetched
Remember Google?
Once it was far-fetched that they would make the search worse just to show you more ads. Now, it is a reality.
With tokens, it is even more direct. The more tokens users spend, the more money for providers.
> It’s in the interest of the model builders to solve your problem as efficiently as possible token-wise.
Unless you’re paying by the token.
The free market proposition is that competition (especially with the Chinese labs and Grok) means Anthropic is welcome to do that. They're even welcome to illegally collude with OpenAI so that ChatGPT is similarly gimped. But switching costs are pretty low. If it turns out I can one-shot an issue with Qwen or DeepSeek or Kimi thinking, Anthropic loses not just my monthly subscription, but everyone else's that I show it to. So no, I think that's some grade-A conspiracy theory nonsense you've got there.
It’s not that crazy. It could even happen by accident in pursuit of another unrelated goal. And if it did, a decent chunk of the tech industry would call it “revealed preference” because usage went up.
To be clear, I don't think that's what they're doing intentionally. Especially on a subscription basis, they'd rather I maximize my value per token, or just not use them at all. Lulling users into burning tokens unproductively is the worst possible option.
The way agents work right now, though, just sometimes feels that way; they don't have a good way of saying "You're probably going to have to figure this one out yourself."
This is a good point. For example, if you have access to a bunch of slot machines, one of them is guaranteed to hit the jackpot. Since switching from one slot machine to another is easy, it is trivial to go from machine to machine until you hit the big bucks. That is why casinos have such large selections of them (for our benefit).
As a rational consumer, how would you distinguish between some intentional "keep pulling the slot machine" failure rate and the intrinsic failure rate?
I feel like saying "the market will fix the incentives" handwaves away the lack of information on internals. After all, look at the market response to Google making their search less reliable - sure, an invested nerd might try Kagi, but Google's still the market leader by a long shot.
In a market for lemons, good luck finding a lime.
> These can be subtle because you're not looking for them
After any agent run, I always look at the git diff between the new version and the previous one. This helps catch things you might otherwise not notice.
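That review step is easy to make a habit. A minimal sketch of it, assuming the agent leaves its changes uncommitted in the working tree (the exact commands are illustrative, not a prescribed workflow):

    git diff --stat HEAD   # quick overview: which files were touched, and how much
    git diff HEAD          # full line-by-line comparison against the last commit

If the agent commits as it goes, diffing against whatever commit preceded the run gives the same picture.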
You are using it wrong, or using a weak model, if your failure rate is over 50%. My experience is nothing like this. It very consistently works for me. Maybe there is a <5% chance it takes the wrong approach, but you can quickly steer it in the right direction.
You are using it on easy questions. Some of us are not.
I think a lot of it comes down to how well the user understands the problem, because that determines the quality of instructions and feedback given to the LLM.
For instance, I know some people have had success with getting Claude to do game development. I have never bothered to learn much of anything about game development, but have been trying to get Claude to do the work for me. Unsuccessfully. It works for people who understand the problem domain, but not for those who don't. That's my theory.
Don't use it for hard questions like this, then; you wouldn't use a hammer to cut a plank, you'd try to make a saw instead.