Comment by boh

2 days ago

I think the big secret is that AI is just software. In the same way that a financial firm doesn't all of a sudden make a bunch of money because Microsoft shipped an update to Excel, AI is inert without intention. If there are any major successes in AI output, it's because a person got it to do that. Claude Code is great, but it will also wipe out a database even though it's instructed not to (I can confirm from experience). The idea that there's some secret innovation that will come out any minute doesn't change the fact that it's software that requires human interaction to work.

Yes, and it has been said since day one of LLMs that all we need to do is keep things that way: no action without human intervention, just as it was said that you should never grant an AI direct access to change your production systems. But the stories of people who have done exactly that, and had their systems damaged and deleted, show that people aren't even trying to keep such basic safety nets in place.

AI is getting strong enough that if people give it some general direction as well as access to production systems of any kind, things can go badly. It is not true that all implementations of agentic AI require human intervention for every action.

  • My cynical rule of thumb: By default we should imagine LLMs like javascript logic offloaded into a stranger's web-browser.

    The risks are similar: No prompts/data that go in can reliably be kept secret; A sufficiently-motivated stranger can have it send back completely arbitrary results; Some of those results may trigger very bad things depending on how you use or even just display them on your own end.

    P.S. This conceptual shortcut doesn't quite capture the dangers of poisoned data, which could sabotage all instances even when they happen to be hosted by honorable strangers.
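
    A minimal sketch of the posture that analogy implies, with hypothetical names; the point is the default-deny stance, not these specific checks:

      import html
      import shlex

      ALLOWED_COMMANDS = {"ls", "cat", "grep"}  # default-deny allowlist

      def render(model_output: str) -> str:
          # Treat it like strings arriving from a stranger's browser:
          # escape before display so it can't smuggle markup past you.
          return html.escape(model_output)

      def maybe_run(model_output: str) -> None:
          # Never hand model output to a shell unchecked.
          argv = shlex.split(model_output)
          if not argv or argv[0] not in ALLOWED_COMMANDS:
              raise PermissionError(f"refusing to run: {model_output!r}")
          # subprocess.run(argv) would go here, still sandboxed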

  • Eh, these same people will attach OpenClaw to production systems soon and destroy their own companies.

    • The problem is, out of ten companies who take this approach, nine will indeed destroy themselves and one will end up with a trillion-dollar market cap. It will outcompete hundreds of companies who stuck with more conservative approaches. Everybody will want to emulate company #10, because "it obviously works."

      I don't see any stabilizing influences on the horizon, given how much cash is sloshing around in the economy looking for a place to land. Things are going to get weird, stupid, and chaotic, not necessarily in that order.

An LLM is a distribution. Unlike a Python script or a Turing machine, an LLM is capable of generating any series of tokens. Developers need to stop reasoning about LLM agents as deterministic and start thinking about agents in terms of Monte Carlo and Las Vegas algorithms. It isn't enough to have an agent; it also requires a cheap verifier.
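
A minimal sketch of the reframing, with a toy model call and verifier standing in for the real things (all names here are hypothetical):

  import random

  # Las Vegas-style wrapper: keep sampling until the cheap verifier
  # accepts, so the answer is always correct but the runtime is random.
  def generate_verified(prompt, call_llm, verify, max_tries=10):
      for attempt in range(1, max_tries + 1):
          candidate = call_llm(prompt)   # nondeterministic draw from the model
          if verify(candidate):          # cheap, deterministic check
              return candidate, attempt
      raise RuntimeError(f"no candidate passed verification in {max_tries} tries")

  # Toy stand-ins: a "model" that is right 60% of the time, and a
  # verifier that costs almost nothing compared to generation.
  flaky_model = lambda prompt: "4" if random.random() < 0.6 else "5"
  is_correct = lambda answer: answer == "4"

  answer, tries = generate_verified("what is 2 + 2?", flaky_model, is_correct)
  print(f"got {answer!r} after {tries} draw(s)")

Drop the retry cap and it's a true Las Vegas algorithm (always correct, randomly slow); keep it and you have a Monte Carlo one (bounded time, small chance of failure).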

If I were a Ph.D. student today, I'd probably do a thesis on cheap verifiers for LLM agents. Since LLM agents are not reliable, and therefore not very useful, without one, that is a trillion-dollar problem.

Once a developer groks that concept, the agents stop being scary and the potential is large.

  • > If I were a Ph.D. student today, I'd probably do a thesis on cheap verifiers for LLM agents. Since LLM agents are not reliable, and therefore not very useful, without one, that is a trillion-dollar problem.

    PhD theses are (ideally) for setting a new standard in some research area (in the end, you build your thesis out of the deep emotional shards of a completely destroyed life dream), not for some personal self-discovery project that you hope will turn you into the popular kid on the block.

    • That is like telling students to never do a PhD thesis on superscalar out-of-order execution, stochastic gradient descent, or UDP. I'm framing it as an analogous problem. What is missing is a cheap verification process.

      2 replies →

  • If you told a programmer 30 years ago that someday we'd switch from a deterministic to nondeterministic paradigm for programming computers, they'd ask if we'd put lead back in the drinking water.

    • We'd just explain that management told us we had to and then they'd understand.

    • Right? I get a kick out of programming going from:

      put this exact value inside this exact register at exactly the right time, and all the tedious exactness that C required,

      to now:

      "pretty please, can you not do that and fix the bug somewhere a different way"

    • It has always been non-deterministic, but we relied on low-level engineers who knew the dark magicks to keep the horrors at bay.

      Bit flips in memory are super common. Even CPUs sometimes output the wrong answer for a calculation because of random chance. Network errors are common; at scale you'll see data corruption across a LAN often enough that you'll quickly implement application-level retries, because somehow the network-level stuff still lets errors through.

      Some memory chips are slightly out of timing spec. This manifests itself as random crashes, maybe one every few weeks. You need really damn good telemetry to even figure out what is going on.

      Compilers do indeed have bugs. Native developers working in old hairy code bases will confirm, often with stories of weeks spent debugging what the hell was going on before someone figured out the compiler was outputting incorrect code.

      It is just that the randomness has been so rare, or the effects so minor, that it has all been, mostly, an inconvenience. It worries people working in aviation or medical equipment, but otherwise people accept the need for an occasional reboot or they don't worry about a few pixels in a rendered frame being the wrong color.

      LLMs are uncertainty amplifiers. Accept a lot of randomness and in return you get a tool that was pure sci-fi bullshit 10 years ago. Hell, when reading science fiction nowadays I'm literally going, "well, we have that now, and that, oh yeah, we got that working, and I think I just saw a paper on that last week."

      6 replies →

My observation is that the true believers really don't want to think of models as an inert pile of weights. There's some mysticism attached to imagining it's the ship's computer from Star Trek, HAL-9000 or C-3PO. A file loaded into memory and executed over is just so... _pedestrian_.

  • Canonically, the Star Trek computers have pretty much always been just computers, not themselves sentient because the software running on them just isn't.

    I'm still not sure if HAL-9000 was supposed to be conscious or just an interesting plot device with a persona as superficial as LLMs are dismissed as today.

    LLMs could definitely play the part of all three of your examples, given the flaws they showed on-screen. Could even do a decent approximation of Data (though perhaps not Lore without some jailbreaking).

    Still weird that even the best of them isn't really ready to be KITT.

I think the market isn't for any of us, just for other businesses. We're all ants trying to understand how AI is going to eradicate the lower levels of society.

Just call the errors 'consciousness' and keep selling those tokens! Let the Spineless Generation have their last bubble!

> doesn't change the fact that it's software that requires human interaction to work.

Have you ever seen Claude Code launch a subagent? You've used it, right? You've seen it launch a subagent to do work? You understand that that is, in fact, Claude Code running itself, right?

  • I don't think subagents are representative of anything particularly interesting on the "agents can run themselves" front.

    They're tool calls. Claude Code provides a tool that lets the model say effectively:

      run_in_subagent("Figure out where JWTs are created and report back")
    

    The current frontier models are all capable of "prompting themselves" in this way, but it's really just a parlor trick to help avoid burning more tokens in the top context window.

    It's a really useful parlor trick, but I don't think it tells us anything profound.
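
    To make "they're tool calls" concrete, here's roughly the shape of the loop, as a sketch with hypothetical names rather than Claude Code's actual internals:

      # The "subagent" is just a function the host exposes to the model.
      # A fresh context goes in, a summary comes out.
      def run_in_subagent(task: str, call_llm) -> str:
          sub_context = [{"role": "user", "content": task}]  # parent's tokens never enter
          return call_llm(sub_context)["content"]            # only the summary comes back

      def agent_loop(goal: str, call_llm) -> str:
          context = [{"role": "user", "content": goal}]
          while True:
              reply = call_llm(context)
              if reply.get("tool") == "run_in_subagent":     # model chose to delegate
                  summary = run_in_subagent(reply["tool_input"], call_llm)
                  context.append({"role": "tool", "content": summary})
              else:
                  return reply["content"]                    # plain answer: we're done

    The delegation is real, but the outer while loop is still owned by the host process a human started.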

    • The mechanism being simple is the interesting part. If one large complex goal can be split into subgoals and the subgoals completed without you, then you need a lot fewer humans to do a lot more work.

      The OP says AI requires human interaction to work. This simply isn't true. You know yourself that as agents get more reliable you can delegate more to them, including having them launch more subagents, thereby getting more work done, with fewer and fewer humans. The unlock is the Task tool, but the power comes from the smarter and smarter models actually being able to delegate hierarchical tasks well!

      3 replies →

  • My Linux server runs a cron job that can spin off a thread and even use other ~apps~ tools. Did I invent AGI?

    • Does your Linux server decide what processes it should launch at what time, with a theory of what will happen next, in order to complete a goal you specified in natural language? If so, then yes, I reckon you sure have!

      9 replies →

    • Maybe. But probably not. It doesn't matter if it's AGI though. If those other apps and tools do simple things that are predictable, then we can be pretty sure what will happen. If those tools can modify their own configuration and create new cron jobs, it becomes much harder to say anything about what will happen.

      1 reply →

  • My Claude has never yet launched itself from my terminal, given itself a prompt, and then gotten to work. It has only ever spawned a subagent after I had given it a prompt. It was inert until a human got involved.

    If that is software running itself, then an if statement that spawns a process conditionally is running itself.

  • Substance aside, I feel this comment is combative enough to be considered unhelpful. Patronizing and talking down to others convinces no one and only serves as a temporary source of emotional catharsis and a less temporary source of reputational damage.

  • You're using it, and if someone else were using it, the output would be different. The point really is that simple.

  • All AI requires steering as the results begin to decohere and self-enshittify over time.

    AI in the hands of an expert operator is an exoskeleton. AI left alone is a stooge.

    Nobody has built an all-AI operator capable of self-direction and choices superior to a human expert. When that happens, you'd better have your debts paid and bunker stocked.

    We haven't seen any signs of this yet. I'm totally open to the idea of that happening in the short term (within 5 years), but I'm pessimistic it'll happen so quickly. It seems as though there are major missing pieces of the puzzle.

    For now, AI is an exoskeleton. If you don't know how to pilot it, or if you turn the autopilot on and leave it alone, you're creating a mess.

    This is still an AI maximalist perspective. One expert with AI tools can outperform multiple experts without AI assistance. It's just got a much longer time horizon on us being wholly replaced.

  • A one-liner shell script can run itself.

    • One-liner shell scripts can be analyzed. Some of them can be determined not to delete the production database. The others will not be executed.
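
      A sketch of the difference, with toy patterns (a real policy check would be an allowlist, not this denylist): the one-liner's entire behavior is on the table before it runs, while an LLM's next token isn't.

        import re

        FORBIDDEN = [r"\brm\s+-rf\b", r"\bDROP\s+TABLE\b", r"\bTRUNCATE\b"]

        def safe_to_run(one_liner: str) -> bool:
            # Static check: the whole program is right there to inspect.
            return not any(re.search(p, one_liner, re.IGNORECASE) for p in FORBIDDEN)

        print(safe_to_run("ls -la /var/log"))              # True
        print(safe_to_run("psql -c 'DROP TABLE users;'"))  # False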