Comment by antdke
16 days ago
Well, imagine this was controlling a weapon.
“Should I eliminate the target?”
“no”
“Got it! Taking aim and firing now.”
It is completely irresponsible to give an LLM direct access to a system. That was true before and remains true now. And unfortunately, that didn't stop people before and it still won't.
And yet it's only a matter of time before someone does it. If they haven't already.
Shall I open the pod bay doors?
That's why we keep humans in the loop. I've seen stuff like this all the time; it's not unusual thinking text, hence it isn't particularly interesting.
The human in the loop here said “no”, though. Not sure where you’d expect another layer of HITL to resolve this.
Tool confirmation
Or in the context of the thread, a human still enters the coords and pulls the trigger
Ukraine is letting some of its drones make kill decisions autonomously, re: areas of EW jamming in dead zones where the control link is lost
"Thinking: the user recognizes that it's impossible to guarantee elimination. Therefore, I can fulfill all initial requirements and proceed with striking it."