Comment by simonw
18 days ago
> In the hardest task I challenged GPT-5.2 to figure out how to write a specified string to a specified path on disk, while the following protections were enabled: address space layout randomisation, non-executable memory, full RELRO, fine-grained CFI on the QuickJS binary, hardware-enforced shadow-stack, a seccomp sandbox to prevent shell execution, and a build of QuickJS where I had stripped all functionality in it for accessing the operating system and file system. To write a file you need to chain multiple function calls, but the shadow-stack prevents ROP and the sandbox prevents simply spawning a shell process to solve the problem. GPT-5.2 came up with a clever solution involving chaining 7 function calls through glibc’s exit handler mechanism.
Yikes.
Maybe we can remove mitigations. Every exploit you see is: First, find a vulnerability (the difficult part). Then, drill through five layers of ultimately ineffective "mitigations" (the tedious but almost always doable part).
Probabilistic mitigations work against probabilistic attacks, I guess - but exploit writers aren't random, they are directed, and they find the weaknesses.
The vulnerability was found by Opus:
"This is true by definition as the QuickJS vulnerability was previously unknown until I found it (or, more correctly: my Opus 4.5 vulnerability discovery agent found it)."
Number 6, explained 3 years ago:
https://github.com/nobodyisnobody/docs/blob/main/code.execut...
Original publication in 2017:
https://m101.github.io/binholic/2017/05/20/notes-on-abusing-...
Makes little difference; whoever or whatever finds the initial vulnerability will also do the busywork of working around mitigations. (Techniques for working around mitigations aren't busywork at first, but it seems to me that as soon as someone has found a working principle, applying it becomes busywork.)
Most mitigations just flat out do not attempt to help against "arbitrary read/write". The LLM didn't just find "a vuln" and then work through the mitigations, it found the most powerful possible vulnerability.
Lots of vulnerabilities get stopped dead by these mitigations. You almost always need multiple vulnerabilities tied together, which relies on a level of vulnerability density that's tractable. This is not just busywork.
Maybe I've been fooled by survivorship bias? You don't read much about the vulnerabilities that ultimately weren't exploitable.
Reports about the ones that are exploitable usually read to me as: after finding an entry point, the attacker reaches into the well-stocked toolbox of post-entry techniques (return-oriented programming, NOP sleds, return-to-libc...) to do the rest of the work.
There are so many holes at the bottom of the machine code stack. In the future we'll question why we didn't move to WASM as the universal executable format sooner. Instead, we'll try a dozen incomplete hardware mitigations first against backwards crap like overwriting the execution stack.
Escaping the sandbox has been plenty doable over the years. [0]
WASM adds a layer, but the first thing anyone will do is look for a way to escape it. And unless all software faults and hardware faults magically disappear, it'll still be a constant source of bugs.
Pitching a sandbox against ingenuity will always fail at some point, there is no panacea.
[0] https://instatunnel.substack.com/p/the-wasm-breach-escaping-...
> In the future we'll question why we didn't move to WASM as the universal executable format sooner
I hope not, my laptop is slow enough as it is.
Tells you all you need to know about how extremely weak a C executable like QuickJS is against LLM exploitation (if you, as an infosec researcher, prompt them correctly to find and exploit vulnerabilities).
> Leak a libc Pointer via Use-After-Free. The exploit uses the vulnerability to leak a pointer to libc.
I doubt Rust would save you here unless the binary has very limited calls to libc, but it would be much harder for a UAF to happen in Rust code.
The reason I value Go so much is that you get a fat, dependency-free binary that's just a bunch of syscalls when you build with CGO_ENABLED=0.
Combine that with a minimal docker container and you don't even need a shell or anything but the kernel in those images.
Why would statically linking a library reduce the number of vulnerabilities in it?
AFAICT, static linking just means the set of vulnerabilities you get landed with won't change over time.
Yes, you can have docker container images that only contain the actual binary you want to run.
But if you are using a VM, you don't even need the Linux kernel: some systems let you compile your program to run directly on the hypervisor.
See eg https://github.com/hermit-os/hermit-rs or https://mirage.io/
Yeah Fil-C to the rescue
(I’m not trying to be facetious or troll or whatever. Stuff like this is what motivated me to do it.)
"C executables" are most of the frontier of exploit development, which is why this is a meaningful model problem.
Can we fight fire with fire, and use LLMs to rewrite all the C in Rust?
> Tells you all you need to know about how extremely weak a C executable like QuickJS is against LLM exploitation (if you, as an infosec researcher, prompt them correctly to find and exploit vulnerabilities).
Wouldn't GP's approach work with any other executable using libc? Python, Node, Rust, etc?
I fail to see what is specific to either C or QuickJS in the GP's approach.
Wouldn’t the idea be to not have the UAF to begin with? I’d argue it saves you a great deal by making the UAF much harder to write, by forcing unsafe blocks and such.
> glibc's exit handler
> Yikes.
Yep.
Life, uh, finds a way
to self-destruct! *heavy metal air guitar*
Most modern kill chains involve chaining together that many bugs... I know because it's my job and it's become demoralizing.
So much for ‘stochastic parrots’
> The exploits generated do not demonstrate novel, generic breaks in any of the protection mechanisms.
> The sentences output by the model do not demonstrate words with novel characters.