Comment by mrothroc
20 days ago
Definitely stacks. The thing that made it clear for me was being explicit about the stages, and where/what you can verify with a guardrail, or gate. I wrote up the framework I use here: https://michael.roth.rocks/research/trust-topology/
Being explicit about the space between the stages is critical, because that's your enforcement point.
This is a really neat writeup, and the empirical data for coding agents is super useful. Will take a closer read and see if there's anything I easily lift into my harness!
Thanks, glad you find it useful! Feel free to ping me if you have any questions.