Comment by alexhans

12 hours ago

I'd say that's almost fine if they can start expressing intent correctly and thinking about what good looks like. They (or some automated thing, if you're building "think for them" type products instead of "give them tools and teach them to think about how to use them") can then freeze determinism more and more where useful.

I wrote this to help people (not just devs) reason about agent skills:

https://alexhans.github.io/posts/series/evals/building-agent...

And this one to address the drift from non-determinism (though depending on the audience it might not resonate as much):

https://alexhans.github.io/posts/series/evals/error-compound...