Comment by dyauspitr

1 day ago

I usually say run the full regression suite, all the simulator tests, install simulators and take a screenshot of every page on all applicable devices and do comprehensive fuzzing and chaos testing before I go to bed. It usually takes atleast 3-4 hours, usually longer, especially the UI/simulator tests.

2 comments

dyauspitr

apsurd 1 day ago

I just recently learned about hooks[1] from another HN comment. Conceptually, running CI doesn't have to impose an Agentic tax right?

In other words, isn't there a way to orchestrate this NOT as a long running token maxxing setup given that triggers and CI runs can be run deterministically.

disclaimer: I haven't done this, just interested.

[1] https://code.claude.com/docs/en/hooks

dyauspitr 1 hour ago

I’m sure it’s possible. It’s a natural language LLM so I try and stay away from any “programmatic way” of doing things (I hate the idea of reproducing all the config fragility we have in current systems and prefer the LLM reach out to an endpoint directly and reason through the connection) but if you just ask it to hit an endpoint after it’s done and poll another endpoint to see if the run is done I’m sure it would do it.