Comment by wongarsu

22 days ago

If I tell it to implement something, it will sometimes declare its work done before it's done. But if I give Claude Code a verifiable goal, like making the unit tests pass, it will work tirelessly until that goal is achieved. I don't always like the solution, but the tenacity everyone is talking about is there.

> but the tenacity everyone is talking about is there

I always double-check that it hasn't simply excluded the failing test.

The last time this happened, I only discovered it later in the process. When I pointed it out to the LLM, it responded that it had acknowledged the fact of ignoring the test in CLAUDE.md, and that this was justified because [...]. In other words: "known issue, fuck off."

Tools in a loop, people, tools in a loop.

If you don't give the agent the tools to deterministically test what it did, you're just vibe coding in its worst form.
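
For what it's worth, here is a rough sketch of what a deterministic check could look like if you also want to catch the "just exclude the failing test" trick. It assumes Python's standard `unittest`; the `strict_gate` helper, the `expected_count` parameter, and the `Demo` test case are all made up for illustration and are not part of any agent tooling. The idea is that the goal only counts as met if the expected number of tests ran and none were skipped or failed.

```python
import unittest
from typing import Tuple


def strict_gate(suite: unittest.TestSuite, expected_count: int) -> Tuple[bool, str]:
    """Hypothetical gate: pass only if every expected test ran and none were skipped or failed."""
    result = unittest.TestResult()
    suite.run(result)
    # A deleted or deselected test shows up as a lower run count.
    if result.testsRun < expected_count:
        return False, f"only {result.testsRun} of {expected_count} tests ran"
    # A test marked with @unittest.skip still counts as run, so check skips separately.
    if result.skipped:
        return False, f"{len(result.skipped)} test(s) skipped"
    broken = len(result.failures) + len(result.errors)
    if broken:
        return False, f"{broken} test(s) failed"
    return True, "ok"


class Demo(unittest.TestCase):
    """Illustrative test case with one test 'fixed' by skipping it."""

    def test_passes(self):
        self.assertEqual(1 + 1, 2)

    @unittest.skip("known issue")
    def test_excluded(self):
        self.fail("never runs")


suite = unittest.defaultTestLoader.loadTestsFromTestCase(Demo)
ok, reason = strict_gate(suite, expected_count=2)
print(ok, reason)
```

With a gate like this in the loop, a suite that goes green only because a test was skipped or removed is rejected, and the agent gets the reason back as feedback instead of a false "done". The expected count has to live somewhere the agent can't quietly edit, otherwise you are back to trusting it.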