Comment by jedisct1

25 days ago

Claude is good at writing code, not so good at reasoning, and I would never trust or deploy to production something solely written by Claude.

GPT-5.2 is not as good for coding, but much better at thinking and finding bugs, inconsistencies and edge cases.

The only decent way I found to use AI agents is by doing multiple steps between Claude and GPT, asking GPT to review every step of every plan and every single code change from Claude, and manually reviewing and tweaking questions and responses both way, until all the parties, including myself, agree. I also sometimes introduce other models like Qwen and K2 in the mix, for a different perspective.

And gosh, by doing so you immediately realize how dumb, unreliable and dangerous code generated by Claude alone is.

It's a slow and expensive process and at the end of the day, it doesn't save me time at all. But, perhaps counterintuitively, it gives me more confidence in the end result. The code is guaranteed to have tons of tests and assurance for edge cases that I may not have thought about.

0 comments

jedisct1

No comments yet

Contribute on Hacker News ↗