Comment by davideuler

2 months ago

I created a dashboard for the stability of OpenClaw and Hermes. It shows a stability score (10 is the most stable), which is calculated by having GPT analyze GitHub issues.

Lots of friends have asked me which versions of OpenClaw/Hermes are recommended as stable. I had no clue, and I don't update my OpenClaw/Hermes very often, so that I avoid landing on unstable versions. So I created the Agent Watch dashboard.

https://agentwatch.aicompass.dev/

Very cool. How do you classify negative signals?

  • GPT analyzes each issue to decide whether it is negative, and also whether it relates to core features. I iterated on it several times, and the dashboard now seems more reasonable than the initial version. I will open source the project soon so that others can contribute to building a better stability dashboard for the agents we use daily.
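
For readers wondering how two labels per issue (negative or not, core-feature or not) could turn into a 0 to 10 score, here is a minimal sketch. It is not the dashboard's actual formula, which has not been published. The `IssueLabel` type, the `stability_score` function, and the weights are all hypothetical, and the labels are assumed to have already come back from the model.

```python
from dataclasses import dataclass


@dataclass
class IssueLabel:
    """Hypothetical per-issue labels, assumed to be produced by the model."""
    is_negative: bool  # the issue reports a problem
    is_core: bool      # the issue relates to a core feature


def stability_score(labels, core_weight=3.0, other_weight=1.0):
    """Map labeled issues to a 0-10 score, where 10 is the most stable.

    Assumption: negative core-feature issues count more than other
    negative issues. The weights are illustrative, not the real ones.
    """
    if not labels:
        return 10.0  # no issues means no negative signals
    # Sum the weights of the negative issues only.
    penalty = sum(
        core_weight if label.is_core else other_weight
        for label in labels
        if label.is_negative
    )
    # Worst case: every issue is a negative core-feature issue.
    worst = core_weight * len(labels)
    return round(10.0 * (1.0 - penalty / worst), 1)


labels = [
    IssueLabel(is_negative=True, is_core=True),
    IssueLabel(is_negative=True, is_core=False),
    IssueLabel(is_negative=False, is_core=False),
    IssueLabel(is_negative=False, is_core=True),
]
print(stability_score(labels))  # 6.7
```

Normalizing by the worst case keeps the score comparable between versions with different numbers of issues, which is one plausible reason to prefer it over a raw count.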