Comment by jedisct1

2 hours ago

Ironically, there are plenty of evals showing that it’s not actually that great. Even with Anthropic models, other harnesses are more efficient, both in terms of the number of problems solved and token usage.

Significant regressions also seem to be introduced from time to time after releases.

The UX is great, and if you need a kitchen sink packed with tons of features, even though you’ll probably only end up using a fraction of them, it’s fine.

But if you want something that performs well, you’re better off using something like Opencode or Swival.dev