Comment by jatora
22 days ago
I was confused by him basically inventing his own skills but I guess this is from Nov 2025 so makes sense as skills were pretty new at that point.
Also please note this is nowhere on the terminal bench leaderboard anymore. I'd advise everyone reading the comments here to be aware of that. This isn't a CLI to use. Just a good experiment and write up.
It's batteries-not-included, by design. Here's what it looks like with batteries (and note who owns this repo):
https://github.com/mitsuhiko/agent-stuff/tree/main
Perhaps benchmarks aren't the best judge.
I don’t follow nor use pi so no horse in this race, but I think the results were never submitted to terminal bench? not sure how the process works exactly but it’s entirely missing from the benchmark. is this a sign of weakness? I honestly don’t know.