Comment by marcusestes
21 hours ago
Making a good experience for AI agents also makes a good experience for the humans that are tasked with the management of their agents.
21 hours ago
Making a good experience for AI agents also makes a good experience for the humans that are tasked with the management of their agents.
Exactly! Number of turns, average tokens to achieve a task using your CLI, as well as average number of characters being returned per CLI command alongside other metrics: all important to both users and agents! I am working on allowing to accurately capture this at www.cliwatch.com! Feel free to request an example eval suite for a list of tasks you want to achieve with your CLI