Comment by qarl2

7 hours ago

From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?

It did not reject everything, it just stopped the costly processing.

> Is unwarranted.

Is this not a complaint?

> From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?

I checked his comments here, he does not make that claim. [EDIT: I mean the claim "It let processed all the non-malicious messages"]

> It did not reject everything, it just stopped the costly processing.

My reading of the article, and of the comments he made here, did not mention anything about false negatives - he never claimed to test false negatives so I am wondering why you think he did.

  • He said:

    > Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.

    • > He said:

      >> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.

      That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.

      Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.

      6 replies →