Comment by qarl2

8 hours ago

From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?

It did not reject everything, it just stopped the costly processing.

> Is unwarranted.

Is this not a complaint?

9 comments

qarl2

lelanthran 8 hours ago

> From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?

I checked his comments here, he does not make that claim. [EDIT: I mean the claim "It let processed all the non-malicious messages"]

> It did not reject everything, it just stopped the costly processing.

My reading of the article, and of the comments he made here, did not mention anything about false negatives - he never claimed to test false negatives so I am wondering why you think he did.

qarl2 8 hours ago
He said:
> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
- lelanthran 8 hours ago
  
  > He said:
  >> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
  That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.
  Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
  
  6 replies →