Comment by lelanthran

7 hours ago

> From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?

I checked his comments here, he does not make that claim. [EDIT: I mean the claim "It let processed all the non-malicious messages"]

> It did not reject everything, it just stopped the costly processing.

My reading of the article, and of the comments he made here, did not mention anything about false negatives - he never claimed to test false negatives so I am wondering why you think he did.

8 comments

lelanthran

qarl2 6 hours ago

He said:

> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.

lelanthran 6 hours ago
> He said:
>> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.
Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
- qarl2 6 hours ago
  
  > That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.
  On the contrary - I think the most reasonable interpretation of his words is that he did use it via emailing it. But like I said at the beginning, I could be wrong. It will be interesting to see what he says when he returns to the conversation.
  > Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
  No one is contesting that point, only that it is applicable.
  
  5 replies →