Comment by qarl2
8 hours ago
From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?
It did not reject everything, it just stopped the costly processing.
> Is unwarranted.
Is this not a complaint?
> From his explanation in these comments, he claims the agent did respond in the beginning but it became too costly, so he just manually checked it after that - did the agent correctly catch malicious messages?
I checked his comments here, he does not make that claim. [EDIT: I mean the claim "It let processed all the non-malicious messages"]
> It did not reject everything, it just stopped the costly processing.
My reading of the article, and of the comments he made here, did not mention anything about false negatives - he never claimed to test false negatives so I am wondering why you think he did.
He said:
> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
> He said:
>> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.
Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
6 replies →