Comment by manwe150
6 days ago
Having seen the steps an LLM agent will already take to work around any limitations it has been instructed to follow, I wouldn't be surprised if a malicious actor didn't even have to ask for that: the code agent would just apply ROT-13 itself once it detects that the initial plain-text exfiltration failed.