← Back to context

Comment by saagarjha

14 hours ago

Your ideas do not work against people who are trying to be malicious.

6 comments

saagarjha

Reply

insanitybit 14 hours ago

Oh. Yes they do.

saagarjha 14 hours ago
And your reason for believing this is…
- insanitybit 14 hours ago
  
  1. We've seen LLMs detect existing supply chain attacks when pointed at malicious install scripts. This is direct, empirical support for my position.
  2. We have a long history of using heuristic technologies to detect attacks. We can infer that other heuristic technologies can be combined in a successful manner.
  3. Shortcomings of LLMs are directly addressed by removing attacker controlled information from the input, which I specifically called out (using tools like grep for pattern matching + using sub agents to isolate contexts). This has been demonstrated already in a number of ways - feeding the LLM derived facts instead of attacker controlled data is the well worn path to avoiding injection attacks.
  
  3 replies →