Comment by dyauspitr
7 hours ago
Yes. SQL querying with standard inbuilt anti injection code when retrieving the transactions that it can write itself.
7 hours ago
Yes. SQL querying with standard inbuilt anti injection code when retrieving the transactions that it can write itself.
What kind of "standard inbuilt anti injection code" are you referring to? Mysql_real_escape_string()?
Look up "prepared statements", it's pretty well documented.
How does this prevent prompt injection described in the article?
How does it prevent DDOSing and/or exposing the database from an injected prompt?
The user asks for details of the last transaction, the user gets back the amount, the source, and the description in a safely quoted format with the LLM never reading it.
You can't inject the LLM if it doesn't see the data.
An architecture like this won't work in many situations, but it can work for a lot of simple questions.
And if you want the LLM to summarize things, you run an isolated instance that makes a summary and you never show that summary to the LLM that's following the user's instructions.
You can do this, it is useful, but it's just not the same as where the goalposts are now which is: the AI is a person in a box and can do everything a person can.
If we actually limit them to "only accepts tiny ultra well defined problems and ultra well defined outputs" then theycease being a $10T/year idea and become a merely $10B/year idea.
Thus, it is not exactly popular at the moment.
> The user asks for details of the last transaction, the user gets back the amount, the source, and the description in a safely quoted format
What's "safely quoted format" when prompt injection is already safe in the description?
> You can't inject the LLM if it doesn't see the data.
How doesn't it see the data when you literally say "The user asks for details of the last transaction, the user gets back the amount, the source, and the description"?
> And if you want the LLM to summarize things, you run an isolated instance that makes a summary
And it will make a summary exactly how?