Comment by Dylan16807

8 hours ago

The user asks for details of the last transaction, the user gets back the amount, the source, and the description in a safely quoted format with the LLM never reading it.

You can't inject the LLM if it doesn't see the data.

An architecture like this won't work in many situations, but it can work for a lot of simple questions.

And if you want the LLM to summarize things, you run an isolated instance that makes a summary and you never show that summary to the LLM that's following the user's instructions.

3 comments

Dylan16807

lelandbatey 7 hours ago

You can do this, it is useful, but it's just not the same as where the goalposts are now which is: the AI is a person in a box and can do everything a person can.

If we actually limit them to "only accepts tiny ultra well defined problems and ultra well defined outputs" then theycease being a $10T/year idea and become a merely $10B/year idea.

Thus, it is not exactly popular at the moment.

troupo 8 hours ago

> The user asks for details of the last transaction, the user gets back the amount, the source, and the description in a safely quoted format

What's "safely quoted format" when prompt injection is already safe in the description?

> You can't inject the LLM if it doesn't see the data.

How doesn't it see the data when you literally say "The user asks for details of the last transaction, the user gets back the amount, the source, and the description"?

> And if you want the LLM to summarize things, you run an isolated instance that makes a summary

And it will make a summary exactly how?

Dylan16807 4 hours ago

> How doesn't it see the data when you literally say "The user asks for details of the last transaction, the user gets back the amount, the source, and the description"?
The above post said how. The LLM writes code to do it. The code has a function to send text to the user. The LLM is not allowed to see the text, only the user is.
> And it will make a summary exactly how?
The second summarizing-only LLM is fed the raw data and allowed to output summary text. This is then sent directly to the user and put in a box with some hazard lines on it. The main LLM is not allowed to see the summary, only the user is.