Comment by 34679
7 months ago
The models don't know what portion of the entire context is relevant to your most recent query. The reason it works better is because in the standalone app, your query is the entire context, whereas otherwise it's query + x irrelevant tokens.
No comments yet
Contribute on Hacker News ↗