Comment by 34679
4 days ago
The models don't know which portion of the entire context is relevant to your most recent query. It works better in the standalone app because there your query is the entire context, whereas otherwise the context is your query + x irrelevant tokens.
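To put rough numbers on that, here is a minimal sketch of how small the query's share of the context becomes once other tokens are added. It assumes whitespace splitting as a crude stand-in for a real tokenizer, and `query_share` is a hypothetical helper made up for illustration. It only measures dilution; it says nothing about what a model actually attends to.

```python
def query_share(query: str, other_context: str = "") -> float:
    """Fraction of the context's tokens that belong to the query.

    Assumption: tokens are approximated by whitespace-separated words,
    which is not how real model tokenizers work.
    """
    q = len(query.split())          # tokens in the query
    x = len(other_context.split())  # the "x irrelevant tokens"
    total = q + x
    return q / total if total else 0.0


query = "Why does this function return None?"

# Standalone app: the query is the entire context.
standalone = query_share(query)

# Otherwise: the same query plus a pile of unrelated text.
embedded = query_share(query, "unrelated file contents " * 500)

print(f"standalone share: {standalone:.3f}")
print(f"embedded share:   {embedded:.3f}")
```

In the standalone case the share is 1.0 by construction; with the extra text it drops to a fraction of a percent, which is the "query + x irrelevant tokens" situation described above.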