Comment by Voloskaya
2 days ago
> So what is this marketing stuff about "directly into a model's working memory."? Is my mental view wrong?
Context is sometimes called working memory. But no, your understanding is right: you find the right documents through cosine similarity (and thus through embeddings), then add the content of those docs to the context.
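To make that concrete, here is a rough sketch of the two steps (rank docs by cosine similarity, then paste the winners into the context). Everything in it is an assumption for illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, and `retrieve`, `build_prompt`, and the sample docs are hypothetical names, not anything from the article.

```python
import math
from collections import Counter


def embed(text):
    # Toy stand-in for a real embedding model (assumption): word counts.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=1):
    # Rank docs by similarity to the query and keep the top k.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]


def build_prompt(query, docs, k=1):
    # "Adding to the context" is just string concatenation before the question.
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


docs = [
    "Cosine similarity compares the angle between two vectors.",
    "Sourdough bread needs a long fermentation.",
]
prompt = build_prompt("how does cosine similarity work", docs)
```

A real setup would swap `embed` for a learned embedding model and the `sorted` call for a vector index, but the shape stays the same: retrieval picks the text, and the prompt is where it lands.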
One of the things I find confusing about this article is that the author positions RAG as being unrelated to both context engineering and vector search.