Comment by sailingparrot

7 months ago

> So what is this marketing stuff about "directly into a model's working memory"? Is my mental view wrong?

Context is sometimes called working memory. But no, your understanding is right: find the right documents through cosine similarity (and thus through embeddings), then add the content of those docs to the context.
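
A minimal sketch of that flow, for illustration only. It assumes a toy bag-of-words `embed` standing in for a real embedding model, and the names `embed`, `cosine`, `retrieve`, and `build_context` are hypothetical, not taken from the article or any library:

```python
import math
from collections import Counter

def embed(text):
    # Assumption: toy bag-of-words vector in place of a real embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, docs, k=1):
    # Rank documents by similarity to the query and keep the top k.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def build_context(query, docs, k=1):
    # "Working memory" here is just the prompt: retrieved docs, then the question.
    return "\n\n".join(retrieve(query, docs, k)) + "\n\nQuestion: " + query

docs = [
    "The harbor opens at six in the morning.",
    "Parrots can mimic human speech.",
    "Cosine similarity compares the angle between two vectors.",
]
query = "what does cosine similarity compare"
prompt = build_context(query, docs)
```

In a real setup the embeddings would come from a model and the ranking from a vector index, but the shape is the same: score, pick the top matches, paste them into the context.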

greymalik 7 months ago

One of the things I find confusing about this article is that the author positions RAG as being unrelated to both context engineering and vector search.