English Wikipedia is ~4.8B words, and the model's knowledge cutoff is six months. A valid use case is to search across Wikipedia and ground your answers.
Trivially proves that RAG is still needed.
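The retrieval half of that is genuinely small. A minimal sketch in Python using the public MediaWiki search API; `ask_llm` is a hypothetical stand-in for whatever completion endpoint you use:

```python
# Minimal RAG-over-Wikipedia sketch: search first, then stuff the top
# snippets into the prompt so answers are grounded in current text
# rather than a 6-month-old training set.
import requests

def search_wikipedia(query: str, limit: int = 3) -> list[str]:
    """Return top search snippets from the public MediaWiki search API."""
    resp = requests.get(
        "https://en.wikipedia.org/w/api.php",
        params={
            "action": "query",
            "list": "search",
            "srsearch": query,
            "srlimit": limit,
            "format": "json",
        },
        timeout=10,
    )
    resp.raise_for_status()
    # Snippets come back with HTML highlight markup; fine for a sketch.
    return [hit["snippet"] for hit in resp.json()["query"]["search"]]

def grounded_answer(question: str) -> str:
    context = "\n\n".join(search_wikipedia(question))
    prompt = f"Answer using only this context:\n\n{context}\n\nQ: {question}"
    return ask_llm(prompt)  # hypothetical: your completion API here
```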
This is only for the small model. The medium model is still at 1M (like Gemini 2.5).
Even if we could get the mid-size models to 10M, that still only covers a medium-sized repo at best. Repo growth will also accelerate as LLMs generate more code. There's no way to catch up.
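Back-of-envelope on that, assuming ~10 tokens per line of code (varies a lot by language and tokenizer; the Chromium figure is an often-cited ballpark):

```python
# Why even a 10M-token window doesn't keep up with real codebases.
TOKENS_PER_LOC = 10         # assumed average; varies by tokenizer/language
WINDOW = 10_000_000         # hypothetical 10M-token context

repos = [
    ("medium repo", 500_000),            # ~500k LOC
    ("Chromium-scale repo", 30_000_000), # tens of millions of LOC
]
for name, loc in repos:
    tokens = loc * TOKENS_PER_LOC
    verdict = "fits" if tokens <= WINDOW else "blows past"
    print(f"{name}: {loc:,} LOC ≈ {tokens / 1e6:.0f}M tokens ({verdict} the window)")
# medium repo: 500,000 LOC ≈ 5M tokens (fits the window)
# Chromium-scale repo: 30,000,000 LOC ≈ 300M tokens (blows past the window)
```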
RAG still has lots of benefits for anyone paying per input token (e.g. over APIs).
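Rough math on the bill, with an illustrative price (not any particular vendor's rate):

```python
# Input-token cost per query: whole-corpus prompt vs. a few retrieved chunks.
PRICE_PER_M_INPUT = 3.00   # assumed $ per 1M input tokens, illustrative only

scenarios = [
    ("full 10M-token context", 10_000_000),
    ("RAG, 4 chunks x ~1k tokens", 4_000),
]
for name, tokens in scenarios:
    cost = tokens / 1e6 * PRICE_PER_M_INPUT
    print(f"{name}: ${cost:.4f} per query")
# full 10M-token context: $30.0000 per query
# RAG, 4 chunks x ~1k tokens: $0.0120 per query
```

That's a ~2500x difference per query, before prompt caching even enters the picture.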
Not to mention latency.
And grounding for the model. Smaller models with grounding tend to hallucinate a little less (anecdotally).
RAG scales up as everything else scales up. Flooding prompts with garbage is not a sound strategy...