Comment by naasking
3 days ago
I think the idea is to train a small, minimal LLM thinking model that can run on edge devices, but that has very little knowledge embedded in its weights, and so performs a sort of RAG to Encylopedia Britannica to ground answers to user queries.
No comments yet
Contribute on Hacker News ↗