Comment by m4ck_
3 days ago
Is your RAG manpages thing on github somewhere? I was thinking about doing something like that (it's high on my to-do list but I haven't actually done anything with llms yet.)
3 days ago
Is your RAG manpages thing on github somewhere? I was thinking about doing something like that (it's high on my to-do list but I haven't actually done anything with llms yet.)
I'll get it up soon, probably should. This little snippet will help you though:
That goes man -> html -> markdown which is not only token efficient but also llms are pretty good at creating hierarchies from markdown
I bet you could do the same thing with pandoc and skip serializing to HTML entirely.
Apparently yes: https://pandoc.org/MANUAL.html#options
Not the OP, but I did release my source :D https://github.com/scottyeager/Pal
My tool can read stdin, send it to an LLM, and do a couple nice things with the reply. Not exactly RAG, but most man pages fit into the context window so it's okay.