Comment by gillesjacobs

5 days ago

Do you have any retrieval and generation metric scores (e.g., on the KILT or NQ datasets)?

I know benchmark datasets are not the be-all and end-all, but a halfway decent score and inference time would really help sell your framework (or help engineers make the choice).

In any case, very cool work. I have built a lot of RAG pipelines as a freelance NLP engineer, and I will try this out.