Comment by flixing

5 months ago

Do you think kagi is the right Eval tool? If so,why?

The right eval tool depends on your evaluation task. Kagi LLM benchmark focuses on using LLMS in the context of information retrieval (which is what Kagi does) which includes measuring reasoning and instruction following capabilities.