← Back to context

Comment by jnnnthnn

1 year ago

Thanks! I tried a few other approaches and found the LLM results were overall better (latency and cost aside). Maybe that should be an option made available to users though...

i think not, better results >>> better latency + cost

  • Maybe a combined approach beats either? Let some non-LLM reranker quickly spit out two results, and fill in the rest with the LLM.