Comment by zmccormick7
1 year ago
Agreed that thresholds don't work when applied to the cosine similarity of embeddings. But I have found that the similarity score returned by high-quality rerankers, especially Cohere, are consistent and meaningful enough that using a threshold works well there.
I use similarity threshold (to remove absolutely irrelevant results) and then use a reranker to get Top N.