Comment by andes314

4 days ago

This is a great question! Yes, for a very specific subset of problems: those where you need total recall. HNSW-based algorithms typically compare the query against only a subset of the collection, which makes them much faster than a linear-time search, but it also means they sometimes miss the true best results. That is often a trade-off worth making. I aimed to keep total recall, and my method does in fact perform faster for this particular use case.
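To make the recall point concrete, here is a minimal sketch of how recall@k can be measured against an exact linear scan. It is not my method and it is not HNSW: the random candidate subset is only a crude stand-in for "the index looked at part of the collection", and every function name here (`exact_top_k`, `subset_top_k`, `recall_at_k`) is made up for illustration.

```python
import heapq
import random

def dot(a, b):
    # Inner-product similarity between two equal-length vectors.
    return sum(x * y for x, y in zip(a, b))

def exact_top_k(query, vectors, k):
    # Linear scan: every vector is scored, so the true top-k is always found.
    return heapq.nlargest(k, range(len(vectors)),
                          key=lambda i: dot(query, vectors[i]))

def subset_top_k(query, vectors, k, candidate_ids):
    # Scores only the given candidates (hypothetical stand-in for an ANN index).
    return heapq.nlargest(k, candidate_ids,
                          key=lambda i: dot(query, vectors[i]))

def recall_at_k(exact_ids, approx_ids):
    # Fraction of the true top-k that the approximate search also returned.
    return len(set(exact_ids) & set(approx_ids)) / len(exact_ids)

rng = random.Random(0)
dim, n, k = 16, 1000, 10
vectors = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
query = [rng.gauss(0, 1) for _ in range(dim)]

exact = exact_top_k(query, vectors, k)

# Assumption for illustration: the "index" only visits 25% of the collection.
candidates = rng.sample(range(n), n // 4)
approx = subset_top_k(query, vectors, k, candidates)

print("recall@k, full scan:", recall_at_k(exact, exact))
print("recall@k, subset   :", recall_at_k(exact, approx))
```

The full scan always scores 1.0 by construction. The subset search can only return true neighbors that happen to be among its candidates, which is the kind of miss I mean above; a real HNSW index picks its candidates far more cleverly than random sampling, so its recall is typically much higher than this toy would suggest.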

Are you aware of any benchmarks comparing the accuracy trade-offs of HNSW and binary quantization across different models/datasets?