Comment by bigggbob

5 days ago

Nice project. The 4GB index / ~15s search part made me think zvec might be a good fit here: https://github.com/alibaba/zvec

It’s an in-process vector DB, so the “local corpus, no separate server” shape is pretty much what it’s designed for. Its benchmark numbers are quite strong, and recent versions also support full-text + hybrid retrieval and DiskANN.

This would be an interesting case to try with zvec: same corpus, same embedding model, then compare indexing time, index size, memory usage, and query latency on normal hardware.

0 comments

bigggbob

No comments yet

Contribute on Hacker News ↗