Comment by GodelNumbering
3 hours ago
This looks very promising. Thank you for investing time in this.
Assuming it indexes everything locally and falls back to traditional search engines if none found, how do you feel about adding a shared middle layer? A layer that simply indexes all the canonical data that doesn't have any personal info. This way, the contributors can automatically contribute the pages they index - building a shared search engine over time! The whole thing can work without a crawler of its own (under appropriate license so people can trust it)
This is an awesome idea in theory, I'd love to go to this direction, but it's a surprisingly complex topic. I find it hard to come up with an implementation that can guarantee both result quality (no malicious actors) and user privacy.
I'd appreciate any kind of help designing such system. We are on IRC/Discord/Github/Codeberg.