← Back to context

Comment by chaxor

2 years ago

Wow, if 4 degrees out doesn't give 70k nodes it's extended to 7 degrees out? I figured almost any website would yield a near complete graph at just ~6 degrees out.

This is a fantastic idea though. I have been looking for something like this for quite some time to just have basically wikipedia, stackexchange, email, and gitlab available for 'work-mode'. My solution was to make my own search engine with various tools, but this may be easier.

Anyone know of other good solutions in this area for restricting to just {wikipedia, stackexchange, gitlab}?

I've recently been getting into Brave Goggles, and they have a `discard` feature so any site you don't explicitly mention get discarded from the results. For eg., see https://github.com/brave/goggles-quickstart/blob/main/goggle...

They use 1000 domains from Hacker News posts and do ranking and all that, but for your purpose, you can just have

    $discard
    $boost,site=wikipedia.org
    $boost,site=stackexchange.com
    $boost,site=gitlab.com

I've been using a self-hosted instance of SearXNG (a meta search engine) to exclude a lot of spammy sites from search results. I'm not sure if you can force it to only search a list of domains, but might be worth looking into.