Comment by dang

6 hours ago

  Heavy slop (5+ patterns) · 105 sites · 21%
  Mild (2–4) · 230 sites · 46%
  Clean (0–1) · 165 sites · 33%

Can we have a list of the "clean" ones please? Actually, if you give me a list of the IDs for all 3 categories, I'll make URLs for each that people can browse.

If the community feels that the division is useful, then we can maybe take you up on your offer to open-source the project, and perhaps find a way to use it on HN itself.