Comment by 0xbadcafebee
4 months ago
In the past, browsers used an algorithm which only denied setting wide-ranging cookies for top-level domains with no dots (e.g. com or org). However, this did not work for top-level domains where only third-level registrations are allowed (e.g. co.uk). In these cases, websites could set a cookie for .co.uk which would be passed onto every website registered under co.uk.
Since there was and remains no algorithmic method of finding the highest level at which a domain may be registered for a particular top-level domain (the policies differ with each registry), the only method is to create a list. This is the aim of the Public Suffix List.
(https://publicsuffix.org/learn/)
So, once they realized web browsers are all inherently flawed, their solution was to maintain a static list of websites.
God I hate the web. The engineering equivalent of a car made of duct tape.
> Since there was and remains no algorithmic method of finding the highest level at which a domain may be registered for a particular top-level domain
A centralized list like this, covering not just registry suffixes (e.g. co.uk) but also specific platforms (e.g. s3-object-lambda.eu-west-1.amazonaws.com), is kind of crazy: the list will bloat a lot over the years, and it's a security risk for any platform that needs this functionality but would prefer not to leak any details publicly.
We already have the concept of a .well-known directory that you can use, when talking to a specific site. Similarly, we know how you can nest subdomains, like c.b.a.x, and it's more or less certain that you can't create a subdomain b without the involvement of a, so it should be possible to walk the chain.
Example:
Maybe ship the domains with the browsers and such and leave generic sites like AWS or whatever to describe things themselves. Hell, maybe that could also have been a TXT record in DNS as well.
> any platform that needs this functionality but would prefer not to leak any details publicly.
I’m not sure how you’d have this - it’s for the public facing side of user hosted content, surely that must be public?
> We already have the concept of a .well-known directory that you can use, when talking to a specific site.
But the point is to help identify dangerous sites, by definition you can’t just let the sites mark themselves as trustworthy and rotate around subdomains. If you have an approach that doesn’t have to trust the site, you also don’t need any definition at the top level you could just infer it.
It's actually exactly the same concept that came to mind for me. `SomeUser.geocities.com` is "tainted", along with `*.geocities.com`, so `geocities.com/.well-known/i-am-tainted` is actually reasonable.
Although technically it might be better as `.well-known/taint-regex` (now we have three problems), like `TAINT "*.sites.myhost.com" ; "myhost.com/uploads/*" ; ...`
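A sketch of how such a hypothetical taint record might be parsed and applied, using shell-style globs rather than full regexes; the record format and all hostnames are invented for illustration:

```python
from fnmatch import fnmatch

def parse_taint_record(record: str) -> list[str]:
    """Parse a hypothetical 'TAINT "pat" ; "pat" ; ...' record
    into a list of glob patterns."""
    body = record.removeprefix("TAINT").strip()
    return [p.strip().strip('"') for p in body.split(";") if p.strip()]

def is_tainted(url: str, patterns: list[str]) -> bool:
    # A URL is tainted if any declared glob pattern matches it.
    return any(fnmatch(url, p) for p in patterns)

patterns = parse_taint_record(
    'TAINT "*.sites.myhost.com" ; "myhost.com/uploads/*"')
print(is_tainted("evil.sites.myhost.com", patterns))  # True
```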
It does smell very much like a feature that is currently implemented as a text file but will eventually need to grow into its own protocol, much like the hosts file becoming DNS.
One key difference between this list and standard DNS (at least as I understand it; maybe they added an extension to DNS I haven't seen) is the list requires independent attestation. You can't trust `foo.com` to just list its subdomains; that would be a trivial attack vector for a malware distributor to say "Oh hey, yeah, trustme.com is a public suffix; you shouldn't treat its subdomains as the same thing" and then spin up malware1.trustme.com, malware2.trustme.com, etc. Domain owners can't be the sole arbiter of whether their domain counts as a "public suffix" from the point of view of user safety.
It looks like Mozilla does use DNS to verify requests to join the list, at least.
Doing this DNS in the browser in real-time would be a performance challenge, though. PSL affects the scope of cookies (github.io is on the PSL, so a.github.io can't set a cookie that b.github.io can read). So the relevant PSL needs to be known before the first HTTP response comes back.
I presume it has to be a curated list otherwise spammers would use it to evade blocks. Otherwise why not just use DNS?
Whois would be the choice: DNS's less glamorous sibling, purpose-built for delegated publication of accountability records.
> God I hate the web
This is mostly a browser security mistake but also partly a product of ICANN policy & the design of the domain system, so it's not just the web.
Also, the list isn't really that long, compared to, say, certificate transparency logs; now that's a truly mad solution.
Show me a platform not made out of duct tape and I'll show you a platform nobody uses.
regular cars?
Jeep just had an OTA update cause the car to shut down on the highway (it is rumored).
Before we put computers in cars, we had the myriad small things that would break (stuck doors, stuck windows, failed seals, leaking gaskets), a continuous stream of recalls for low-probability safety issues, and the occasional Gremlin or Pinto.
My favorite example is the Hyundai Elantra. They changed the alloy used in one of the parts in the undercarriage. Tested that model to death for a year, as they do, but their proving ground is in the southern United States.
Several winters later, it turns out that road salt attacks the hell out of that alloy and people have wheels flying off their cars in the middle of the road.
The Honda issue where tuning to a certain radio station would brick the infotainment? Is that good enough?
Admitting I'm old, but my HP-11C still gets pretty-regular use.
And judging by eBay prices, or the SwissMicros product line, I suspect I have plenty of company.
"The engineering equivalent of a car made of duct tape"
Kind of. But do you have a better proposition?
I'd probably say we ought to use DNS.
And while we’re at it, 1) mark domains as https-only, and 2) indicate when root domains map to a subdomain (e.g. www).
Cookies shouldn't be tied to domains at all, it's a kludge. They should be tied to cryptographic keypairs (client + server). If the web server needs a cookie, it should request one (in its reply to the client's first request for a given url; the client can submit again to "reply" to this "request"). The client can decide whether it wants to hand over cookie data, and can withhold it from servers that use different or invalid keys. The client can also sign the response. This solves many different security concerns, privacy concerns, and also eliminates the dependency on specific domain names.
I just came up with that in 2 minutes, so it might not be perfect, but you can see how with a little bit of work there's much better solutions than "I check for not-evil domain in list!"
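For illustration only, here is a toy sketch of that idea: cookies keyed to a server key rather than a domain. HMAC with a shared secret stands in for real asymmetric signatures so the sketch runs on the standard library alone, and every name is invented:

```python
import hashlib
import hmac
import secrets

class Client:
    """Toy client that scopes cookies to server keys, not domains."""

    def __init__(self):
        self.cookies = {}  # server key id -> cookie value

    def handle_cookie_request(self, server_key_id: str, secret: bytes) -> bytes:
        # Mint (or reuse) a cookie bound to this server key; a server
        # presenting a different key gets a different, unlinkable cookie.
        value = self.cookies.setdefault(server_key_id, secrets.token_hex(8))
        # "Sign" the handed-over cookie so the server can verify it.
        return hmac.new(secret, value.encode(), hashlib.sha256).digest()

client = Client()
secret = b"shared-demo-secret"
tag = client.handle_cookie_request("server-key-1", secret)
client.handle_cookie_request("server-key-2", secret)
```

The point of the sketch is only the key-scoping: the replies below explain why the round-trip and key-leak consequences make this harder than it looks.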
> They should be tied to cryptographic keypairs (client + server).
So now, if a website leaks its private key, attackers can exfiltrate cookies from all of its users just by making them open an attacker-controlled link, for as long as the cookie lives (and users don't visit the website to get the rotated key).
> If the web server needs a cookie, it should request one
This adds a round-trip, which slows down the website on slow connections.
> the client can submit again to "reply" to this "request"
This requires significantly overhauling HTTP and load-balancers. The public-suffix list exists because it's an easy workaround that didn't take a decade to specify and implement.
A part of the issue is IMO that browsers have become ridiculously bloated everything-programs. You could take about 90% of that out and into dedicated tools and end up with something vastly saner and safer and not a lot less capable for all practical purposes. Instead, we collectively are OK with frosting this atrocious layer cake that is today's web with multiple flavors of security measures of sometimes questionable utility.
End of random rant.
"You could take about 90% of that out and into dedicated tools"
But then you would lose platform independence, the main selling point of this atrocity.
Having all those APIs in a sandbox that mostly just works on billions of devices is pretty powerful, and a potential successor to HTML would have to beat that to be adopted.
The best thing that could happen, as far as I can see, is that a sane subset crystallizes that people predominantly use, with the rest becoming legacy, maintained only to keep it working.
I have dreamed of a fresh rewrite of the web since university (and the web was way slimmer back then), but I've become a bit more pragmatic, and I think I now understand how massive the problem of solving trusted human communication really is. It isn't easy in the real world.
>A part of the issue is IMO that browsers have become ridiculously bloated everything-programs.
I don't see how that solves the issue the PSL tries to fix. I was a script kiddie hosting Neopets phishing pages on free cPanel servers from <random>.ripway.com back in 2007, and browsers were way less capable then.
Are you saying we should make a <Unix Equivalent Of A Browser?> A large set of really simple tools that each do one thing really really really pedantically well?
This might be what's needed to break out of the current local optimum.
You are right from a technical point, I think, but in reality - how would one begin to make that change?
I'm under the impression that CORS largely solves it?
Which is still much too new to be able to shut down the PSL, of course. But maybe by 2050.
Since this is being downvoted: no, I'm quite serious.
CORS lets sites define their own security boundaries between subdomains, with mutual validation. If you're hosting user content on a subdomain, just don't allow-origin it: that is a clear statement that it's not "the same site". The PSL plays absolutely no part in that logic; it seems clear to me that CORS is at least in part intended to replace the PSL.
Do other sites (like google's safety checks) use CORS for this purpose? Dunno. Seems like they could though? Or am I missing something?
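The decision the parent comment describes is just a per-request header policy on the origin server; a minimal sketch, with hypothetical hostnames:

```python
# Origins the server considers part of its own trust boundary.
TRUSTED_ORIGINS = {"https://app.example.com", "https://api.example.com"}

def cors_headers(request_origin: str) -> dict[str, str]:
    """Decide the CORS response headers for a cross-origin request."""
    if request_origin in TRUSTED_ORIGINS:
        return {"Access-Control-Allow-Origin": request_origin,
                "Vary": "Origin"}
    # User-content subdomains get no allow-origin header at all,
    # so the browser refuses to expose responses to them.
    return {}

print(cors_headers("https://usercontent.example.com"))  # {}
```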
I think we lost the web somewhere between PageRank and JavaScript. Up to there it was just linked documents and it was mostly fine.
Why is it a centrally maintained list of domains, when there is a whole extensible system for attaching metadata to domain names?
I love the web. It's the corporate capitalistic ad fueled and govt censorship web that is the problem.
> God I hate the web. The engineering equivalent of a car made of duct tape.
Most of the complex things I have seen being made (or contributed to) needed duct tape sooner or later. Engineering is the art of trade-offs, of adapting to changing requirements (which can appear due to uncontrollable events external to the project), technology, and costs.
Related, this is how the first long distance automobile trip was done: https://en.wikipedia.org/wiki/Bertha_Benz#First_cross-countr... . Seems to me it had quite some duct tape.
Why would you compare the Web to that? The first fax message would be a more appropriate comparison.
The Web is not a new thing, and it is hardly a technical experiment of a few people any more.
If you add the time elapsed since the concept of the Web was announced to that trip's date, you already get a very decently established industry, with many sport and mass-production designs:
https://en.wikipedia.org/wiki/Category:Cars_introduced_in_19...
For me the web is something along the lines of the definition at https://en.wikipedia.org/wiki/World_Wide_Web, which sums it up as a "...universal linked information system...". I think the fax misses too many aspects of that core definition to be a good comparison.
I'm not sure what your point about a "decent established industry" is in relation to "duct tape". I see two possibilities:
a) you imply that the web does not have a decently established industry (but I would guess not).
b) you would claim that there was no "duct tape" in 1924 car industry. I am no expert but I would refer you to the article describing what was the procedure to start the car at https://www.quora.com/How-do-people-start-their-cars-in-the-..., to quote:
> Typical cold-start routine (common 1930s workflow)
> 1. Set hand choke (pull knob).
> 2. Set throttle lever to slight fast‑idle.
> 3. Retard spark if manual advance present.
> 4. Engage starter (electric) or use hand crank.
> 5. Once running, push choke in gradually, advance spark, reduce throttle.
I'm not sure about your opinion, but compared to a car's objective (move from point A to point B), that sounds rather involved to me. I'm not sure it qualifies as "duct tape", but it is definitely not a "nicely implemented system that just works".
To summarize my point: I think that, on average, progress is slower and harder than people think, mostly because people have no exposure to the work being done to improve things until something becomes more "widely available".
That's the nature of decentralised control. It's not just DNS, phone numbers work in the same way.
All web encryption is backed by a static list of root certs that each browser maintains.
I don't know any other way to solve it for the general public (ideally each user would pick which root certs they trust), but it does seem crazy.
We already have a solution to solve it: DNS-based Authentication of Named Entities (DANE)
This solution is even more obvious today where most certificates are just DNS lookups with extra steps.
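For reference, a DANE association is published as a TLSA record in the domain's own zone; this hypothetical zone-file example pins the SHA-256 digest of the server's public key (usage 3 = DANE-EE, selector 1 = SPKI, matching type 1 = SHA-256; the digest shown is a placeholder):

```
; Hypothetical DANE record for HTTPS on example.com
_443._tcp.example.com. IN TLSA 3 1 1 (
    0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef )
```

The record only helps if the client validates it over DNSSEC, which is part of why browser adoption never happened.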
What we need is a web made in a similar way to the wicker-bodied cars of yesteryear
I'm not sure I follow: what inherent flaw are you suggesting browsers had that the public suffix list's originators knew about?
Wait until you learn about the HSTS preload list.