Comment by carlsborg
6 years ago
If you strace chrome on linux it also picks up /etc/machine-id (or it did back when I looked), which is a 32 byte randomly generated string which uniquely identifies you and on some systems is used as the DHCP ID across reboots.
First I thought reading /etc/machine-id would be expected if Chrome uses D-bus or pulseaudio libraries which depend on D-bus, and /etc/machine-id is part of D-bus. But no, they really use it for tracking purposes.
And in a sick twist they have this comment for it:
In fairness, the guidelines they reference suggest you do exactly what the comment says they're doing (assuming they're keying the hash). The guidelines seem explicitly written with the idea that unique identifiers _derived from_ this value are not similarly quarantined, provided that you cannot take the derived value and "reverse" it back to the original identifier.
Quoting from https://www.freedesktop.org/software/systemd/man/machine-id....:
This ID uniquely identifies the host. It should be considered "confidential", and must not be exposed in untrusted environments, in particular on the network. If a stable unique identifier that is tied to the machine is needed for some application, the machine ID or any part of it must not be used directly. Instead the machine ID should be hashed with a cryptographic, keyed hash function, using a fixed, application-specific key. That way the ID will be properly unique, and derived in a constant way from the machine ID but there will be no way to retrieve the original machine ID from the application-specific one.
> Instead the machine ID should be hashed with a cryptographic, keyed hash function, using a fixed, application-specific key.
Reading https://cs.chromium.org/chromium/src/chrome/browser/policy/b..., I do not not see it being hashed with a key, just unkeyed sha1.
I think it doesn't make much sense to protect it because in popular Linux distributions an unprivileged user can access such identifiers as MAC addresses of network interfaces, HDD serial numbers etc.
> If a stable unique identifier that is tied to the machine is needed for some application,
Ideally there should be no stable identifiers accessible to untrusted applications.
Now go and read http://jdebp.uk./Softwares/nosh/guide/commands/machine-id.xm... and RFCs 3041 and 4941.
What else is going to break if one randomises that ID (per boot or per hour, say)?
2 replies →
> which is why we hash it first and then encode it in base64 before transmitting it.
This made me chuckle. "As per the rules, we'll put on a boxing glove before we punch your lights out". You wont get privacy, but at least there is some security!
> As per the rules, we'll put on a boxing glove before we punch your lights out
This also made me chuckle
"Tracking purposes" is such a weasel word, when we're really talking about device management in an enterprise setting, and this code only gets activated if the root/administrator user has installed a token file on your computer.
That really is a cynical comment. It almost bothers me more than this header.
Which (among many other things) can be faked with firejail, if you absolutely have to run Chromium (e.g. for testing):
Chromium doesn't seem to read that file.
When puppeteer first came out I was nervous to use it for scraping because I could totally see Chrome pulling tricks like this to help recaptcha in identifying the bots. I’m still not convinced they aren’t.
firefor / tor also read this file
What does tor do with it? Maybe pass it along in packet timing intervals, or something ... ;o)
That's not a correct description.
* http://jdebp.uk./Softwares/nosh/guide/commands/machine-id.xm...
True, more precisely - 16 bytes, 32 hex characters. Your link is in agreement "The machine ID is usually generated from a random source during system installation or first boot and stays constant for all subsequent boots." And See https://wiki.debian.org/MachineId at least one distro uses it for the DHCP ID.
"At least one distro" is not correct either. It's used by systemd-networkd, specifically.
* http://jdebp.uk./Softwares/nosh/guide/commands/machine-id.xm...
3 replies →
And this is a legal thing to do?