Comment by stephengillie
13 years ago
The requested URL /~mc2zk/cs451/mco_P4.ppt‎ was not found on this server.
I don't know who is mangling the URL (Chrome, Apache, MITM?) nor why it's happening.
13 years ago
There's an invisible Unicode U+200E left-to-right mark at the end of the URL, probably picked up when the parent cut and pasted the URL into HN. In UTF-8 this is E2 80 8E, which the server misinterprets as Windows-1252: E2 = â, 80 = €, 8E = Ž. (It could be ISO-8859-1, except that doesn't include €.) Interestingly, Chrome's DOM inspector shows this character as the HTML entity &amp;lrm; while view-source has it as the actual invisible character.
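For anyone who wants to check the byte arithmetic, here is a minimal Python sketch. It only reproduces the decoding described above; it says nothing about what the server actually runs, and the variable names are mine.

```python
# U+200E LEFT-TO-RIGHT MARK, the invisible character at the end of the URL.
lrm = "\u200e"

# Its UTF-8 encoding is the three bytes E2 80 8E.
utf8_bytes = lrm.encode("utf-8")
print(utf8_bytes.hex(" ").upper())  # E2 80 8E

# Reading those same bytes as Windows-1252 gives three visible characters.
mojibake = utf8_bytes.decode("cp1252")
print(mojibake)  # â€Ž

# ISO-8859-1 maps 0x80 and 0x8E to C1 control characters instead,
# so no euro sign would show up under that interpretation.
latin1 = utf8_bytes.decode("latin-1")
print([hex(ord(c)) for c in latin1])  # ['0xe2', '0x80', '0x8e']
```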
I think the poster of the URL originally mangled it, but it would be nice if the HN software filtered out invisible characters from URLs. There's not much the destination server can do about it.
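A rough sketch of what that kind of filtering could look like, in Python. This is not how HN works; `strip_format_chars` is a hypothetical helper, and dropping every character in Unicode category Cf (format characters, which includes U+200E) is an assumption that may be too aggressive for some internationalized URLs.

```python
import unicodedata


def strip_format_chars(url: str) -> str:
    """Drop invisible format characters (Unicode category Cf) from a URL.

    Hypothetical helper for illustration only. Category Cf covers
    U+200E LEFT-TO-RIGHT MARK along with other zero-width and
    directional marks.
    """
    return "".join(ch for ch in url if unicodedata.category(ch) != "Cf")


# The URL from this thread with the stray mark appended, as it was pasted.
pasted = "http://www.cs.virginia.edu/~mc2zk/cs451/mco_P4.ppt\u200e"
cleaned = strip_format_chars(pasted)

print(len(pasted) - len(cleaned))  # 1
print(cleaned.endswith(".ppt"))    # True
```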
(Yes, I've dealt with too many character set issues in the past.)
http://www.cs.virginia.edu/~mc2zk/cs451/mco_P4.ppt
Does that work?
I copied and pasted it from a Google search result (because otherwise the file downloads without showing me the URL). Google of course has decided "copy link to" shouldn't work.
There's an addon for that.
It shouldn't need an addon though. I'm often on Google's side on debates but this is just ridiculous.
Addon to what?
I’ve noticed this issue many times. Is Google deliberately inserting Unicode crap at the end of URLs in order to prevent copy-and-paste from their search results? Or perhaps it’s to prevent scraping, and the loss of copy-and-paste functionality is just collateral damage.
There's a U+200E character between "www.cs.virginia.edu/~mc2zk/cs451/mco_P4.ppt" and "."