Comment by remram
2 months ago
I'm not too surprised by this, but I'm annoyed that no amount of configuration made those bytes visible again in my editor. Only using hexdump revealed them.
2 months ago
I'm not too surprised by this, but I'm annoyed that no amount of configuration made those bytes visible again in my editor. Only using hexdump revealed them.
Here's a POC that works in emacs. Doesn't cover all of the relevant characters, but:
And as a higher-level configuration you can set most, maybe even all, of the relevant invisible characters (still not sure how 0x34f grapheme joiner fits in) at once with something like:
This will modify values in glyphless-char-display, but it's OK to modify those directly if you need to.
Here is the bare minimum this is built on, which you can type in yourself if you're paranoid or want to start from the bottom up. Swap in the hexadecimal codepoint of the invisible character after the ?\x
I use vim. It seems like `:set binary enc=latin1` works, though I don't understand why the latin1 part is required.
[dead]
vscode's "Unicode Highlight: Non-basic ASCII" causes the character to get highlighted. Sadly, the more appropriate "Unicode Highlight: Invisible Characters" setting does not reveal them.
My mind went the same place.
Anyone know a more convenient way to search larger blocks of text for this?