← Back to context

Comment by grishka

4 years ago

Huh, spaces. There's way too much software, especially on Windows, that breaks when there are Cyrillic characters in a path. I'll let you guess how I found out.

I had a really odd one last year where a Grave I ( well known brand name) got converted by office/excell into a Double Grave I.

The double grave I is used by some obscure orthodox religionious texts

A friend had the username "Rubén" and jfc it broke everything other than windows itself xD

  • The problem isn't the Cyrillic or the é but the fact that Windows lets you put those characters in file names in non-Unicode encodings which will create sequences of bytes which are invalid UTF-8. It's 2021, FFS, stop using legacy encodings.

    • All win32 functions that accept or return strings come in two varieties, with A and W suffixes, MessageBoxA/MessageBoxW. The A works with the system default 8-bit encoding (cp1251 in case of Cyrillic), the W works with unicode in wide chars. There shouldn't be much of a problem with string handling if you stick exclusively with W functions.

      6 replies →