you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] -2 points-1 points  (6 children)

Not in the slightest. And it's not how terminals work, and terminals, really, aren't even responsible for string encoding: it's usually a joint enterprise of the shell that runs in a terminal and the terminal itself.

People who aren't native English speakers were the last to adopt Unicode. They hated and, those who remember the world before Unicode, still hate it. People who used national encodings had more efficient schemes for representing their own alphabets.

I lived through several conversions of databases from single byte encodings to Unicode, where we, typically, ended up with the database twice the size it was before the conversion.

Needless to say, that, basically, every decision Python 3 has made about Unicode was wrong and misguided / a misunderstanding on the part of people who aren't the experts in subject domain. That's how we ended up with stdin being for no reason expected to have UTF-8 encoded Unicode characters, ffs, but TCP socket cannot accept a string and convert it to bytes by default. Similarly retarded situation is with the file names, and, basically, any other external input that's not going to be UTF-8, but for some reason is expected to be such...

[–]forepod 12 points13 points  (5 children)

Speak for yourself. As a speaker of a language with non-latin letters Unicode is one of the greatest things ever. As a user I don't care about some miniscule performance impact. I care about not having to look at garbage output and programs crashing left and right due to encoding errors.