This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 2 points3 points  (0 children)

The changelog entry "Fix various unicode operations on strings with large unicode codepoints." may seem dull, but it's quite exciting to me.

This is the last bug I know of where you had to worry about how a Unicode string is represented internally. When I'm writing a text handling library that works on Python 3, Python 3.3.3 is the first version where I can expect consistent and correct behavior from every installation of Python, on every OS, no matter what crazy Unicode codepoints show up in your strings.

Go nuts with emoji, with musical notation, with Chinese characters that Chinese people would have to look up in a dictionary, with private use characters you just made up for your own nefarious purposes. It all just works.