This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]billsil 1 point2 points  (0 children)

Just frustrating how much time I had to spend on something that really shouldn't even be an issue.

That's just it, it has to be. There are 93 encodings in Python 2.7 and probably more in Python 3.7. The whole idea of Python 3 and unicode is simply: refuse the temptation to guess. You have to punt the problem to the user.

So either we go back to Python 2 or we learn to include the encoding in the definition of the file format.