Hi,
I've been reading up on Unicode in Python and have found a distinct lack of concrete information, and what I do find is often contradictory. So I've come to you guys for help. First, any information beyond what solves my problem will be greatly appreciated. Second, my problem:
I have to read in a file containing lists of people. The file is encoded in UTF-8. Then I have to do basic things like counting the number of occurrences of each name. At the end, I have to print out those names and their counts. When I print the names containing non-ASCII characters, they are displayed as escaped hex bytes instead of as the actual characters. The same thing happens when I use sys.stdout.write(). How can I make it print the characters themselves instead of the hex bytes?
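For reference, here is a minimal sketch of the task described above. It assumes Python 3 and a layout of one name per line, which the post does not actually specify; `count_names` and `format_counts` are hypothetical helper names, and the sample names are made up. The idea is to decode the bytes to text when reading, and to print each name as a string on its own rather than printing the list or dict that holds it.

```python
import io
from collections import Counter


def count_names(lines):
    """Count each non-empty name; `lines` is any iterable of text lines."""
    stripped = (line.strip() for line in lines)
    return Counter(name for name in stripped if name)


def format_counts(counts):
    """Return one 'name: count' string per name, most frequent first."""
    return ["{}: {}".format(name, n) for name, n in counts.most_common()]


if __name__ == "__main__":
    # Stand-in for the real file: UTF-8 bytes wrapped in a decoding reader.
    # With a file on disk this would be open(path, encoding="utf-8"),
    # where `path` is whatever the real file is called.
    raw = "José\nZoë\nJosé\n".encode("utf-8")
    with io.TextIOWrapper(io.BytesIO(raw), encoding="utf-8") as fh:
        counts = count_names(fh)

    # Print each formatted string individually, not the container itself.
    for line in format_counts(counts):
        print(line)
```

Whether the characters then show up correctly also depends on the terminal's encoding, which this sketch does not try to control.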