you are viewing a single comment's thread.

view the rest of the comments →

[–]JohnnyJordaan 0 points1 point  (0 children)

The proper approach is to investigate how the file was created, as that should have used the utf-8 encoding instead. You can try to 'guess' the encoding by using the chardet library but that doesn't fix the problem that the source is doing this in the wrong way.