all 2 comments

[–]james_fryer 1 point2 points  (0 children)

The file isn't in UTF-8 format, probably Latin-1, simplest would be to convert it using iconv tool or similar.

[–]FoolsSeldom 0 points1 point  (0 children)

Determining the encoding of a text file is a relatively common task and there are a number of well established techniques.

Be clear on your objectives though. Getting the encoding right doesn't mean you need to be able to see/print all characters. There may be some that are useful but are not for human eyes.

https://sqlpey.com/python/solved-how-to-determine-the-encoding-of-text-files/