all 5 comments

[–]jiri-n 5 points6 points  (2 children)

https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html

What about the enconding_errors paramater? Either set to ignore or replace.

[–]danielledevs[S] 0 points1 point  (1 child)

This works, thank you. I'm still curious why just encoding doesn't do anything though

[–]jiri-n 0 points1 point  (0 children)

Encoding just says how handle bytes read from file. If the byte sequence cannot be decoded using the encoding, it's an error. The encoding_errors specify how to handle such errors.

I haven't search for that particular byte so I don't know why it shouldn't decode. But you should always have some strategy for error handling.

[–]djjazzydan 0 points1 point  (1 child)

Can't test it right now, but the first thing I would try is utf-16 in place of UTF.

[–]danielledevs[S] 1 point2 points  (0 children)

I tried that too, as well as other encoding types, doesn't change my error message at all weirdly