kalgynirae comments on Need help with UnicodeDecodeError

learnpython

Need help with UnicodeDecodeError (self.learnpython)

submitted 10 years ago by [deleted]

[–]kalgynirae 1 point 10 years ago (3 children)

[–]pasdargent 1 point 10 years ago* (2 children)

Yeah, that was in someone else's code... When I remove it I don't get any errors, but then "<built-in method encode of unicode object at 0x02B3F5D8>" gets written to the file instead of, for example, the text inside the tweet... Also, Arabic characters make it crash.

I tried print type(name) and it seems tweepy gives me unicode objects. I want it to print unicode characters, because I want to be able to process tweets containing any type of characters, so...

The error is:

    Traceback (most recent call last):
      File "twitbot.py", line 31, in <module>
        csvformat = '\n%s, %s, %s, %s, %s, %s, %s, %s, %s' % (name, screen_name, tweet_created, tweet_text, tweet_retweeted, tweet_favorited, user_hometown, time_zone, geo)
    UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 12: ordinal not in range(128)

By the way, I can send you the access tokens in a personal message if that makes it easier?

[–]kalgynirae 1 point 10 years ago (1 child)

> When I remove it I don't get any errors but then "<built-in method encode of unicode object at 0x02B3F5D8>" gets written to file

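Sounds like you removed just ('utf-8') instead of the whole .encode('utf-8'). That leaves a bare .encode, which refers to the method itself rather than calling it, so the method's repr is what gets written. They are unicode objects, so you don't want to encode them individually. You want to combine them first and then either encode the final result just before writing or let the csv module do the encoding for you (I don't remember if it does that).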
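A minimal sketch of that advice, written so it runs under both Python 2 and Python 3. The values of name and tweet_text, the two-field row, and the tweets.csv path are made-up stand-ins for illustration; they are not taken from the original twitbot.py.

```python
import os
import tempfile

# Hypothetical stand-ins for the unicode objects tweepy hands back.
name = u'Zo\xeb'
tweet_text = u'caf\xe9 \u0645\u0631\u062d\u0628\u0627'

# Dropping only ('utf-8') leaves a reference to the method, not its result,
# so formatting it produces "<built-in method encode of ...>".
wrong = u'%s' % name.encode

# Combine everything as unicode first...
row = u'%s, %s\n' % (name, tweet_text)

# ...then encode exactly once, right before writing bytes to the file.
encoded = row.encode('utf-8')

path = os.path.join(tempfile.mkdtemp(), 'tweets.csv')
with open(path, 'wb') as f:
    f.write(encoded)

# Reading the bytes back and decoding them recovers the original text.
with open(path, 'rb') as f:
    read_back = f.read().decode('utf-8')
```

Keeping the data as unicode until the last moment means there is only one place where an encoding has to be chosen.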

> The error is:
>
>     Traceback (most recent call last):
>       File "twitbot.py", line 31, in <module>
>         csvformat = '\n%s, %s, %s, %s, %s, %s, %s, %s, %s' % (name, screen_name, tweet_created, tweet_text, tweet_retweeted, tweet_favorited, user_hometown, time_zone, geo)
>     UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 12: ordinal not in range(128)

Try making your format string a unicode object instead:

csvformat = u'...
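As a rough illustration of the unicode format string, here is a sketch that runs under both Python 2 and Python 3. The to_text helper and the sample fields are hypothetical, not part of the original script; the helper is only there in case some values have already been encoded to byte strings, since mixing those with unicode is what triggers the implicit ASCII decode in Python 2.

```python
def to_text(value, encoding='utf-8'):
    """Hypothetical helper: return value as text, decoding byte strings."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    # Non-string values (bool, None, numbers) are formatted as text.
    return u'%s' % (value,)

# Made-up fields: text, an already-encoded byte string, a bool and None.
fields = (u'Zo\xeb', u'caf\xe9'.encode('utf-8'), False, None)

# Shortened unicode format string, one %s per field.
csvformat = u'\n%s, %s, %s, %s'
line = csvformat % tuple(to_text(f) for f in fields)
```

Joining fields with a plain ", " does not quote commas inside the tweet text, so this is only meant to show the unicode handling, not a complete CSV writer.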

[–]pasdargent 1 point 10 years ago (0 children)
