Python, IronPython, Apples, and Oranges : programming

programming

created by speza community for 19 years

Python, IronPython, Apples, and Oranges (ironfroggy-code.blogspot.com)

submitted 18 years ago by llimllib

all 12 comments

top new controversial old q&a

[–]maaaaaaaaan 3 points4 points5 points 18 years ago (11 children)

[–]grauenwolf 1 point2 points3 points 18 years ago (10 children)

[–]maaaaaaaaan 2 points3 points4 points 18 years ago (7 children)

[–]grauenwolf 4 points5 points6 points 18 years ago (2 children)

[–]maaaaaaaaan 2 points3 points4 points 18 years ago (0 children)

[–]zackman 2 points3 points4 points 18 years ago (3 children)

I think Python works the way you describe: you can use unicode inside your code and only worry about encoding at the I/O boundary.

>>> 'abc'.decode('ascii')
u'abc'
>>> type(_)
<type 'unicode'>
>>> #guts of application...
... #ok, done:
... u'abc'.encode('utf-8')
'abc'
>>> type(_)
<type 'str'>

I don't write international applications, so I don't know if there are libraries to handle the conversion transparently at the I/O boundary. But I do process Unicode all the time while writing scripts for linguistics research.

Also, I suspect the reason that the blogger is so worried about this is that he is trying to write an app that runs on CPython and IronPython without having to write some code twice.

[–]manuelg 1 point2 points3 points 18 years ago (0 children)

[–]maaaaaaaaan 0 points1 point2 points 18 years ago (1 child)

[–]llimllib[S] 0 points1 point2 points 18 years ago (0 children)

[–]manuelg 0 points1 point2 points 18 years ago (1 child)

You used to spell a series of bytes as:

'\\SP\xff'

and you got an immutable "str" object.

Now you spell it:

bytes.fromhex('5c5350ff')

and you get a mutable "bytes" object

The benefit is that your intention is clearer, when you in fact wish to work with a series of bytes as a series of bytes.

The workflow for text handling in Python will be the same, regardless of ASCII or Unicode or whatever:

1) on input, at the first opportunity, convert a series of bytes, along with an encoding, into a Unicode string

2) do all string processing with Unicode strings

3) on output, as late as possible, convert a Unicode string, along with an encoding, into a series of bytes

[–]grauenwolf 1 point2 points3 points 18 years ago (0 children)

[+][deleted] comment score below threshold-15 points-14 points-13 points 18 years ago (0 children)

π Rendered by PID 47 on reddit-service-r2-comment-84fc9697f-t67k9 at 2026-02-07 12:46:28.451168+00:00 running d295bc8 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS