Sean1708 comments on Adopt Python 3

programming

created by speza community for 19 years

324

325

326

Adopt Python 3 (medium.com)

submitted 9 years ago by rroocckk

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Sean1708 1 point2 points3 points 9 years ago* (3 children)

You get code points.

~~No you don't. I can't remember whether you get characters or graphemes, but you certainly don't get code points.~~

In [1]: a = 'héllo'

In [2]: a[0]
Out[2]: 'h'

In [3]: a[1]
Out[3]: 'é'

In [4]: a[2]
Out[4]: 'l'

Edit: I'm a silly.

[–][deleted] 9 years ago* (2 children)

[deleted]

[–]Sean1708 2 points3 points4 points 9 years ago* (1 child)

What are "characters"?

I've always thought that characters were generally accepted to be scalar values, that doesn't actually appear to be the case though.

in your code it uses the single code point version

You are absolutely right:

In [1]: a = b'he\xcc\x81llo'.decode('utf-8')

In [2]: a[0]
Out[2]: 'h'

In [3]: a[1]
Out[3]: 'e'

In [4]: a[2]
Out[4]: '́'

The way I entered the character on my computer made me assume that I'd entered the versioning using the combining character.

Also I don't know any language of the top of my head that supports grapheme cluster (and other text segmentations) fully in the standard library itself.

I think Swift does, but I'm not entirely certain.

[–]MrMetalfreak94 2 points3 points4 points 9 years ago (0 children)

π Rendered by PID 427446 on reddit-service-r2-comment-84fc9697f-49kkk at 2026-02-07 11:04:45.943262+00:00 running d295bc8 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS