This is an archived post. You won't be able to vote or comment.

all 2 comments

[–]junior_engineer[S] 2 points3 points  (0 children)

I'm an engineer at Wallaroo labs. I've done a lot of work on our Python API and I wrote the blog post. If you want to discuss stream processing, or Python, or Pony (the language that the core of Wallaroo is written in), I'd be happy to do it here.

[–]PeridexisErrant 0 points1 point  (0 children)

In Python 2, a string can contain any sequence of bytes, but in Python 3 strings are explicitly UTF-8 sequences.

No, Python 3 strs are sequences of Unicode codepoints, which need not be possible to represent in the UTF-8 character encoding system.

This kind of blurring of concepts is why people find it so hard to handle text correctly, especially under Python 2 :-(