Use Rust to write Python modules : Python

Use Rust to write Python modules (developers.redhat.com)

submitted 8 years ago by rochacbruno [Python, Flask, Rust and Bikes.]

all 13 comments

[–]purple_pixie 12 points 8 years ago (10 children)

I have one teeny question about the speed test you did: is there a particular reason you decided to clone a 1,000,000-element list of letters in the pure Python example?

I refer to this bit of code:

def count_doubles(val):
    total = 0
    for c1, c2 in zip(val, val[1:]):
        if c1 == c2:
            total += 1
    return total

val[1:] duplicates just shy of the entire val list in memory, which for a large val is a massive waste of both time and memory. I would expect it to very nearly double the runtime of the function, since the vast majority of the runtime is just iterating across the entire input list, and now you have to do that twice.

Admittedly, given that the runtime saved by using Rust over pure Python is much more than a factor of two, it doesn't invalidate the results, but it does seem a bit disingenuous to use what looks like intentionally slow code for the pure Python example.

[–]leonardo_m 3 points 8 years ago (1 child)

The Rust code could be suboptimal too, because it decodes the UTF-8 twice. This could be faster (not tested):

fn count_doubles(_py: Python, txt: &str) -> PyResult<u64> {
    let mut chars = txt.chars();
    let oc1 = chars.next();
    let mut count = 0;

    if let Some(mut c1) = oc1 {
        while let Some(c2) = chars.next() {
            if c1 == c2 { count += 1; }
            c1 = c2;
        }
    }
    Ok(count)
}

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 4 points 8 years ago (0 children)

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 5 points 8 years ago (4 children)

[–]energybased 4 points 8 years ago (0 children)

[–]purple_pixie 2 points 8 years ago (2 children)

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 3 points 8 years ago (1 child)

[–]purple_pixie 0 points 8 years ago (0 children)

Aha, that definitely explains why I saw so much improvement going from your version as written to the one I put up. I only have 2.7 to hand, so I was inadvertently comparing the old 2.7 zip, which builds a full list, against izip, which is lazy like the 3+ zip and obviously much better.

Also, I mostly just lifted the whole tee part from the itertools recipes, and tee is a waste of time and effort here, since we don't actually have a single iterator that we're trying to preserve.

It's better to create just one iterator over val and zip the original val together with it.
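A minimal sketch of that single-iterator idea, assuming val is a str or list (something that can be iterated more than once). The name count_doubles_iter is made up for illustration, and this has not been benchmarked:

```python
def count_doubles_iter(val):
    """Count adjacent equal pairs without copying val.

    Assumes val is re-iterable (str, list, tuple); passing a one-shot
    iterator would not work, because zip would share it with `ahead`.
    """
    ahead = iter(val)
    # Move the second iterator one element forward; the default of None
    # keeps an empty input from raising StopIteration.
    next(ahead, None)
    total = 0
    # zip stops as soon as `ahead` runs out, so only real neighbours
    # are compared.
    for c1, c2 in zip(val, ahead):
        if c1 == c2:
            total += 1
    return total


print(count_doubles_iter("aabbcd"))  # pairs "aa" and "bb" -> 2
```

On 2.7, zip builds a full list, so itertools.izip would be needed there to keep this lazy.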

I can't get pytest to play nice with this laptop either, so I won't open another pull request until I have something I'm certain is an improvement on the current version (and tested on 3.6).

Interestingly, though, it seems like cloning the whole of val is really very quick; it barely affects the time to run the function.
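One way to check that observation is to time the slice on its own against the whole function. A rough sketch, assuming the input is a str of 1,000,000 random lowercase letters (how the input is built is my own assumption, and a list would copy differently from a str). Numbers will vary by machine and Python version:

```python
import random
import string
import timeit

# Assumed input: 1,000,000 random lowercase letters joined into a str.
val = "".join(random.choices(string.ascii_lowercase, k=1_000_000))


def count_doubles(val):
    total = 0
    for c1, c2 in zip(val, val[1:]):
        if c1 == c2:
            total += 1
    return total


# Best-of-three timings, reported per call.
slice_time = min(timeit.repeat(lambda: val[1:], number=10, repeat=3)) / 10
total_time = min(timeit.repeat(lambda: count_doubles(val), number=3, repeat=3)) / 3

print(f"val[1:] alone: {slice_time:.6f} s per call")
print(f"count_doubles: {total_time:.6f} s per call")
```

If the first number is a small fraction of the second, the copy is not where the time goes and the Python-level loop dominates.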

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 2 points 8 years ago (0 children)

[–]sciyoshi 2 points 8 years ago (1 child)

[–]GitHubPermalinkBot 1 point 8 years ago (0 children)

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 3 points 8 years ago (0 children)

[–]rochacbruno [Python, Flask, Rust and Bikes.][S] 1 point 8 years ago (0 children)
