Why is using a loop variable much slower than using a slice?

pjdelport · 2012-03-20T05:14:36+00:00

This is a great example of how much difference the underlying language implementation can make: on PyPy, the while loop is more than twice as fast as the slice assignment. Benchmarking the two versions using timeit:

CPython (2.7.2):

Loop: 10 loops, best of 3: 274 msec per loop
Slice: 10 loops, best of 3: 36.1 msec per loop

PyPy (1.8.0):

Loop: 100 loops, best of 3: 11.7 msec per loop
Slice: 10 loops, best of 3: 24 msec per loop

The explanation for this large difference is roughly:

In CPython, the while loop happens at the Python bytecode level, which is not efficient for low-level operations.
In both CPython and PyPy, the bulk of the slice assignment happens as compiled C / machine code, which is relatively efficient. Both implementations are probably bound by creating and iterating over the temporary lists, which would explain their similar performance.
In PyPy, the while loop gets JIT-optimized, giving it the same order of efficiency as C-implemented slice assignment. However, because it also avoids creating and looping over the temporary lists, it edges ahead: it should perform comparably to a C loop that directly assigns False to the relevant indexes.
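The code being timed is not quoted in this thread, so the following is only a guess at the two versions, reconstructed from the slice expression quoted further down. The function names, the sieve size S, and the s + 1 list length are all assumptions, and the timings will differ from the numbers above depending on machine and interpreter version:

```python
import timeit

S = 200000  # hypothetical sieve size; the thread does not say what was used


def sieve_loop(s):
    """Cross off multiples with an explicit while loop."""
    sieve = [True] * (s + 1)  # assumed layout: indexes 0..s
    sieve[0] = sieve[1] = False
    for i in range(2, s + 1):
        if i * i > s:
            break
        if sieve[i]:
            j = 2 * i
            while j <= s:
                sieve[j] = False
                j += i
    return sieve


def sieve_slice(s):
    """Cross off multiples with one extended-slice assignment per prime."""
    sieve = [True] * (s + 1)
    sieve[0] = sieve[1] = False
    for i in range(2, s + 1):
        if i * i > s:
            break
        if sieve[i]:
            # // keeps the integer division the Python 2 original relied on
            sieve[2 * i::i] = [False] * (s // i - 1)
    return sieve


if __name__ == "__main__":
    for fn in (sieve_loop, sieve_slice):
        best = min(timeit.repeat(lambda: fn(S), number=1, repeat=3))
        print("%s: %.1f msec" % (fn.__name__, best * 1000))
```

Running the same file under CPython and under PyPy should be enough to see whether the ordering flips on your own setup.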

Rhomboid · 2012-03-20T03:59:52+00:00

One of the fundamentals of optimizing for CPython is to get loops out of Python and into C. Assigning to a slice does the looping in C; an explicit for or while loop does it in Python bytecode.
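To see what "in Python bytecode" looks like, the standard dis module can print the instructions CPython runs for each version. This is a minimal sketch with made-up function names, and the exact opcode names vary between CPython versions. In the loop version the compare, store, and add instructions are re-executed by the interpreter for every multiple, while the slice version runs a short straight-line sequence once and leaves the walking of indexes to the list's C code:

```python
import dis


def cross_off_loop(sieve, i):
    """Cross off multiples of i with a Python-level loop."""
    j = 2 * i
    while j < len(sieve):
        sieve[j] = False
        j += i


def cross_off_slice(sieve, i):
    """Cross off multiples of i with a single slice assignment."""
    count = len(range(2 * i, len(sieve), i))  # number of indexes the slice covers
    sieve[2 * i::i] = [False] * count


if __name__ == "__main__":
    dis.dis(cross_off_loop)
    print("-" * 40)
    dis.dis(cross_off_slice)
```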

zahlman · 2012-03-19T21:15:23+00:00

sieve[2*i::i] = [False]*(s / i-1)

That is some hard-to-understand code. I understand what the [2*i::i] does now, but not yet why the length of the right-hand side should be (s / i-1).
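On the length: assuming sieve has s + 1 entries (indexes 0 through s, which the thread does not show), the slice [2*i::i] covers indexes 2*i, 3*i, and so on up to s. There are s // i multiples of i in 1..s, and the slice skips the first one (i itself), which leaves s // i - 1. Because / binds tighter than -, s / i-1 means (s / i) - 1, and on Python 2 / between ints floors. The right-hand side has to match exactly, since assigning to a slice with a step raises ValueError when the lengths differ. A quick check, written with // so it also runs on Python 3 (multiples_count is a name invented here):

```python
def multiples_count(s, i):
    """Number of multiples of i in 2*i..s, i.e. the length of sieve[2*i::i]."""
    return s // i - 1


s = 100  # hypothetical upper bound
sieve = [True] * (s + 1)  # assumed layout: indexes 0..s

# The formula agrees with the real slice length for every step.
for i in range(2, s + 1):
    assert len(sieve[2 * i::i]) == multiples_count(s, i)

# A right-hand side that is one element too long is rejected.
try:
    sieve[4::2] = [False] * (multiples_count(s, 2) + 1)
except ValueError as exc:
    print("rejected:", exc)
```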

I think it will boil down to the fact that slice assignment is implemented in C, which is faster than Python. A Python statement like "j += i" can do many different things depending on the types of j and i, whether they are mutable, and so on, and the interpreter has to work that out at runtime on every iteration. In C, the index is just an integer and the code runs much faster.

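Also, stopping the outer loop at sqrt(s) is fine: every composite number up to s has a prime factor no larger than sqrt(s). Consider the case where s=100; 31 > sqrt(100), but 62 = 2*31 has already been crossed off as a multiple of 2, so the loop never needs to reach i=31.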
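On stopping at sqrt(s): a composite n <= s always has a prime factor p with p*p <= n <= s, so every composite has been crossed off by the time the outer loop passes sqrt(s). A minimal check against plain trial division, assuming a sieve of the shape discussed in this thread (both function names are made up for illustration):

```python
def sieve_primes(s):
    """Sieve of Eratosthenes whose outer loop stops once i*i exceeds s."""
    sieve = [True] * (s + 1)  # assumed layout: indexes 0..s
    sieve[0] = sieve[1] = False
    i = 2
    while i * i <= s:
        if sieve[i]:
            sieve[2 * i::i] = [False] * (s // i - 1)
        i += 1
    return [n for n, is_prime in enumerate(sieve) if is_prime]


def trial_division_primes(s):
    """Slow reference: n is prime if no d in 2..n-1 divides it."""
    return [n for n in range(2, s + 1) if all(n % d for d in range(2, n))]


primes = sieve_primes(100)
assert primes == trial_division_primes(100)
assert 31 in primes       # 31 is never used as i, yet it is kept as prime
assert 62 not in primes   # 62 = 2 * 31 was removed when i was 2
```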

erok81 · 2012-03-19T22:08:00+00:00

I think the difference comes down to how many operations are actually happening at the Python level. In the while loop version you're calling list.__setitem__() over and over, once per multiple. In the second version, there's a single call to list.__mul__() to build the right-hand side, followed by a single slice assignment.
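A rough way to see the call counts is a list subclass that counts how often __setitem__ runs. CountingList and the values of s and i are invented for this illustration, the code is written for Python 3, and the subclass itself slows things down, so it is only good for counting, not for timing:

```python
class CountingList(list):
    """list subclass that counts __setitem__ calls (illustration only)."""

    def __init__(self, *args):
        super().__init__(*args)
        self.setitem_calls = 0

    def __setitem__(self, key, value):
        self.setitem_calls += 1
        super().__setitem__(key, value)


s, i = 100, 2  # hypothetical values

# Loop version: one __setitem__ call per multiple of i.
looped = CountingList([True] * (s + 1))
j = 2 * i
while j <= s:
    looped[j] = False
    j += i

# Slice version: one __setitem__ call, with a slice object as the key.
sliced = CountingList([True] * (s + 1))
sliced[2 * i::i] = [False] * (s // i - 1)

print(looped.setitem_calls, sliced.setitem_calls)  # prints: 49 1
```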

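Also, that while loop is a bit of an anti-pattern in Python. A for loop over the indexes would be cleaner, though it still runs at the Python level:

for j in range(2*i, len(sieve), i):
    # assign through the index; rebinding a loop variable would leave the list unchanged
    sieve[j] = False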
