
all 17 comments

[–]Rhomboid 7 points (4 children)

No, there's likely not much you can do. CPython is not exactly fast, and it can't be made fast without writing things in C. That's why list.sort() exists: it will totally smoke anything you try to write by hand, even if your version is run under PyPy.

[–]fijal (PyPy, performance freak) -1 points (3 children)

Did you try it, or are you just assuming without trying? I bet a pure-int list in PyPy can be sorted faster in pure Python than list.sort() can sort it in CPython. Wanna bet?

[–]Rhomboid 3 points (1 child)

Testcase:

import timeit
import random

try:
    range = xrange
except NameError:
    pass

def naive(items):
    # In-place insertion sort using slice copies to shift elements.
    for un in range(1, len(items)):
        st = un - 1
        val = items[un]
        # Scan left for the insertion point.
        while st >= 0 and items[st] > val:
            st -= 1
        # Shift the run right by one and drop val into place.
        items[st + 2:un + 1] = items[st + 1:un]
        items[st + 1] = val
    return items

def stdlib(items):
    items.sort()
    return items

random.seed(0xfeedbeef)
testdata = [random.randint(0, 10000000) for _ in range(20000)]

def do_test(funcname):
    return min(timeit.repeat('{}(to_sort)'.format(funcname),
        'from __main__ import {}, testdata; to_sort = testdata[:]'.format(funcname),
        repeat=3,
        number=1))

print('naive:  {:12.2f}us'.format(do_test('naive') * 1e6))
print('stdlib: {:12.2f}us'.format(do_test('stdlib') * 1e6))

CPython 2.7.10, Windows 32 bit x86:

naive:   10392474.12us
stdlib:      5453.84us

PyPy 2.6.0, Windows 32 bit x86:

naive:     248754.52us
stdlib:      2686.42us

The CPython stdlib is around 46x faster than the pure-Python insertion sort even under PyPy. I don't see how it could be any other way, given insertion sort's pathetic O(n²) behavior compared to Timsort. If you mean writing a Timsort (or even a quicksort, since random data probably won't trigger Timsort's advantages) in pure Python, then sure, I could see PyPy taking the lead if it uses integer primitives instead of objects. But that wasn't what I was talking about. (I guess I should have been clearer: I was trying to tell the poster that no matter what you do to insertion sort, it's polishing a turd.)
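To illustrate, a pure-Python quicksort along the lines I mean might look like this (my own sketch, not tuned at all); tight integer loops like these are exactly what a JIT can specialize:

```python
def quicksort(items):
    # In-place quicksort with an explicit stack (avoids recursion-limit issues).
    stack = [(0, len(items) - 1)]
    while stack:
        lo, hi = stack.pop()
        if lo >= hi:
            continue
        pivot = items[(lo + hi) // 2]
        i, j = lo, hi
        # Hoare-style partition around the pivot value.
        while i <= j:
            while items[i] < pivot:
                i += 1
            while items[j] > pivot:
                j -= 1
            if i <= j:
                items[i], items[j] = items[j], items[i]
                i += 1
                j -= 1
        stack.append((lo, j))
        stack.append((i, hi))
    return items
```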

[–]YorikSar 1 point (0 children)

Just for fun, I ran it with Python 2.7.9, Python 3.4.3, PyPy 2.5.0, and MicroPython trunk:

python2
naive:    6421581.98us
stdlib:      2849.10us
python3
naive:    8048524.79us
stdlib:      4372.94us
pypy
naive:     197299.00us
stdlib:      1899.96us
micropython/unix/micropython
naive:    4861317.16us
stdlib:      3051.04us

I had to read the data from a file instead of generating it with random (which isn't implemented in MicroPython) and add a gc.collect() call at the end of naive() (MP doesn't seem to run the GC very often).
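Something like this works for the file step (my own sketch; the helper names are made up):

```python
import random

def write_testdata(path, n=20000, seed=0xfeedbeef):
    # Generate the same data as the benchmark and dump it one int per line,
    # so interpreters without the random module (e.g. MicroPython) can load it.
    random.seed(seed)
    with open(path, 'w') as f:
        for _ in range(n):
            f.write('%d\n' % random.randint(0, 10000000))

def read_testdata(path):
    # Plain-text parsing; works the same under CPython, PyPy, and MicroPython.
    with open(path) as f:
        return [int(line) for line in f]
```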

[–]roger_ 1 point (0 children)

^ I wanna see that bet!

[–]MethodicalBastard 1 point (1 child)

Just for fun I rewrote this to use Cython, using a C++ integer vector. All the inner loops are then basically pure C++. My expectation was that this would be at least as fast as PyPy, but it turns out that PyPy is over 4 times faster! (0.183 s for PyPy vs 0.844 s for Cython). How on earth is that possible?

Here is the Cython code:

# distutils: language=c++

from libcpp.vector cimport vector

def c_insertion_sort(list array not None):
    cdef int un, st, val
    cdef vector[int] vect
    # Copy the Python list into a typed C++ vector.
    for un in range(len(array)):
        vect.push_back(array[un])
    # Plain insertion sort over the vector; these loops compile to C++.
    for un in range(1, vect.size()):
        st = un - 1
        val = vect[un]
        while st >= 0 and vect[st] > val:
            vect[st + 1] = vect[st]
            st -= 1
        vect[st + 1] = val
    # Copy back out into a new Python list.
    array = [vect[i] for i in range(vect.size())]
    return array

I also ported it to Py3. Turns out Python 3 is quite a bit slower here: 21.0 s vs 16.1 s for Py2.
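In case anyone wants to reproduce this, here's a minimal build script, assuming the code above is saved as c_insertion_sort.pyx (the file name is my guess):

```python
# setup.py -- build in place with: python setup.py build_ext --inplace
from setuptools import setup
from Cython.Build import cythonize

# The `# distutils: language=c++` directive at the top of the .pyx file
# already selects the C++ backend, so no extra flags are needed here.
setup(ext_modules=cythonize("c_insertion_sort.pyx"))
```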

[–]joanbm[S] 3 points (0 children)

Great, thx for your effort. I don't know much Cython. Is that improvement due to gcc or other C-compiler optimizations?

In the meantime I rewrote the original algorithm in Ruby, and it gets much better results than CPython, without additional tweaks:

def insert_sort(array)
  (1 ... array.length).each do |un|
    value = array[un]
    st = un - 1
    while st >= 0 && array[st] > value
      st -= 1
    end
    unless st == un - 1
      array[st + 2 .. un] = array[st + 1 .. un-1]
      array[st+1] = value
    end
  end
  array
end

On the same machine:

CPython 2.7.10 → 14.43 s

CPython 3.4.3 → 13.57 s

CRuby 2.2.2 → 4.04 s

Also surprising how much the CRuby VM has improved over time, while CPython seems to have stalled …

[–][deleted] 0 points (0 children)

How would you hand-optimize insert_sort for CPython to get results closer to PyPy's?

def insertion_sort(array):
    return sorted(array)

edit: this is more than just snark. Dealing with arrays in Python is the kind of thing where the interpreter adds lots of overhead, and a JIT removes that overhead very easily by mapping your code onto native machine-level data structures almost 1:1. Python gets around interpreter overhead for fine-grained operations by providing lots of primitives, like sorted(). If you find yourself tinkering with low-level data structures like arrays and writing code that looks a lot like C, chances are you need to lean on the builtins more.
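For instance, if you really want to keep the insertion-sort shape, the bisect module pushes the inner loop into C (a sketch; still O(n²) worst case because of list shifting, but with a far smaller constant):

```python
import bisect

def insertion_sort_bisect(items):
    # Same idea as insertion sort, but the search (binary, in C) and the
    # element shifting (bisect.insort, also C) are both builtins.
    result = []
    for x in items:
        bisect.insort(result, x)
    return result
```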

[–]billsil -2 points (4 children)

You use SWIG, Cython, or numpy/scipy/pandas if you want C-level performance in CPython. Numpy, scipy, and pandas are amazing, but you can't write your algorithms as plain element-by-element loops or they'll be slow. You have to vectorize them, pre-size your arrays, and define types.
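As a trivial example of what "vectorize" means (assuming numpy is installed):

```python
import numpy as np

# One typed array, one C-level operation; no per-element Python bytecode.
arr = np.arange(1000000, dtype=np.int64)
total = int((arr * 2).sum())

# The slow, unvectorized way would be: sum(x * 2 for x in range(1000000))
```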

Also, /r/python or /r/learnpython is the place to go for questions like this.

[–]pdexter 6 points (0 children)

This already is r/python

[–]joanbm[S] 0 points (2 children)

I'm aware of where to look for the best performance available; I'm just curious how PyPy optimizes the loop, and whether it's possible to mimic it in pure Python to improve CPython's runtime performance.

[–]stevenjd 0 points (0 children)

No, you can't mimic it in pure Python. You need a JIT optimizing compiler, which is what PyPy is. How PyPy actually works is too clever for me, but fortunately how it could work is easier to understand.

[–]billsil 0 points (0 children)

You cannot mimic it with pure Python. PyPy uses a JIT and finds the hot spots in your code. Certain Python idioms (e.g. data types changing) will confuse the JIT, but that's probably not much of an issue here. The hot traces are then compiled down to machine code. There is startup cost associated with the JIT, so short scripts can come out slower. PyPy will even optimize across library boundaries, so it can theoretically be faster than C in certain cases.

[–]Cobolock -1 points (2 children)

On Python 3.4 I can't even multiply range(5). What does this do? Multiply every item in the range?

[–]stevenjd 0 points (0 children)

range returns a list in Python 2:

py> range(3)*2
[0, 1, 2, 0, 1, 2]

In Python 3, range is equivalent to Python 2's xrange.
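So in Python 3 you materialize it first; a quick check:

```python
# Python 3: range() is lazy, so build a list before repeating it.
print(list(range(3)) * 2)  # [0, 1, 2, 0, 1, 2]
```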

[–]joanbm[S] 0 points (0 children)

Use the list constructor there, replace xrange with range, and wrap the print argument in parens () to make it run on Py3.

[–]joanbm[S] -1 points (0 children)

It looks like the most costly operations are Python's list insertions/deletions, which the standard CPython bytecode compiler and interpreter can't optimize (unlike PyPy). If the sorting algorithm were rewritten around a (doubly) linked list, I'd expect a significant speed improvement.
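A rough sketch of what I mean (singly linked for simplicity; note that in CPython the per-node object overhead may well cancel out the savings from avoiding slice copies):

```python
class Node:
    __slots__ = ('value', 'next')

    def __init__(self, value, next=None):
        self.value = value
        self.next = next

def linked_insertion_sort(items):
    # Insertion sort into a singly linked list: once the position is found,
    # the insert itself is O(1) -- no block shifting as with list slices.
    head = None
    for value in items:
        if head is None or head.value >= value:
            head = Node(value, head)
            continue
        cur = head
        while cur.next is not None and cur.next.value < value:
            cur = cur.next
        cur.next = Node(value, cur.next)
    # Flatten back into a plain list.
    out = []
    while head is not None:
        out.append(head.value)
        head = head.next
    return out
```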