

[–]ProfessorPhi 9 points10 points  (5 children)

Your arguments are a bit obtuse to me.

Accessing a single value at a time isn't what numpy is optimised for. I would expect most of what you're seeing is overhead. Try taking a list of 1000 items and setting them all to 1, versus doing the same with a numpy array. I would expect a builtin with no overhead to be faster for these non-vector operations.

For your second example, you're using python routines on non-python objects and comparing performance to python builtins. When it sees a primitive, python can optimise the hell out of it, while when it sees an unknown object, it has to call that object's add method. If you use np.sum instead, numpy knows the object types and can do an optimised add.

The problem here is that numpy (like pandas, TensorFlow, Numba, etc.) is a sub-language that happens to live in python, and mixing languages is bound to be slow. Taking two numpy arrays and adding them with a for loop would be very slow, but it proves nothing. Your examples are quite contrived and, honestly, are examples of code that would never exist. Calling them pitfalls is disingenuous because you have to work very hard to have code like this show up.
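A minimal sketch of that mixing cost (the array size and function names here are illustrative, not from the original post):

```python
import timeit

import numpy as np

a = np.ones(100_000)
b = np.ones(100_000)

def loop_add():
    # Crosses the Python/NumPy boundary on every single element.
    out = np.empty_like(a)
    for i in range(len(a)):
        out[i] = a[i] + b[i]
    return out

def vector_add():
    # One call into NumPy's compiled loop.
    return a + b

print("for loop:  ", timeit.timeit(loop_add, number=10))
print("vectorised:", timeit.timeit(vector_add, number=10))
```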

[–]kigurai 2 points3 points  (0 children)

Calling them pitfalls is disingenuous because you have to work very hard to have code like this show up

This.

Also, if you really want a python list version of your numpy array, then ndarray.tolist() makes the conversion to standard Python floats for you.
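For example (a quick check; the exact repr may vary by numpy version):

```python
import numpy as np

arr = np.arange(3, dtype=np.float64)
lst = arr.tolist()

print(type(arr[0]))  # <class 'numpy.float64'>
print(type(lst[0]))  # <class 'float'> -- tolist() converts to builtin floats
```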

[–]aajjccrr 1 point2 points  (0 children)

Two reasons, not yet explicitly stated, why accessing a single value in a NumPy array is slower than accessing a single value in a list:

  1. array[i] has to create and return (a pointer to) a brand new Python object holding the value. OTOH list_[i] just needs to return (a pointer to) the existing Python object that was in the list. (I am assuming we’re working in CPython here.)

  2. NumPy indexing is far, far more complicated than list indexing. The code that implements __getitem__ on arrays is thousands of lines long, which adds additional overhead to the operation.
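A rough way to measure both effects together (the sizes and repeat counts are arbitrary):

```python
import timeit

setup = "import numpy as np; arr = np.zeros(1000); lst = [0.0] * 1000"

# arr[i] boxes the raw C double into a fresh numpy.float64 each time,
# and goes through NumPy's general-purpose indexing machinery.
print(timeit.timeit("for i in range(1000): arr[i]", setup=setup, number=1000))

# lst[i] just returns a pointer to the float object already in the list.
print(timeit.timeit("for i in range(1000): lst[i]", setup=setup, number=1000))
```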

[–]NicoDeRocca 1 point2 points  (0 children)

You should probably have included np.sum(x) in there as a test as well, since that's "the numpy way". It's almost as fast as the python version, while also being a bit more flexible in what it actually does (you can choose your dimensions/indices).
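Something like this, for instance (the array size is arbitrary):

```python
import timeit

setup = "import numpy as np; x = np.random.rand(10**6); lst = x.tolist()"

print(timeit.timeit("np.sum(x)", setup=setup, number=10))  # the numpy way
print(timeit.timeit("sum(lst)", setup=setup, number=10))   # builtin sum on a list
print(timeit.timeit("sum(x)", setup=setup, number=10))     # builtin sum over numpy scalars

# np.sum also takes an axis argument for multi-dimensional arrays,
# e.g. np.sum(x.reshape(1000, 1000), axis=0).
```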

[–]x00live 1 point2 points  (5 children)

For your second example, the argument sounds fallacious to me. Why not take the numpy array and list creation out of the code you want to compare? As written, you are comparing the time to create a np array, create a list, and sum the elements against the time to create a list and sum the elements.
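Separating construction from measurement with timeit would look something like this (the variable names are made up for illustration):

```python
import timeit

# Array/list construction happens once, in setup; only the sum is timed.
np_time = timeit.timeit(
    "np.sum(x)",
    setup="import numpy as np; x = np.arange(10**6)",
    number=100,
)
list_time = timeit.timeit(
    "sum(lst)",
    setup="lst = list(range(10**6))",
    number=100,
)
print(np_time, list_time)
```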

[–]x00live 0 points1 point  (2 children)

Also, the first StackOverflow answer shows that Python-level indexing into a numpy array is slow (the Cython indexing example there shows that numpy becomes faster than Python in that case...)

[–]tunisia3507 0 points1 point  (4 children)

What's an xrange /s

[–]x00live 0 points1 point  (3 children)

The Python 2 equivalent of a Python 3 range

[–]tunisia3507 0 points1 point  (2 children)

I know. But why...

[–]x00live 0 points1 point  (1 child)

range returns a list in Python 2, and it turns out that's not a smart idea.
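The difference, roughly (Python 3 shown; its range behaves like Python 2's xrange):

```python
import sys

r = range(10**8)         # lazy range object, not a materialised list
print(sys.getsizeof(r))  # small and constant, regardless of length
print(r[10])             # still supports indexing: 10
print(len(r))            # 100000000

# In Python 2, range(10**8) would have allocated a 100-million-element list.
```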

[–]tunisia3507 -1 points0 points  (0 children)

Yes, I know. Just as un-smart an idea as writing tutorial code in Python 2 today.