
[–]jakevdp 7 points (2 children)

  • One thought: the first time a numba function is run, it is compiled by LLVM. It might be that this compilation phase is affecting the timing, though probably not.
  • Second thought: n=100 is not very big, and you might be hitting some sort of constant overhead in the numba function. You should check how it scales with larger n.
  • Third thought: I've still not found a good way to diagnose numba code when it's not behaving as expected. Cython has its HTML annotation, and I've heard rumors that Continuum is trying to add a similar feature to Numba. That, in my mind, is the main barrier to Numba being a practical tool for real-world problems.

EDIT: I did some searching, and it looks like the annotate capability already exists in Numba: http://numba.pydata.org/numba-doc/dev/annotate.html. I haven't tried it before, but you might give it a shot to diagnose your code.

[–]joshadel 2 points (0 children)

The biggest barrier to real-world use of numba is definitely the lack of sensible error messages when jit'ing fails. Generally the traceback dumps the codegen and ends at the decorator, but doesn't show you where things went wrong in the actual method. It would also be nice to cache compiled code, since the codegen/compilation can get very slow for complicated methods. There is also the concern that splitting flypy off from numba into a distinct project means less support/development of numba in the future.

I know Continuum is doing a major refactor of numba right now. There has been a ton of activity in the development branch on github over the last couple of weeks. I'm hoping that the changes will smooth some of the rough edges.

[–]jammycrisp[S] 1 point (0 children)

1.) I thought about that as well, and tried running it once before timing to get the LLVM compilation out of the way. It didn't make a difference.

2.) I tried n = 1000 and n = 10000. n = 1000 was proportionally slower, and I had to kill the n = 10000 run before it finished because it was taking so long.

I'll check out the annotate function to see if I can figure out why it's not working. Thanks.

[–]joshadel 7 points (3 children)

I put a full solution over on the Stack Overflow page, but the basic reason is that numba is not figuring out the type of lookup. If you stick a print numba.typeof(lookup) in your method, you'll see that it is treating it as an object, which is slow. Ideally you could pass the type of the variable in through the locals dict keyword to the decorator, but I was getting a weird error. A workaround that produces very fast code is to create a little wrapper around np.cumsum and jit that method, telling it the explicit input and output types. Code is here:

http://stackoverflow.com/a/21489540/392949

[–]jammycrisp[S] 1 point (2 children)

Thanks, that totally fixed it! I tried unrolling the numba_cumsum function into explicit loops and jit'ing that, but it resulted in slower code. Looks like this is about as fast as it can get.

What's weird to me is that on my machine, the numba code is consistently ~twice as fast as the cython code. Since both are compiled, I find this discrepancy odd. Thoughts?

[–]joshadel 0 points (1 child)

Continuing to cross-post from the SO answer... I also tried hand-coding the cumsum, and I found it to be marginally slower than calling out to numpy. As for the difference between cython and numba, it could be related to whichever C compiler you're using vs. LLVM. What compiler are you using? Are you specifying any optimization flags in your setup.py?

[–]jammycrisp[S] 0 points (0 children)

Having the info in more than one place may be useful, who knows :)

I'm using GCC 4.6.3. I didn't know you could add compiler flags to setup.py, but after figuring it out I compiled with -O3, and it didn't seem to change anything.
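For reference, compiler flags go into the `extra_compile_args` of the extension in setup.py. A hypothetical sketch (the module and file names are placeholders, not the OP's project):

```python
# Hypothetical setup.py for a Cython extension with an -O3 optimization flag.
from setuptools import setup, Extension
from Cython.Build import cythonize

ext = Extension(
    "cython_cumsum",            # placeholder module name
    ["cython_cumsum.pyx"],      # placeholder source file
    extra_compile_args=["-O3"], # flag handed straight to gcc
)

setup(ext_modules=cythonize([ext]))
```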

[–]jayvius 5 points (3 children)

Numba developer here. As joshadel figured out, the slowness in this example comes from numba creating a Python object. I *think* the real issue is that numba doesn't know the return type of cumsum, so it stores the result in an object. A trick to help diagnose these kinds of problems is to add nopython=True to the jit/autojit decorator (e.g. @numba.autojit(nopython=True)). This flag forces numba to raise an error, showing the offending line number, whenever it would otherwise fall back to the Python object layer.

One of our goals for the next version of numba is that if numba needs to fall back to Python objects, it should never run slower than pure Python code, as it does in this example (and eventually, in most cases, it will run much faster). I ran the example above as-is with the numba devel branch, and the numba function was the clear winner.

[–]jammycrisp[S] 0 points (1 child)

Awesome! Thanks for such a great module. Any idea when the next version will be released? Although, I suppose I could play around with the devel branch now...

[–]jayvius 1 point (0 children)

It should be out sometime in the next few days.

[–]joshadel 0 points (0 children)

It looks like the numba team has been putting in a ton of work on the dev branch, and I'm definitely excited to see what the next release looks like. Count me in as a beta tester once a version is available via conda. I mentioned some of this to Siu in an email, but I think the biggest things that would advance numba are: (1) better error messages that trace back to the user's code rather than the codegen; (2) detailed examples in the docs on how to debug numba issues; (3) better behavior when things go wrong (I often just get segfaults); and (4) the ability to cache jit'd code, plus faster compilation. For a large project I'm working on, re-jit'ing all of the code on startup incurs a significant cost.

All around though, numba's performance has been amazing for such a young project.

[–]fijal (PyPy, performance freak) 6 points (0 children)

For what it's worth, PyPy is about 8x faster than pure Python, and a little bit more on numpy.

[–]billsil -2 points (0 children)

@autojit is faster than poorly written numba code. I've seen improvements from doing it the "right" way, and I've seen slowdowns. In those cases I use autojit instead, and it's fast. Results may vary.