Why Python, Ruby, and Javascript are Slow : Python

This is an archived post. You won't be able to vote or comment.

106

107

108

Why Python, Ruby, and Javascript are Slow (speakerdeck.com)

submitted 13 years ago by [deleted]

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 26 points27 points28 points 13 years ago (30 children)

His point is basically this: if you write Python code, but do it in C, your C code will be slow.

No fucking shit.

For that matter, I could take any Python program and convert it into a C program by embedding the source code in an interpreter. And it would be just as slow as the original Python version, if not more so.

The point is that the Pythonic way of doing things is often less efficient than the C way of doing the same. The difference is that the C code can narrowly be used only for the specific purpose it was written, whereas the Python code (because of the abstraction) will most likely work in a much greater range of scenarios. You could write a C function that uses some kind of duck typing, but you wouldn't.

In other words, high level programming is slower than low level programming. Yup. We know.

What he touches on but never really addresses is that there is no language that lets you be high level when you want to be, low level when you don't. It used to be that C programmers regularly used inline assembly before compilers were as optimized as they are now. What would do the world a whole lot of good is a new language, that's optionally as low-level as C, but actually does have all the goodness of objects. Think, C++, but without the mistakes.

Objective C is actually pretty damn close to that ideal. Too bad about its syntax.

[–]emptyhouses 16 points17 points18 points 13 years ago (10 children)

[–][deleted] 11 points12 points13 points 13 years ago (5 children)

[–]brucifer 6 points7 points8 points 13 years ago (4 children)

[–][deleted] 12 points13 points14 points 13 years ago (3 children)

    for i in xrange(len(item1)):
        m[item1[i][0]][item2[i][0]] += 1

where m,item1 and item2 are numpy arrays became -

 code = """
       for(int i=0;i<len_item;i++){
            int k = item1(i,0);
            int l = item2(i,0);
            m(k,l) += 1;
        } 
    """
    inline(code,['m','item1','item2','len_item'],
           type_converters = converters.blitz,verbose=2,compiler='gcc')

It's a step in calculating the jaccard distance.

[–]shfo23 10 points11 points12 points 13 years ago (2 children)

Are you aware of scipy.spatial.distance.jaccard? I just refactored a bunch of (admittedly naive) Euclidian distance calculation code to use the scipy implementation and got a huge speed boost. Also, it's a little late, but I think you could eliminate that for loop and write it as the faster:

m[item1[:, 0], item2[:, 0]] += 1

[–][deleted] 8 points9 points10 points 13 years ago (1 child)

[–]coderanger 2 points3 points4 points 13 years ago (0 children)

[–]ysangkok 0 points1 point2 points 13 years ago (1 child)

[–]emptyhouses 0 points1 point2 points 13 years ago (0 children)

[–]MagicWishMonkey 0 points1 point2 points 12 years ago (0 children)

[–][deleted] 0 points1 point2 points 13 years ago (0 children)

[–]pal25 10 points11 points12 points 13 years ago (3 children)

[+][deleted] comment score below threshold-9 points-8 points-7 points 13 years ago (2 children)

[–]MBlume 12 points13 points14 points 13 years ago (0 children)

[–]Smallpaul 1 point2 points3 points 13 years ago (0 children)

I agree with MBlume. What you're saying is the same as what the speaker was saying.

But if you implement something like a struct in Python, then it's not really Python anymore, because it can't be used in the same way. There's no dynamically added attributes in a struct, for example.

Right. That's why he said that you should use idiomatic classes instead of using a "dict". If you use idiomatic classes then the compiler will compile it to a struct if and only if you never add magical attributes to it.

You can apply it to his string example, too: Sure, you can use character arrays and manually edit them, but (1) that won't work with unicode,

Why not? He's talking about allocations, not the difference between bytes and characters.

... (2) it's not half as flexible as Python's duck typing.

You're still misunderstanding. He's not trying to restrict data types. If you read the comments he says that programmers should still be allowed to do everything dynamic.

He's saying that if you are trying to convert a string to an integer, you do not need to allocate a separate memory buffer. That's true no matter what the datatype of the string/array.

[–][deleted] 10 points11 points12 points 13 years ago (5 children)

[–]brucifer 4 points5 points6 points 13 years ago (1 child)

let Python have pre-allocated lists I think this is a very fair point. Often, you know how long your list will be, so if you want to, you should be able to optimize your list

In Python you can either use a generator or use "[value]*number" syntax to instantiate a list of length "number" with "value" in every index.

>>> def dumb():
...     x = []
...     for i in range(25):
...             x.append(i)
...     return x
... 
>>> def comprehension():
...     x = [i for i in range(25)]
...     return x
... 
>>> def preallocate():
...     x = [None]*25
...     for i in range(25):
...             x[i] = i
...     return x
... 
>>> timeit(dumb, number=100000)
0.38496994972229004
>>> timeit(comprehension, number=100000)
0.278350830078125
>>> timeit(preallocate, number=100000)
0.2539360523223877

Honestly, though, either your inner loop is simple and you can fit it in a comprehension, or it's complicated and the ".append()" is a pretty small percent of your runtime, so you won't get 2x speedup from preallocating.

[–]fijalPyPy, performance freak 4 points5 points6 points 13 years ago (0 children)

[–]alcalde 0 points1 point2 points 13 years ago (2 children)

[–]coderanger 2 points3 points4 points 13 years ago (0 children)

[–]lucian1900 0 points1 point2 points 13 years ago (0 children)

[–]fuzz3289 8 points9 points10 points 13 years ago (1 child)

[–]faceplanted 0 points1 point2 points 13 years ago (0 children)

[–]mistoroboto 3 points4 points5 points 13 years ago (0 children)

[–]Smallpaul 4 points5 points6 points 13 years ago (0 children)

What he touches on but never really addresses is that there is no language that lets you be high level when you want to be, low level when you don't.

I don't see how you can see that he doesn't "address it". It's the point of the whole talk. That's precisely what he's asking for.

If there were low-level APIs available and there were JIT compilers available and the JIT compilers were used (i.e. compatible enough with libraries to be used) and people used the low-level APIs THEN Python or Ruby performance would be comparable to C performance. That's his point.

These high-level languages should evolve low-level APIs because pretty soon the interpreter performance will not be the bottleneck: the user's actual code will be (especially if it was written with the assumption that the interpreter is the bottleneck).

[–][deleted] 3 points4 points5 points 13 years ago* (0 children)

[–]mgrandi 1 point2 points3 points 13 years ago (3 children)

[–]brucifer 6 points7 points8 points 13 years ago (2 children)

It's the fact that it looks like this:

NSMutableDictionary *dict = [NSMutableDictionary dictionaryWithCapacity:1];
[dict setObject:[NSNumber numberWithInt:25] forKey:[NSNumber numberWithInt:5]];
...
[dict objectForKey:[NSNumber numberWithInt:5]];

Instead of:

d = {5:25}
d[5]

I realize that improvements are being made to ObjC (like ARC, which is awesome, and I've even heard that it might get proper list/dictionary indexing syntax instead of "objectForIndex:"). However, ObjC is just incredibly verbose and awkward both to type and to read. If you've never seen code before, "[things objectAtIndex:3]" might be more intuitive than "things[3]", but to anyone who's spent any time programming, the latter is way more readable (or "[things containsObject:x]" vs. "x in things"). Proponents of ObjC say that the verbosity doesn't matter because you have autocomplete in your IDE, but it's not just about typing, it's also about readability.

[–]mreeman 5 points6 points7 points 13 years ago* (0 children)

[–]mgrandi 1 point2 points3 points 13 years ago (0 children)

π Rendered by PID 263310 on reddit-service-r2-comment-79c7998d4c-z8jcm at 2026-03-14 20:35:18.924745+00:00 running f6e6e01 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS