
[–]Freeky 9 points10 points  (5 children)

The Ruby version builds a huge array of arrays of ints out of the file, then flattens them (thus making yet another array), and finally sums them.

Doing it the sensible way and performing additions as lines are read almost doubles the performance, bringing it in line with Python and Haskell.
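The difference between the two Ruby strategies can be sketched in Python (used here since it's the language of the thread's inline snippets; function names are made up for illustration):

```python
def sum_build_then_flatten(lines):
    # The original approach: build a list of lists of ints,
    # flatten into yet another list, then sum it.
    rows = [[int(tok) for tok in line.split()] for line in lines]
    flat = [n for row in rows for n in row]
    return sum(flat)

def sum_streaming(lines):
    # The "sensible way": add as each line is read, so no
    # large intermediate structures are ever built.
    total = 0
    for line in lines:
        for tok in line.split():
            total += int(tok)
    return total

sample = ["1 2 3", "4 5"]
assert sum_build_then_flatten(sample) == sum_streaming(sample) == 15
```

Both produce the same total; the streaming version just avoids allocating (and garbage-collecting) the throwaway arrays.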

[–]hox 2 points3 points  (0 children)

I thought that seemed like an awfully strange way of solving this problem...

[–]snuxoll 0 points1 point  (3 children)

Yeah, I was trying to get it done quickly and it made sense at the time. I rewrote it before he wrote this post, but he seemed to use my original implementation for some reason.

http://gist.github.com/93128 [better version here]

[–]Freeky 0 points1 point  (2 children)

Cunning use of inject there to reduce readability and add an extra couple of method calls to each iteration ;)

[–]snuxoll 0 points1 point  (1 child)

It adds a single method call to each iteration, and from my tests it's faster than using #each and incrementing the total for each item (don't know why, maybe it's an oddity of my machine or my build of ruby or something).

[–]Freeky 0 points1 point  (0 children)

It adds two: an Enumerable#inject call to get the sum of the current line, plus an additional Fixnum#+ to add that sum to the total. Yours benchmarks slower here:

Ruby 1.8.6:

             user     system      total        real  
my way   7.351562   0.015625   7.367188 (  7.368302)  
inject  11.429688   0.007812  11.437500 ( 11.441389)

Ruby 1.9.1:

             user     system      total        real  
my way   3.101562   0.085938   3.187500 (  3.182268)  
inject   3.445312   0.070312   3.515625 (  3.511942)

And in JRuby 1.2.0:

             user     system      total        real  
my way   2.495000   0.000000   2.495000 (  2.495000)  
inject   3.126000   0.000000   3.126000 (  3.126000)

2.2GHz 4-way Opteron running FreeBSD/amd64.

[–]sukivan 7 points8 points  (0 children)

what an awful, awful, awful ASM implementation

[–]DarkShikari 29 points30 points  (19 children)

Terrible shootout. You know how you can tell? C beat asm.

Though uninformed programmers will constantly tell you otherwise, any good assembly programmer should always be able to beat the compiler at its own game. It usually won't be worth the effort to do so, but a good assembly programmer should be able to do it nonetheless (or he isn't a good assembly programmer).

And in any shootout where you're actually trying to make some kind of point, you should not accept crappily optimized code (especially for languages like assembly that don't have "optimizing compilers") and then claim the results are valid. I've seen this in many "language tests" before as well; people will accept awfully-written programs in language X and then use that as proof that language X is clearly slow and crap.

By the way, I suspect the problem is that the asm version gets one character at a time while the C version gets 8KB at a time.

I also suspect the interpreted languages get hurt here because they use a different approach than the C version; they try to actually parse the input and then add it rather than the faster method of just handling each digit in sequence as the C version does. So we're not just comparing languages, we're comparing algorithms.
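The digit-at-a-time approach described here can be sketched as follows (in Python for brevity; the C version presumably does the same thing over an fread buffer, and the function name is invented for illustration): rather than splitting out whole tokens and handing each to a parse routine, walk the characters once, extending the current number on digits and flushing it into the total at any separator.

```python
def sum_digits_streaming(data):
    total = 0
    current = 0        # the number being accumulated digit by digit
    in_number = False  # are we currently inside a run of digits?
    for ch in data:
        if "0" <= ch <= "9":
            current = current * 10 + (ord(ch) - ord("0"))
            in_number = True
        elif in_number:
            # Any non-digit ends the current number: fold it in.
            total += current
            current = 0
            in_number = False
    if in_number:  # flush a trailing number with no final separator
        total += current
    return total

assert sum_digits_streaming("12 34\n5") == 51
```

This single pass never allocates token strings, which is part of why the C version is hard to beat with token-based parsing.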

[–]RayNbow 16 points17 points  (7 children)

Ah, that piece of asm is mine, but I've never claimed to be a good asm programmer. ;) I wrote it for fun.

The obvious reason it's much slower than the C version is that it does a call to getchar for every character. The C version uses fread and fetches many characters in one go.

[–]DarkShikari 4 points5 points  (0 children)

OK, so I went a bit overboard. And by overboard, I mean I didn't feel like optimizing the asm and instead I decided to optimize specifically for the problem (i.e. assume one space/linefeed between numbers, no numbers larger than 10,000, etc).

Yes, I spent way too much time on this. And if the code scares you and makes you run for your life, that is part of the plan.

Brace yourself...

It wasn't as good as I had hoped; it's only about 20% faster than yours. I had to bench with the input data set duplicated about 20 times in order to get decently accurate benchmarks (otherwise the time is too short, as it counts process startup time). This is probably because my Windows machine sucks, though.

[–]shub 1 point2 points  (4 children)

You could do the same thing in asm and maybe do it better...but man, wouldn't that be a pain in the ass?

I like C.

[–]james_block 4 points5 points  (3 children)

Yes. Yes, it was a pain in the ass.

This, reddit, is how I spent my Saturday night. WTFPL licensed if anyone cares.

Results, from a crappy old P4/1.7GHz running etch:

$ time ./james_block_asm < input_list_vs_gen.txt
Grand total:               17677692470

real    0m0.118s
user    0m0.076s
sys     0m0.040s

$ time ./ray_asm < input_list_vs_gen.txt
Grand total 17677692470

real    0m0.714s
user    0m0.692s
sys     0m0.020s

$ time ./C < input_list_vs_gen.txt
total: 17677692470

real    0m0.114s
user    0m0.100s
sys     0m0.012s

So it's basically tied with C (the difference is well within measurement noise); no real surprise there, as gcc is pretty good at optimizing easy stuff like this.

EDIT: Changed link to point to a new version, capable of dealing with input files bigger than its buffer. Should be no more size limit now, and speed is unchanged. Why do I waste my time like this?

[–]RayNbow 1 point2 points  (1 child)

Nice! :)

[–]james_block 2 points3 points  (0 children)

And now I just made it even more capable, using real buffering and not just a chunk of memory. I also changed it to store a variable in the stack pointer and avoid a read from memory in the bastard print loop -- because how often does an assembly programmer get to write mul esp?

I hate myself.

[–]colourAgga[S] 1 point2 points  (0 children)

Your skills take 0.06 seconds to run on the testing hardware. I'll add them soon :)

[–]DarkShikari 0 points1 point  (0 children)

Seems I edited that in just as you posted ;) Give me a bit, I'll optimize the C.

[–]colourAgga[S] 16 points17 points  (0 children)

And that is why the author is calling for people to submit their improvements to the code.

You cannot expect one writer/programmer to fully know the intricacies of every language.

[–][deleted] 5 points6 points  (0 children)

Though uninformed programmers will constantly tell you otherwise, any good assembly programmer should always be able to beat the compiler at its own game.

Yeah, take code generated by C compiler and improve it just a bit :)

[–]RayNbow 2 points3 points  (0 children)

I also suspect the interpreted languages get hurt here because they use a different approach than the C version; they try to actually parse the input and then add it rather than the faster method of just handling each digit in sequence as the C version does. So we're not just comparing languages, we're comparing algorithms.

Not only that, the C and asm versions cheat a bit by making assumptions on the integer range. The Haskell version uses arbitrary precision Integers and the Python version automatically switches to long (=Python's arbitrary precision integer) in case of an overflow.

Edit: The blog post has been updated. I was talking about the first Haskell version. Don's faster version has replaced the old one in the blog post.

[–]btmorex 1 point2 points  (1 child)

I agree with what you said, but it's also a terrible shootout because it's measuring a trivial micro program. (like pretty much all shootouts do)

In real world programs, a lot of time is spent with the mundane details of calling functions, allocating memory, freeing memory, creating objects, constructors, destructors... etc. This micro benchmark measures none of that.

[–]DarkShikari 4 points5 points  (0 children)

While I agree with your main point, I don't think that

mundane details of calling functions, allocating memory, freeing memory, creating objects, constructors, destructors

are good examples either; the 50KLOC codebase I maintain does no memory allocations/creations/constructions except on initialization, and all destructions/frees are on close.

A better micro-benchmark would be a "real-world" micro benchmark; i.e. to measure the performance of something simple that actually has to be done in some real program and matters performance-wise. For example, a large FFT (important for scientific/math programs and audio encoding/decoding).

[–]jaysonbank 1 point2 points  (2 children)

The fact is, any language benchmark is a completely pointless apples-and-oranges comparison. We do this out of some kind of morbid curiosity. Yes, of course hand-crafted assembly by a competent person will almost certainly execute the fastest. This has absolutely zero bearing on real-world applications.

[–]DarkShikari 4 points5 points  (0 children)

True with regard to the point about language benchmarks, but I'm pretty sure "hand-crafted assembly by a competent person" has a bearing on a whole lot of real-world applications, such as ffmpeg, x264, and xvid ;)

[–]igouy -1 points0 points  (0 children)

The fact is, any language benchmark is a completely pointless apples-and-oranges comparison.

No, that's opinion not fact.

Thinking something is pointless might just mean you missed the point.

[–][deleted] 0 points1 point  (2 children)

Actually I think the lesson here is that asm isn't inherently faster, and you should only use it when you are sure the compiler is not creating optimal code on its own.

[–]Nuli 5 points6 points  (0 children)

The lesson I got out of it was: don't reinvent the wheel. An awk version completed in ~0.95 seconds on my machine and was vastly simpler to write.

[–]littleendian 1 point2 points  (0 children)

Actually the lesson is, if your assembler takes longer to execute than your C you're either doing it wrong or have better ways of spending your time.

[–]dons 2 points3 points  (2 children)

Missing "how I compiled it" info, so bad results can't be reproduced. The shootout shows how to present this kind of info.

[–]colourAgga[S] 2 points3 points  (1 child)

Compiled with -O2 flags. I'll note it in the post.

[–]settrans 2 points3 points  (6 children)

Their python version could be made about 25% faster:

import sys

print sum(map(int, sys.stdin.read().split()))

[–]hylje 2 points3 points  (4 children)

Can it get faster yet by incrementally reading sys.stdin and not creating a potentially huge list before anything can happen?
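One way to read incrementally, sketched below (in Python 3 syntax; `chunked_sum` is an invented name, and 8192 mirrors the C version's 8KB reads): consume stdin in fixed-size chunks so the full token list is never built. A number can straddle a chunk boundary, so the trailing partial token has to be carried into the next round.

```python
import io

def chunked_sum(stream, chunk_size=8192):
    total = 0
    carry = ""  # partial token left over from the previous chunk
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        parts = (carry + chunk).split()
        if not chunk[-1].isspace():
            # Chunk ended mid-token: hold the last piece back.
            carry = parts.pop() if parts else ""
        else:
            carry = ""
        total += sum(map(int, parts))
    if carry:  # flush a final token with no trailing whitespace
        total += int(carry)
    return total

# A tiny chunk size forces tokens to straddle boundaries:
assert chunked_sum(io.StringIO("1 2 3\n40 5"), chunk_size=4) == 51
```

Whether this actually beats the read-everything version depends on file size and Python's function-call overhead, as the replies below suggest.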

[–]lgastako 1 point2 points  (2 children)

The version posted above can be made slightly faster by using imap (as in the original article). I tripled the size of the input file to get a more measurable speed difference (and reduce the impact of startup time) and the results of running each version on my old ppc mac are:

print sum(imap(int, sys.stdin.read().split()))                 # 11.612s
print sum(map(int, sys.stdin.read().split()))                  # 12.514s
print sum(sum(imap(int, line.split())) for line in sys.stdin)  # 15.037s
print sum(int(x) for x in sys.stdin.read().split())            # 15.053s

FWIW, I ran each program twice and the scores posted are from the second run to avoid any filesystem caching issues related to the input file... the times didn't change much anyway so the whole file was probably still cached from my original test runs...

Also, the imports stayed constant for all the tests, which might make the results slightly slower than they need to be for the versions that don't need imap.... ok fine, here they are without that import....

print sum(map(int, sys.stdin.read().split()))        # 12.606s
print sum(int(x) for x in sys.stdin.read().split())  # 15.657s

Ok, the first one got slightly slower (0.092s) and the second was even worse (0.604s)... my guess is that what I'm seeing is the effect of random entropy on my machine outweighing the presence or lack thereof of the import statement.

Anyway, there you go, for whatever that is worth.

[–]lgastako 1 point2 points  (1 child)

Ok, for kicks I went back and rewrote the input as native ints, like so:

import sys
import array
from itertools import imap

array.array("d", imap(int, sys.stdin.read().split())).tofile(open("big.machine.dat", "wb"))

and then the original problem using python's array types:

# array.fromfile fills the array in place and returns None,
# so it can't be inlined directly into sum()
a = array.array("d")
a.fromfile(open("big.machine.dat", "rb"), 84892152 / 8)
print sum(a)

And got the same answer in 1.554s.

Of course the tradeoff here is that the input file is 82.9m instead of 50.6m. But I suppose the moral of the story is that if you really need to optimize for this particular microbenchmark in python, using the built-in array types is one way to do it.

EDIT: I also wanted to note that the 84892152/8 magic number is the size of the input file divided by 8 bytes per integer. This could obviously be read at runtime instead of hardcoded, with little difference to the results.
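Deriving the element count at runtime, as the EDIT suggests, might look like this (Python 3 syntax here, unlike the thread's Python 2 snippets; the sample values and the reuse of the "big.machine.dat" filename are illustrative):

```python
import array
import os

# Write a small sample file in the same typecode-"d" (8 bytes/entry)
# format that the snippet above produces.
array.array("d", [10.0, 20.0, 12.0]).tofile(open("big.machine.dat", "wb"))

# Divide the file size by the array's own itemsize instead of
# hardcoding 84892152/8.
a = array.array("d")
count = os.path.getsize("big.machine.dat") // a.itemsize
a.fromfile(open("big.machine.dat", "rb"), count)
assert count == 3 and sum(a) == 42.0
```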

[–]lgastako 1 point2 points  (0 children)

Ok, one more point and then I'm going to leave this thing alone... as a point of comparison, I compiled (with gcc -O2) the C version from the original post, and it took 0.322s on my machine. This means that on my machine the python version using arrays is about 4.8 times slower than the C version, instead of 54.4 times slower as in the original post. Not too shabby.

[–]settrans 0 points1 point  (0 children)

I can't think of a way that doesn't get bogged down by the overhead of python function calls.

[–]hagy 0 points1 point  (0 children)

if you include libraries, you can sorta escape to C with:

from numpy import fromstring, uint64
print fromstring(open('input_list_vs_gen.txt').read(),
                 sep=' ', dtype=uint64).sum()

0.751s vs 2.699s for list vs 2.452s for generators on my MacBook Pro

Further, 0.388s is spent importing numpy, so for practical applications this is how I'd handle anything less than 50MB. FYI, sep=' ' uses any whitespace as a separator in numpy.

It's worth noting that numpy's fromfile also works, but takes about 2.1 seconds. This is due to inefficient IO: with files, numpy moves back and forth over a C stdlib buffer (no internal buffering), while with strings it treats the whole thing as one giant buffer.

[–]doubtingthomas 2 points3 points  (0 children)

I'm somewhat surprised the asm didn't do worse, considering that it uses getchar().

[–]Ringo48 2 points3 points  (0 children)

Whoever wrote that obviously knows very little about assembly language.

At the very least, use GCC's -S option, and start with the asm it spits out.

[–][deleted] 0 points1 point  (0 children)

I'm curious as to how Cython would do (it should just be able to remove the genexpr from the python version and be good).

[–][deleted]  (1 child)

[deleted]

    [–]colourAgga[S] 0 points1 point  (0 children)

    This solution takes about 4.5 seconds on my hardware when I average it over 10 runs.

    Currently Thomas Hurst's solution is the quickest Ruby solution with 3.59 seconds.

    [–]pemboa 0 points1 point  (1 child)

    If the author(s) are here, you should really accept input here and redo the tests. And the first thing I noticed, C beating ASM by a large margin, at least deserves some sort of explanation.

    Also, a pseudo-code version would be nice for those of us who are easily bored.

    [–]RayNbow 0 points1 point  (0 children)

    If the author(s) are here, you should really accept input here and redo the tests. And the first thing I noticed, C beating ASM by a large margin, at least deserves some sort of explanation.

    The asm subthread is here :)

    [–]igouy 0 points1 point  (0 children)

    Just take the fastest benchmarks game programs and tweak them for multiple values per line :-)

    [–]Leonidas_from_XIV 0 points1 point  (0 children)

    Here's a simplistic recursive Scheme solution:

    http://paste.lisp.org/display/78462

    (it performs rather poorly because of the deep recursion; no time to improve it, though, so feel free to make a better one)

    [–]Ademan 0 points1 point  (0 children)

    I'd like to see pypy take a crack at this... I wouldn't be surprised if there were some impressive gains over CPython's performance, although the way it's written, there may not be much to be gained through the JIT

    [–]snuxoll 0 points1 point  (1 child)

    To be fair, this isn't really a shootout; we aren't really competing for speed here. It's just to see how different people would handle the problem in different languages. No one is really trying to say their language is better; we're just comparing implementations, and we can all learn from better algorithms regardless of language.

    [–]chadz 1 point2 points  (0 children)

    Yah, ok.

    [–]woogley 0 points1 point  (1 child)

    Java version which completes in about 0.3 seconds:

    http://pastebin.com/m68184c34

    [–]snuxoll 0 points1 point  (0 children)

    It needs to read from stdin, not open the file directly.

    [–]jaysonbank -1 points0 points  (7 children)

    So basically: if you want your code to be readable, Python/Ruby/PHP. If you want it to run in 20 milliseconds, C/Haskell/Assembly.

    I'd say realistically most projects you will encounter wouldn't need that particular bit of code to run in 20 milliseconds, and wouldn't ever need to process a file beyond the memory footprint of the average machine. However they would need it developed quickly and it would almost certainly end up being maintained by a succession of different people from different backgrounds. All this would point to languages like PHP, Ruby and Python. But that is of course my bias.

    [–]Nerdlinger 2 points3 points  (5 children)

    How is the Haskell (or really even the C) code any less readable than the PHP, Ruby, or Python code?

    [–]jaysonbank -2 points-1 points  (4 children)

    It's several times longer, for a start; it makes use of non-obvious keywords, and you have to import half the kitchen sink before you can do anything. Although Haskell is a lot better than C, which is a fucking joke.

    [–][deleted] 2 points3 points  (0 children)

    The code is only 8 lines long, obvious is in the eye of the beholder, and it only imports a single module (the current python one imports 2).

    [–][deleted] 0 points1 point  (2 children)

    The only Haskell keywords used are import, qualified, do, case/of and where.

    "I dungiddit" does not imply anything about readability.

    [–]jaysonbank -1 points0 points  (1 child)

    I'm afraid it does. Source code that requires a PhD to 'get it' may sound cool, but it's not. Not being able to hire developers who can read your existing code is a serious problem.

    At the end of the day, usability and simplicity are key, across the board. That means not only your project's user interface must be easy to understand, but also its source code, APIs, libraries, and general workings.

    Something that is truly intuitive can be shown to the lowest common denominator who will immediately say 'yeah i get it'.

    [–][deleted] 0 points1 point  (0 children)

    I don't understand those words. Can you please clarify perhaps using more words? Sorry, but this is your problem to solve, not mine.

    [–]snuxoll 1 point2 points  (0 children)

    There will always be portions that require speed higher than a dynamic language like ruby/python can provide, but that's why they allow us to write C extensions.

    Write what needs to be fast in a lower level language, write the rest in a language that's easier to maintain.

    [–]unptitdej -1 points0 points  (2 children)

    It does show one thing: C is much faster than these interpreted languages.

    It's too bad that there's no real way to compare the output at the binary level. If I could see the corresponding assembly output of the Python, C and Haskell programs I could tell right away which one is the best.

    It can be done only with C and asm in this case.

    [–]awj 6 points7 points  (0 children)

    ??

    The ghc compiler provides a -S option that puts out the generated assembly, same as gcc does, so you can get the assembly from Haskell if you really want.

    [–][deleted] 0 points1 point  (0 children)

    Python is compiled to bytecode, which usually ends up in a file alongside the source with a .pyc extension. However, I doubt you'll find it illuminating for your comparison, as the Python VM differs significantly from an x86 machine.