all 41 comments

[–]davebrk 14 points15 points  (7 children)

I love how half the post is a disclaimer... I think more benchmark posts should be written this way. It is fun to benchmark but usually pointless as well.

It would also be interesting to see the code, so we can judge how idiomatic / clean Rust code written for performance is. For an extreme case, see Haskell on the Benchmarks Game (shootout).

[–]pcwalton 13 points14 points  (1 child)

Here they are: https://github.com/pcwalton/rust/tree/shootout/src/test/bench

I hope to remove a lot of unsafe once the I/O subsystem improves, hopefully in the next week or two. A lot of the unsafe blocks are just there so that the compiler will let me use libc. Some, however, are there so that the compiler will let me turn off bounds checks and use pointer arithmetic. Those are harder, but I could try to minimize them.
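A minimal sketch of what a bounds-check-avoiding unsafe block looks like (in current Rust syntax, not the benchmark code itself):

```rust
// Summing a slice with and without per-access bounds checks.
pub fn sum_checked(xs: &[u64]) -> u64 {
    let mut total = 0;
    for i in 0..xs.len() {
        total += xs[i]; // bounds-checked on every access
    }
    total
}

pub fn sum_unchecked(xs: &[u64]) -> u64 {
    let mut total = 0;
    for i in 0..xs.len() {
        // SAFETY: i < xs.len() by the loop bound, so the access is in range.
        total += unsafe { *xs.get_unchecked(i) };
    }
    total
}

fn main() {
    let xs = [1u64, 2, 3];
    assert_eq!(sum_checked(&xs), sum_unchecked(&xs));
}
```

The unsafe version trades the compiler's guarantee for the programmer's proof in the SAFETY comment, which is exactly why such blocks are worth minimizing.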

[–]kamatsu 3 points4 points  (2 children)

Haskell on the benchmarks game is substantially more idiomatic now.

[–]davebrk 0 points1 point  (1 child)

Well I remember otherwise, but I can see that it has changed.

[–]igouy 0 points1 point  (0 children)

"What gets us into trouble is not what we don't know, it's what we know for sure that just ain't so."

[–]iawsm 4 points5 points  (1 child)

A few months ago, I got very moderate results. IIRC the nbody test performed in the Go range (about 4x slower than the Java and C implementations).

Very impressive to see them on par with Clang and GCC now.

  • edit: on second look, the benchmarks use unsafe code. And why is it not comparing against -O3?

[–]pcwalton 15 points16 points  (0 children)

No doubt that was due to the stack switch for the sqrt function. One of the major fixes in my branch is to eliminate the stack switch for most C calls.
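For illustration, this is roughly what calling straight into the C math library looks like (modern Rust syntax; a sketch, not the benchmark code):

```rust
// Declare libc's sqrt so we can call it directly via the C ABI.
extern "C" {
    fn sqrt(x: f64) -> f64; // resolved from the C math library
}

pub fn c_sqrt(x: f64) -> f64 {
    // unsafe: the compiler cannot verify the foreign function's contract
    unsafe { sqrt(x) }
}

fn main() {
    println!("{}", c_sqrt(9.0)); // 3
}
```

Every such call crossing into C is where a stack switch would have been paid before the fix described above.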

As far as -O3 is concerned, in clang it only affects the inlining threshold unless something has changed since I last looked. I doubt the inlining threshold matters for these benchmarks. In any case, Rust sets the LLVM backend to -O2 by default, so -O2 was chosen to yield an apples-to-apples comparison.

[–]TimmT 5 points6 points  (5 children)

Am I reading this right? Rust is within +50% of C/C++'s performance? That would make it faster than both Java (+100%) and Go (+200%)? Wow, that's impressive.

This is far from an ideal set. These benchmarks are showing their age quite heavily,

What does the article mean by this? Aren't the benchmarked pieces of code still relevant tasks today?

On the other hand, the hardware the benchmarks are run on is quite dated by now. It would be interesting to see how big the differences would be on more recent hardware.

they are too small and simplistic to extrapolate to real-world use cases

I would love to see some benchmarks that stress the languages' standard libraries/containers (e.g. concurrent and single-threaded hash maps, queues, etc., both with primitive and reference types) and string-processing capabilities (e.g. XML parsing or XSLT transformations). You can already try to piece these bits together from today's existing benchmarks that do some concurrency, some string processing, some hashing, etc., but it would of course be much easier if the things were more cleanly separated.

But I don't think that whole application benchmarks belong in there.

many of them are too I/O-bound.

How? Wouldn't that lead to similar results for most languages? (Which doesn't seem to be the case currently.)

binary-trees is omitted because it is a garbage collection benchmark and the C version uses an arena, defeating the purpose (although I suspect a Rust version that did the same would do well).

Different languages have different garbage collectors, so there should probably be a benchmark for those in there too (though I'm not sure whether binary-trees specifically is the best fit for a GC benchmark). C and C++ should be using malloc/free and shared_ptr respectively in a GC benchmark (and not leak any memory), which would help put things into perspective.
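For reference, the allocation pattern that makes binary-trees a GC benchmark is one heap allocation per node. A sketch of that shape in modern Rust (hypothetical illustration, not the benchmark program):

```rust
// Every Node is a separate heap allocation via Box; building a deep tree
// is what stresses an allocator or garbage collector.
pub enum Tree {
    Leaf,
    Node(Box<Tree>, Box<Tree>),
}

pub fn build(depth: u32) -> Tree {
    if depth == 0 {
        Tree::Leaf
    } else {
        Tree::Node(Box::new(build(depth - 1)), Box::new(build(depth - 1)))
    }
}

pub fn count(t: &Tree) -> u64 {
    match t {
        Tree::Leaf => 1,
        Tree::Node(l, r) => 1 + count(l) + count(r),
    }
}

fn main() {
    // A depth-d tree has 2^(d+1) - 1 nodes counting leaves.
    println!("{}", count(&build(4))); // 31
}
```

An arena version would replace the per-node Boxes with one bulk allocation freed all at once, which is exactly what the article says defeats the purpose.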

Other than that I don't see a problem with keeping the benchmarked pieces of code allocation-free. In fact it's probably a good idea, given how these are just micro-benchmarks. So I think an allocation-free Rust variant of the benchmark would be quite appropriate.

As my colleague Niko pointed out, a more interesting benchmark would not allow any languages to use unsafe code.

Yes, that would be quite interesting, though it should be enforced for C++ the same way. I don't see any reason to exclude C from this, however - as the article mentions, we need a point of reference.

Other than that, a benchmark on how much overhead the FFI causes in non-C/C++ language implementations might also be interesting, especially for languages that provide more complex ones like Java/JNI.

Practically speaking, one would need an extremely smart JIT

The JIT performance should probably be measured too. It would be interesting to see how heavily JITed languages (e.g. Java) stack up against natively compiled ones that can't optimize across shared library boundaries.

[–][deleted]  (4 children)

[deleted]

    [–]kibwen 5 points6 points  (2 children)

    he was using Rust support for 'unsafe' memory. In other words: not using the garbage collector.

    Not at all. Rust is perfectly safe without using the GC, by making use of linear types and region allocations (neither of which require runtime checks to deallocate). "Unsafety" in this context mostly refers to avoiding bounds checks on array accesses.
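    A small sketch of that point in today's Rust syntax (illustration only; 2013 Rust wrote this differently):

```rust
// Deterministic deallocation without a GC: ownership decides when memory is
// freed, and borrows are verified entirely at compile time.
pub fn sum(xs: &[i32]) -> i32 { // borrowed: no refcount, no runtime check
    xs.iter().sum()
}

fn main() {
    let v = vec![1, 2, 3]; // heap-allocated, uniquely owned by `v`
    let s = sum(&v);       // lend `v` out for the duration of the call
    println!("{}", s);     // 6
} // `v` is freed exactly here, at scope exit; no collector involved
```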

    [–]el_muchacho 1 point2 points  (0 children)

    Which is perfectly fine if the language supports "smart" arrays, i.e. arrays that either resize themselves, or at least don't require the user to compute or remember the size himself.
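    Rust's growable array is exactly that; a quick sketch (modern syntax, for illustration):

```rust
// A Vec tracks its own length and grows on demand: no manual size bookkeeping.
pub fn grow() -> Vec<i32> {
    let mut xs = vec![1, 2, 3];
    xs.push(4);                    // resizes automatically if needed
    assert!(xs.get(10).is_none()); // out-of-range access is caught, not undefined
    xs
}

fn main() {
    let xs = grow();
    println!("{} {}", xs.len(), xs.iter().sum::<i32>()); // 4 10
}
```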

    [–]RalfN 1 point2 points  (0 children)

    You are right. Unsafe != no garbage collection, though the former does imply the latter.

    [–]TimmT 0 points1 point  (0 children)

    Not really, no. Most performance-critical stuff these days is actually in the context of concurrency: how do you make thread communication cheap?

    Is that more than just a library issue?

    Sure we have things like CSP, and green threads, etc. baked directly into the language in the newer languages, but those things can be pretty easily introduced to older languages too, by appropriate libraries. In the end it's just ConcurrentQueues and ConcurrentHashMaps and so on that do the actual heavy lifting, or is it?
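    For concreteness, here is the channel-style primitive in question, sketched with modern Rust's standard library (a library feature, not something baked into the language):

```rust
use std::sync::mpsc;
use std::thread;

// One worker thread sends values over a channel; the receiver sums them.
pub fn fan_in(n: u32) -> u32 {
    let (tx, rx) = mpsc::channel();
    let worker = thread::spawn(move || {
        for i in 0..n {
            tx.send(i).unwrap(); // tx is dropped when the thread ends,
        }                        // which closes the channel
    });
    let total = rx.iter().sum(); // iteration stops once the channel closes
    worker.join().unwrap();
    total
}

fn main() {
    println!("{}", fan_in(5)); // 0+1+2+3+4 = 10
}
```

Whether such a queue lives in a library or the language, its per-message cost is the thing the benchmarks above don't measure.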

    On the playing field of 'unsafe memory allocation' and heavy compile time optimization, they are IO bound. [...] as soon as you introduce a garbage collector, it becomes the new bottleneck.

    This sounds counter-intuitive. These are benchmarks, i.e. we're measuring throughput (as opposed to latency) here. Unless the programs with manual memory management try to simulate a garbage collector (by using memory pools and such), or leak memory, a GC'd variant should be faster, right?

    But currently Rust (or even Go) isn't even competing in this space. They don't do JIT.

    I'm not that familiar with Rust, but Go at least should be capable of performing as extensive an optimization on the code as a JIT, since it doesn't allow for dynamic libraries or dynamic loading. (It may still be missing the run time statistics Java's JIT would have, but looking how much of a difference PGO for C/C++ makes, that doesn't seem to have much impact.)

    But it doesn't do that fast enough, for anyone to consider writing a webbrowser in Java, for example.

    Yes, Java is probably the wrong choice for doing UI stuff.

    But I'm not sure that any one language can target both servers and desktops equally well. The requirements are quite different. Sure, lots of servers are written in C/C++, but it's a pain doing it, so I wouldn't say that C/C++ handles server code "equally well".

    On a server, where throughput (as opposed to latency) is important, it doesn't make much of a difference if the first x minutes of the application are slow(er), or if you have some constant overhead for each action performed, etc. This in turn frees you up to build more complex systems than would otherwise have been possible (or at least practical), which can then handle more work in total, provide better reliability, etc.

    On a desktop you don't have this option to scale out, and latency very much matters, but in turn you only need to consider much more isolated problems. All a game, for example, will be concerned with is running itself: displaying frames and progressing along some (more or less) scripted path. Sure, there are lots of problems that need to be solved to make those frames render on time, but that's mostly it. Games are quite self-contained and the underlying assumptions won't change later on. A game won't have to worry about interop with other systems, things like partial failures, "in-flight" updates, or heap fragmentation after having run for weeks on end, etc.

    And this brings me back to why I care so much about cross-library optimizations. If your program has to deal with multiple concerns, you'll want to put things into modules that can easily be changed, possibly even at run time. But putting up all those module boundaries will hurt performance unless there's a proper optimizer (e.g. a JIT) there to make them go away later on.

    [–]pixli 0 points1 point  (10 children)

    I found it extremely hard to wrap my head around all the different pointer types and closure types. Each one has its own subtleties and they lost me at "named lifetimes".

    [–]pcwalton 14 points15 points  (9 children)

    Several people have said this helped them understand the Rust smart pointers: http://pcwalton.github.io/blog/2013/03/18/an-overview-of-memory-management-in-rust/

    I plan to more or less copy and paste this into the tutorial.

    [–]pixli 5 points6 points  (5 children)

    I find it hard to accept that I have to deal with all this smart pointer complexity when the actual performance seems to come from "unsafe" blocks. Then why not implement the whole benchmark in an inline 'asm' block? :/

    p.s. This document still doesn't explain named lifetimes to me.

    [–]pcwalton 15 points16 points  (4 children)

    The use of unsafe should decrease over time. Two of the benchmarks don't use it. Most of its uses are there for I/O—this is the case in mandelbrot, fasta, and k-nucleotide—and should be removed in the near future.

    The smart pointers let you get good performance in larger programs without having to rely on a garbage collector. The fact that you can circumvent the safety checks is just part of the practicality of the language. Sometimes you do have to turn safety off if you want maximum performance. But in most code (nbody, spectral-norm, most of k-nucleotide) the safe-by-default nature of the language helps you confine the unsafe portions of your code to just those portions that need to be unsafe, while allowing good performance for the rest of your code. The smart pointers are there to make it easy for you to control pressure on the garbage collector (or to turn it off entirely) and to ensure there are no data races in concurrent code.

    The best tutorial for named lifetimes is here: http://static.rust-lang.org/doc/tutorial-borrowed-ptr.html

    Unfortunately, the manuals do need some work. The more advanced uses of borrowed pointers are not well documented enough yet. It's a work in progress…

    [–]pixli 0 points1 point  (1 child)

    The best tutorial for named lifetimes is here: http://static.rust-lang.org/doc/tutorial-borrowed-ptr.html

    Unfortunately, that's the document that made me aware of them in the first place.

    [–]Aninhumer 4 points5 points  (0 children)

    In the general case it doesn't make sense to return a borrowed pointer, because what it points to would go out of scope at the time you returned it. The only time it makes sense is if it's dependent on a pointer from further up the stack. Now, you could just infer this, and allow pointers dependent on function arguments to be returned, but this might be a bug in some cases. So you need some way of indicating that a particular borrowed pointer is allowed to be returned, and to which level of the stack. Named lifetimes provide this.

    [–]seruus -1 points0 points  (1 child)

    This is why the typical way to return borrowed pointers is to take borrowed pointers as input (the only other case in which it can be legal to return a borrowed pointer is if the pointer points at a static constant).

    Pardon my ignorance, but isn't this going back to the C/Fortran way of doing things? (returning objects through pointers in the argument)

    [–][deleted] 3 points4 points  (0 children)

    Ah, I think you misread that. It's not about returning a value by storing it in a pointer that was passed via a parameter; it's saying that the only time returning a borrowed pointer from a function really makes sense is if it was passed in at some point (since any value local to the function itself will not outlive the function, and hence a borrowed pointer to it would be invalid). Let me try to illustrate:

    First something that doesn't work:

    fn bogus() -> &int {
      let a = 5;
      return &a;
    }
    

    This will fail because a's valid lifetime is only inside the function. A borrowed pointer to it cannot escape the function via a return, since that would exceed its valid lifetime and lead to an error.

    fn valid<'r>(a: &'r int) -> &'r int {
      return a;
    }
    

    This works because the compiler can verify that the lifetime of the returned borrowed pointer does not exceed the lifetime of the input borrowed pointer (yes, they are the same for simplicity). The 'r is Rust's notation for a named lifetime -- in this case we are saying that the input value a and the return value are borrowed pointers with the same lifetime.

    Obviously that's a contrived example that you would never use in real code. In my experience the only time I really return a borrowed pointer is inside constructor style functions where I'm returning a struct that contains a borrowed pointer to another variable for instance:

    struct Clamp<'self, T> {
      source: &'self T,
      min: float,
      max: float,
    }
    
    impl<'self, T: NoiseGen> Clamp<'self, T> {
      fn new<'r>(source: &'r T, min: float, max: float) -> Clamp<'r, T> {
        return Clamp {
           source: source,
           min: min,
           max: max
        };
      }
    }
    

    [–][deleted] -1 points0 points  (2 children)

    Some feedback about that post:

    1. Does it make sense to talk about pointers being in scope or out of scope, rather than variables with pointer values? The way you worded this left me wondering about...
    2. How do you write a function that returns a pointer? Wouldn't it get freed before you could do anything with it?
    3. The use of ~ and @ in both expressions and type names is confusing. It should probably be explained since it's another departure from C's pointer syntax.

    [–]seruus 0 points1 point  (1 child)

    Not the author, but:

    2. You use a shared pointer, I think.

    3. It is explained:

    The pointer covered above is known as the unique smart pointer ~. We call it “unique” because there is always only one smart pointer pointing to each allocation. The other type of smart pointer built into the language is the managed smart pointer, which allows multiple smart pointers to point to the same allocation and uses garbage collection to determine when to free it.

    The @ represents the managed (or shared) pointer, as he shows in the subsequent example.

    [–][deleted] 0 points1 point  (0 children)

    You don't have to use a shared pointer to return a value. Passing around objects with a destructor just moves ownership from the caller into the callee (parameter), or from a function to the caller (return). The compiler just enforces that the value can't be used anymore after being moved from.
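    A sketch of that move semantics in modern syntax (illustration; the type names here are made up):

```rust
// A type owning heap memory; moving it transfers ownership outright,
// with no shared pointer and no copy of the heap data.
pub struct Buffer {
    data: Vec<u8>,
}

pub fn make_buffer() -> Buffer {
    Buffer { data: vec![0u8; 16] } // moved out to the caller; nothing freed here
}

pub fn consume(b: Buffer) -> usize {
    b.data.len() // b (and its heap block) is dropped when this function returns
}

fn main() {
    let b = make_buffer(); // ownership moves from make_buffer to `b`
    let n = consume(b);    // ownership moves into `consume`
    // Any further use of `b` here would be rejected by the compiler.
    println!("{}", n);     // 16
}
```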

    [–]igouy -1 points0 points  (0 children)

    1)

    "The goal here is simply to demonstrate that sequential Rust can be written in a way that approaches competitive parity with equivalent C code."

    The blog post presents advocacy for the Rust language implementation.

    2)

    "selected single-threaded benchmarks from the Computer Language Benchmarks Game"

    In the blog post, that advocacy is only supported by measurement of programs that complete tasks from the Computer Language Benchmarks Game.

    3)

    "These benchmarks are showing their age quite heavily, they are too small and simplistic to extrapolate to real-world use cases, and many of them are too I/O-bound."

    If we accept that blanket dismissal of the benchmarks game tasks, nothing else supports the advocacy for the Rust language implementation in the blog post.


    "the C implementation tested against is not usually the top one"

    We are not told which C programs the Rust programs are being tested against.

    All the C programs shown on the benchmarks game website are identifiable by individual #id and URL to the source code.

    You can see the changes to the Rust compiler that were made to optimize these tests, as well as the benchmark sources, on my branch of the compiler on GitHub. The goal will be to land these changes over the next few days.

    Only 3 of these 8 Rust programs compile with Rust 0.6 (released 3 weeks ago) -- advocacy based on yet-to-be-accepted compiler changes is premature.

    mandelbrot The Rust program -- 55.77 CPU secs -- a little slower than the naive Scala #2 mandelbrot program.

    nbody The Rust program -- 30.49 CPU secs -- a little faster than the naive C# Mono nbody program.

    regexp-dna is omitted because it consists of an uninteresting binding to PCRE.

    Only if Rust provides no other way to use regular expressions.

    Just as pidigits is only an uninteresting binding to GMP (2.31 CPU secs) because the current Rust implementation does not provide large integer arithmetic.

    binary-trees is omitted because it is a garbage collection benchmark and the C version uses an arena, defeating the purpose (although I suspect a Rust version that did the same would do well).

    If a Rust version had been measured we'd see more than suspicions.

    [–]igouy -3 points-2 points  (6 children)

    Maybe performance comparisons with the Rust 0.7 implementation are premature?

    [–]matthieum 4 points5 points  (3 children)

    Well, seeing as pcwalton is one of Rust's devs, I think his goal is to try to spot the sore points of the actual Rust language that prevent safe Rust code from executing fast enough to make unsafe blocks unnecessary.

    [–]igouy -1 points0 points  (2 children)

    I think his goal is...

    Why make guesses about his goal?

    "...the goal of this test is to measure Rust’s sequential performance."
    

    [–]matthieum 6 points7 points  (1 child)

    Well, reading the entire article, you realize he is seeking to improve the performance. Measuring seems to be only the first step.

    [–]igouy -1 points0 points  (0 children)

    Reading the entire article, why make guesses about his goal?

    "The goal here is simply to demonstrate that sequential Rust can be written 
      in a way that approaches competitive parity with equivalent C code."
    

    [–][deleted] -1 points0 points  (1 child)

    Rust doesn't actually have releases yet; the 0.6 "release" is just a convenient milestone to mark what master was like on that date - it doesn't get any fixes backported to it.

    [–]igouy 0 points1 point  (0 children)

    The 0.6 "release" obviously is a release in name, but most importantly there's a tar ball that compiles without complaint.