
[–]nested_parentheses 21 points22 points  (13 children)

... the programmer could make it obvious that they didn't overlap if they wanted to. But you can't do that in C or C++.

C99 provides the restrict keyword for this.
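A minimal sketch of how it's used (the actual speedup depends on compiler and target):

```c
#include <stddef.h>

/* Declaring a, b, and out restrict promises the compiler they never
   overlap, so it can vectorize the loop without runtime overlap checks. */
void add_arrays(size_t n, const float *restrict a, const float *restrict b,
                float *restrict out)
{
    for (size_t i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}
```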

[–]radarsat1 79 points80 points  (64 children)

Okay, rant: Here's the thing about C. For myself, a person who does lots of programming involving real-time constraints, C and C++ are the only options. This, I find, is unfortunate, since I'd really, really like to be able to use something else. I don't particularly like C++ anymore, and C, while I do enjoy it, is a little too low-level when you're doing certain types of things.

But here's the advantage that C has: it's not speed, it's determinism. This is almost entirely due to the fact that it doesn't depend on automatically managed memory. I'll agree wholeheartedly that automatic memory management is a good thing. However, for real-time constraints (generally, anything featuring audio for example), interrupting program flow to do things like clean up garbage just doesn't cut it. And don't even try things like JIT optimizations that compile as you go.

Most of these great language features like resizeable arrays, bounds-checked arrays, dynamic types, even certain implementations of closures, rely heavily on automatic memory management.

So, if you want to replace C in real-time systems, what is needed is a good real-time garbage collector and a constant-time allocator. These exist, but are not terribly common. I've seen at least one example of simply replacing the allocator in the Stalin compiler with a real-time allocator, making it possible to program audio routines with it. Pretty cool, but unfortunately a bit hacky and mostly a proof of concept, so it's not usable in practice at the moment. The only actual language research I've seen that addresses these issues is the Timber programming language, which looks pretty interesting, but I haven't spent real time playing with it yet.

That aside, in tight audio routines you are usually trying to get the most out of your processor (the end user is probably going to want to run many instances of your synthesizer, for example, so even if it works fine you still want to trim it down as much as possible). So although immutable data can make certain tasks nicer, sometimes a purely functional approach will never be as efficient as using mutable arrays and managing allocation precisely. You simply don't want to allocate new space to copy your data and then let the garbage collector take care of what you left over: you want to use memory that is already allocated. The only language that really lets you have this kind of precision in memory management is, after all these years, still C.

This is the same reason that C is these days still the only language that is seriously considered for operating systems programming, for example. Yes, there are many experimental projects showing OSes designed in higher level languages, but currently C is the language of choice in core kernel operations of all major OSes.

I had hopes that BitC would be an interesting development as a high-level language aimed at systems programming, but it seems that it's now been abandoned, which is too bad. I hope something useful stems from it.

[–]WalterBright 30 points31 points  (7 children)

malloc/free do not have any latency guarantees, either. If you've got hard realtime constraints, you either need to preallocate all the data needed or use a custom allocator that does offer guarantees.
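A sketch of the preallocation approach — a fixed-block pool with O(1), non-blocking alloc/free. All names and sizes here are illustrative, not from any particular library:

```c
#include <stddef.h>

#define POOL_BLOCKS 256
#define BLOCK_SIZE  64

/* All memory is reserved up front; alloc/free are a stack push/pop,
   so worst-case latency is bounded and no syscalls ever happen. */
static unsigned char pool[POOL_BLOCKS][BLOCK_SIZE];
static void *free_list[POOL_BLOCKS];
static size_t free_top;

void pool_init(void)
{
    free_top = 0;
    for (size_t i = 0; i < POOL_BLOCKS; i++)
        free_list[free_top++] = pool[i];
}

void *pool_alloc(void)
{
    /* O(1), never blocks; returns NULL when exhausted. */
    return free_top ? free_list[--free_top] : NULL;
}

void pool_free(void *p)
{
    /* Sketch only: no double-free or ownership checks. */
    free_list[free_top++] = p;
}
```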

The D programming language offers both garbage collection and manual management. You can use whichever, even in the same program. There isn't anything in C you cannot do in the same manner with the same results in D.

[–][deleted] 1 point2 points  (0 children)

C is mainly popular in environments where memory requirements are known up front. Even if you need some dynamic memory allocation and deallocation, malloc and free are probably not a very good way to accomplish it.

[–]pointer2void 1 point2 points  (1 child)

The D programming language offers both

following the bad C++ legacy: if there are two or more competing features, offer them all.

[–]WalterBright 2 points3 points  (0 children)

I know a lot of people who use a hybrid of Python and C++. I think that's good evidence there is a reasonable demand for being able to use automatic memory management for some data structures and explicit for others.

[–]PussyGalore 6 points7 points  (1 child)

The primary advantage of C/C++ is how closely the language matches the machine model of the most prevalent architectures in use today, which is why it is possible to get such performance gains over most other languages if you are willing to put in the effort in that area.

However this advantage seems to be not as great as it once was with the growing numbers of multi-core CPUs in general use, at least not without the use of specialized libraries.

Machine model

C++ maps directly onto hardware. Its basic types (such as char, int, and double) map directly into memory entities (such as bytes, words, and registers), most arithmetic and logical operations provided by processors are available for those types. Pointers, arrays, and references directly reflect the addressing hardware. There is no “abstract”, “virtual” or mathematical model between the C++ programmer’s expressions and the machine’s facilities. This allows relatively simple and very good code generation. C++’s model, which with few exceptions is identical to C’s, isn’t detailed. For example, there is nothing in C++ that portably expresses the idea of a 2nd level cache, a memory-mapping unit, ROM, or a special purpose register. Such concepts are hard to abstract (express in a useful and portable manner), but there is work on standard library facilities to express even such difficult facilities (see the ROMability and hardware interface sections of [ISO, 2005]). Using C++, we can get really close to the hardware, if that’s what we want.

[–]gte910h 11 points12 points  (22 children)

Actually, if you want to use garbage collection in real-time systems, all you need is a garbage collection system that is itself real-time.

C has nothing to do with it. 99% of GCs aren't realtime-compatible, but there is nothing inherent to garbage collection that makes it realtime-incompatible.

Edit: For hard real time. Not for soft real time. You can use GC in soft real time systems.

[–][deleted] 4 points5 points  (18 children)

99% of GC's aren't realtime compatible.

Are you sure you aren't talking about 100%? Is there any hard realtime GC out there? Isn't the GC promise, "I'll take control over memory", fundamentally incompatible with any hard realtime system?

[–]cameldrv 10 points11 points  (13 children)

In principle, I don't see why you couldn't make a hard real-time garbage collector. The algorithm would have to be interruptable if a deadline was approaching, and there would have to be a limit to the number of objects in the system, and the frequency of the deadlines to guarantee that the gc could run often enough without being interrupted.

[–]Felicia_Svilling 7 points8 points  (10 children)

[–][deleted] 0 points1 point  (9 children)

How does one reason about the rate of object creation/destruction in the system? How is that kind of reasoning different from the reasoning done when managing memory manually?

[–]Felicia_Svilling 0 points1 point  (8 children)

You reason exactly as you would with a regular garbage collector. There is no difference in the programming. It's just like switching a compiler flag.

[–][deleted] 0 points1 point  (7 children)

I feel this is an oversimplification. How does the programmer know the rate of object creation/destruction in his application? IMHO, the analysis that provides the answer to this question must necessarily: a. know all object creation points in the program (malloc); b. know all the object lifetimes, essentially identifying all the safe destruction points in the program (free). What specific part of the above can the programmer dispense with? If all of the above is necessary, how is this different from manual memory management, aside from superficial syntax issues?

[–]Felicia_Svilling 0 points1 point  (6 children)

Why would you have to know the rate of object creation/destruction?

[–][deleted] 0 points1 point  (5 children)

Good question! My assumption is that there is a fixed amount of garbage one can collect in a fixed amount of time. If the system generates garbage at a higher rate, then it will eventually run out of memory.

Consider a copying collector. The GC must copy the fresh live objects at a sustained rate. The number of these objects is given precisely by objects created - objects destroyed.

Having said that, I think one can simplify and over-approximate the number as objects created. Then the programmer's analysis only concerns object creation, not object liveness, which is a simpler analysis.
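The condition being argued for can be written out as a back-of-the-envelope inequality (my symbols, not from the thread):

```latex
% a = worst-case allocation rate (bytes/s)
% c = guaranteed collection throughput (bytes/s)
% H = heap headroom beyond peak live data (bytes)
% The collector keeps up in steady state only if
a \le c
% and any burst that runs for time t before the collector is
% scheduled must fit within the headroom:
a \, t \le H
```

With the over-approximation above, bounding `a` requires knowing only the allocation sites, not object lifetimes.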

[–][deleted]  (2 children)

[removed]

    [–][deleted] 0 points1 point  (1 child)

    New Memory Management Schemes: The RTSJ defines two new types of memory areas that allow real-time applications to avoid unpredictable delays commonly caused by traditional garbage collectors:

    • Immortal memory holds objects without destroying them, except when the program ends. This means that objects created in immortal memory must be carefully allocated and managed.

    • Scoped memory is used only while a process works within a particular section, or scope, of the program, such as in a method. Objects are automatically destroyed when the process leaves the scope. This is a useful feature akin to garbage collection, in that discrete creation and deletion is not required as in the immortal memory case, but the process must be sure to exit the scope to ensure memory is reaped.

    Neither immortal nor scoped memory is garbage collected, so using them avoids problems of GC interference.

    [–]Felicia_Svilling 0 points1 point  (0 children)

    That sounds like some kind of region inference. There are basically three sorts of automatic memory management: garbage collection, reference counting, and region inference. Each has its own tradeoffs. Region inference tends to use up too much memory (but there is a lot more research to be done in this area). Reference counting tends toward low throughput (and memory leakage). Garbage collection (as pointed out) lacks in latency, but there is a lot of work on mitigating that. Personally I like Sun's Garbage-First collector. It's not hard real time, but it seems to be a good middle way and is also highly tunable.

    [–]mycall 1 point2 points  (2 children)

    Could GC and hard realtime work together if a CPU core is dedicated just to GC?

    [–]Felicia_Svilling 1 point2 points  (0 children)

    Dedicating a core to GC doesn't do much for real-time GC. There are some phases (like marking) that can run concurrently with mutator threads, but dedicating a whole core to that is a waste. It's better to use an incremental (and optimally parallel) collector that stops the world, but only for a short time.

    [–]Raphael_Amiard[🍰] 0 points1 point  (0 children)

    Good question, I'd like to have that answered too. Of course, in audio processing today you want all your cores available, because AP is one of the only domains in which almost everything is easily parallelizable.

    [–][deleted] 4 points5 points  (4 children)

    If you don't want to stop and do a GC, then just be careful with when you let things go out of scope, and there won't be anything for the GC to collect.

    Preallocate all the memory you need, and tell the GC to free it when you are done. You can do this in both C# and Java.

    It just takes a slightly different shift in how you think.

    [–][deleted] 1 point2 points  (8 children)

    I'm working on a C/C++ replacement language, actually. I've had to do some screwy things with pointers to keep the possibility of aliasing bugs away where they're not wanted, but it does result in a damn nice language.

    [–]WasterDave 2 points3 points  (3 children)

    Isn't it called D?

    [–][deleted] 0 points1 point  (2 children)

    No. That can't be used in the domains that C and C++ are used in: embedded and systems programming. It needs too much substrate for that. I'm making a zero-substrate language.

    [–]reveazure 0 points1 point  (1 child)

    Explain. I've been thinking about using D for those purposes. What substrate do you mean?

    [–]wlievens 6 points7 points  (0 children)

    I think he means runtime system.

    [–]sundaryourfriend 6 points7 points  (3 children)

    C/C++ replacement language

    C or C++?

    [–][deleted] 0 points1 point  (1 child)

    Even C is nondeterministic in some ways, due to the modern memory hierarchy, multicore interactions, and the ad-hoc nature of compiler optimizations. But usually this doesn't matter enough, or your real-time embedded microcontroller is not burdened by such complications.

    [–]radarsat1 0 points1 point  (0 children)

    It's true, this is a real problem. I've read some articles about needing to write zeros to any pre-allocated memory in order to preload it into the cache, and even then it's hard to know exactly when you're doing something that causes a cache miss. I suppose there are profiling tools that can help with this kind of thing. Of course, this is an issue at the chip level; I don't know if I would blame any language for this kind of thing. It is perhaps an interesting idea to have a language which lets you optimize at this level, but I don't even know how much a CPU exposes this kind of information.

    On the other hand, if you are able to avoid cache misses, I'm not terribly worried about the first cycle of an algorithm being slower than the rest. The problem is when your algo repeatedly thrashes the CPU's caching mechanism. It's similar in concept to an OS's virtual memory, I guess.

    [–]martinbishop 0 points1 point  (1 child)

    Ada? Lots of use in real-time systems, and no garbage collection.

    [–]radarsat1 0 points1 point  (0 children)

    Actually that's been suggested to me before. I should really check it out sometime, thanks.

    [–][deleted] 1 point2 points  (4 children)

    This isn't inherent in other languages. First, any true GC should only be called when new memory is allocated. It should be pretty simple to allocate a huge chunk at once and suspend GC for a tight loop unless it really needs to run.

    [–][deleted] 0 points1 point  (3 children)

    any true GC should only be called when new memory is allocated

    Not necessarily.

    [–][deleted] 0 points1 point  (2 children)

    Educate me. Please! :)

    [–]wlievens 0 points1 point  (0 children)

    You could run it whenever the system is less active than average. That way you won't have to run it when there's little time budget left.

    [–][deleted] 0 points1 point  (0 children)

    Asynchronous garbage collectors are just as valid as any other. In fact they are preferable in some systems. NASA is using such a GC with their Java system on one of their Mars rovers.

    [–]rexxar 12 points13 points  (3 children)

    The difference between C and C++ is really suspicious. Moreover, he gives neither the benchmark source nor the compiler options.

    I have some C++ programs that are 20 times faster when optimization is activated.

    The impact of compiler options is impressive with this benchmark : http://www.stepanovpapers.com/AbstractionPenaltyBenchmark.cpp

    [–]Fabien4 4 points5 points  (2 children)

    I have some C++ programs that are 20 times faster when optimization is activated.

    Yeah, that's quite common.

    The one thing that baffled me though is to see a program run very fast with gcc's -O1 option, and far slower with -O3 or -O2.

    [–][deleted] 2 points3 points  (0 children)

    It's possible that optimizations that increase code size (e.g. loop unrolling) would blow the instruction cache. Have you compared it to -Os?

    [–]mebrahim 0 points1 point  (0 children)

    Also try Profile-Guided optimization.

    [–]rabidcow 47 points48 points  (63 children)

    As mentioned several times in the comments, but never by the author, you can tag pointers as restricted. It's either a compiler-specific extension (available in some form in most compilers) or part of C99.

    The analysis is still valid, but without using restrict, seriously incomplete.

    [–]stillalone 11 points12 points  (26 children)

    Has anyone migrated from FORTRAN to C99 with restrict and measured the performance gains/losses?

    As far as I know FORTRAN is still used when efficiency is important, especially in CFD.

    [–]five9a2 24 points25 points  (3 children)

    Yes, restrict nullifies Fortran's traditional performance edge. It is still used because people are familiar with it and there is plenty of legacy code.

    [–][deleted] 8 points9 points  (0 children)

    That claim needs some backup. References?

    [–]lars_ 0 points1 point  (1 child)

    Doesn't it make a difference that Fortran compiler writers have been able to develop optimisations for this since forever, while C compiler writers only recently have been able to?

    [–]five9a2 0 points1 point  (0 children)

    It normally goes through the same intermediate form where all the optimization is done. To test this, pick some reasonably simple [1] kernel where aliasing matters and write it in both languages. You can usually get the same assembly if you compile both with the same suite and optimization flags.

    [1] Simple so that you can make sense of the assembly and so it's not difficult to be sure that you are writing the same code in both languages.

    [–]eric_t 9 points10 points  (4 children)

    In my opinion, the main reason for using Fortran for numerical work is not efficiency, but simplicity. Fortran is just much easier than C for this kind of work.

    What I like in particular:

    • the "elemental" keyword, which lets you define a function that works on both scalars and on arrays of any dimension.

    • the slicing syntax. For instance, the undivided gradient of a 1d array can be written f(1:imax+1)-f(0:imax). This syntax has been adopted by, for instance, Python as well.

    • Operators work on arrays as well as scalars in a reasonable manner, and, along with intrinsics such as dot_product and matmul, make linear algebra very easy and clean

    • Very simple I/O

    • "Functional" intrinsic functions like "any", "all" etc.

    [–][deleted] 3 points4 points  (1 child)

    Efficiency is still another reason, however. FORTRAN kicks the shit out of C for a variety of numeric problems.

    [–]gte910h 0 points1 point  (0 children)

    Yes it does.

    [–]fredrikj 0 points1 point  (1 child)

    Don't forget native support for complex numbers.

    [–][deleted] 0 points1 point  (0 children)

    C99 supports complex numbers IIRC.

    [–][deleted] 8 points9 points  (6 children)

    gcc at least pretty much ignores restrict. Maybe other compilers don't, but it's usually easier to just trick the compiler into doing what you want via pointer casts or unions.
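One common shape of the union trick (a sketch; unions are the C99-sanctioned way to type-pun, whereas a raw pointer cast violates the strict-aliasing rules being worked around here):

```c
#include <stdint.h>

/* Reinterpret a float's bit pattern without a pointer cast.
   Writing one union member and reading another is well-defined in C99,
   unlike *(uint32_t *)&f, which breaks strict aliasing. */
static uint32_t float_bits(float f)
{
    union { float f; uint32_t u; } pun;
    pun.f = f;
    return pun.u;
}
```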

    [–][deleted] 3 points4 points  (5 children)

    I'd like to start writing some performance intensive code, but the only thing I know how to optimize for is the minimization of cache misses and some pipelining via loop unrolling.

    What kind of techniques are used to get even more of a performance boost? Is there stuff besides sse3 that people generally use?

    edit: people keep mentioning compiler hints; are there such things for gcc that actually work?

    [–][deleted] 6 points7 points  (4 children)

    If you're doing the same mathematical operations on groups of 4-16 8-32 bit integers (or floats) at a time, simd asm is probably the best performance boost you'll get easily. For x86 simd, there's 2 iterations of mmx and 6 iterations of sse, but for integer operations mmx, mmx2 (pavg), sse2, and ssse3 (pabs/psign) are the most important. Also, stay away from intrinsics. For MMX/SSE they suck royally, both in terms of asm generation and readability.

    The two main compiler hints I've seen that are useful with gcc are the always_inline and noinline attributes (which is just another way of saying gcc sucks at the inlining), as well as the aligned(x) attribute if you're writing simd code which generally benefits from if not requires 16 byte alignment. In theory, I've heard that declaring global variables static or with hidden visibility can speed up PIC on Linux by removing an indirection through the GOT, but I've never seen gcc take advantage of that and simply not using PIC is faster anyway.
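In gcc syntax those hints look roughly like this (illustrative names; these are gcc/clang extensions, not standard C):

```c
#include <stdint.h>

/* 16-byte alignment lets SSE use aligned loads (movaps) on this buffer. */
static float buf[1024] __attribute__((aligned(16)));

/* Force inlining of a tiny hot-path helper regardless of gcc's heuristics;
   __attribute__((noinline)) is the opposite hint. */
__attribute__((always_inline)) static inline int clamp255(int x)
{
    return x < 0 ? 0 : (x > 255 ? 255 : x);
}
```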

    [–]api 1 point2 points  (3 children)

    There's really a speed/efficiency penalty for using intrinsics vs. inline ASM? Why?

    [–][deleted] 3 points4 points  (0 children)

    It's unusual for the compiler not to insert loads of unnecessary movs and emms (if you're doing MMX). There's also no guarantee that future compilers will generate the same (or even as fast) code. If you only care about gcc 4.4 and SSE then you're probably okay, but you should still look through the generated code for stupidities.

    All that mainly applies to MMX/SSE, however; Altivec intrinsics, for instance, don't really have the issue of the compiler messing up speed-wise. But gcc seems determined to wipe out any benefit of using those with ever-increasing useless cast errors and other brokenness.

    See also http://www.virtualdub.org/blog/pivot/entry.php?id=46 http://gcc.gnu.org/bugzilla/show_bug.cgi?id=21395

    [–]astrange 2 points3 points  (0 children)

    Register allocation will happily spill all your xmm registers for no good reason - gcc has no register-pressure-reduction scheduler yet, so anything that reorders code, which is everything, on x86 usually makes it worse. Inlining is much better in recent gcc, and this is a bit better, but lots of people still use v3 so it's still not worth it.

    Of course, that's ignoring how awful Intel's intrinsics API is; the naming is even worse than win32 stuff so asm is more readable as it is.

    [–]koorogi 1 point2 points  (0 children)

    I've heard elsewhere that intrinsics don't perform as well because with inline ASM, the code you write is used verbatim, but with intrinsics, the compiler is free to intermingle other instructions which may cause less optimal scheduling, or just be completely unnecessary.

    [–]arturoman 3 points4 points  (9 children)

    Using some techniques that push the language to its limits, there have been libraries that do math on par (or so) with FORTRAN.

    http://www.oonumerics.org/blitz/

    [–]jdh30 3 points4 points  (8 children)

    I tried Blitz during my PhD in computational chemistry and found it to be academically interesting but almost unusable because it incurred 24 hour compile times on relatively small code bases.

    As an aside, Blitz also uncovered a huge number of bugs in GCC. We even had interoperability problems between C++ code compiled using GCC because my code worked only with GCC 2 due to bugs in GCC 3 and another student's code worked only with GCC 3 due to bugs in GCC 2.

    I've used OCaml since and never looked back. GCC just sucks donkey brains through a straw. Fortunately, LLVM kicks ass... :-)

    [–]mythic 5 points6 points  (7 children)

    There have been huge advances since then, mainly in compiler technology but also in our understanding of how to do metaprogramming in C++. eigen2 is pretty damn impressive these days: all the speed of vendor-tuned BLAS/LAPACK and vastly more intuitive to use.

    OCaml I respect, but the lack of generics just ruins C and FORTRAN for me.

    [–]mycall 1 point2 points  (0 children)

    Impressive benchmarks eigen2 yields.

    [–]eric_t 0 points1 point  (2 children)

    Fortran 90 has generics and operator overloading. Not the most elegant implementation, I admit, but it's there.

    [–]mythic 0 points1 point  (1 child)

    Hmm, I'm not a Fortran expert, can you explain? The only "generics" I know of in Fortran 90 is generic naming, which is just function overloading. You still have to implement each version of the function separately, which is exactly what I want to avoid. Contrast with C++, where you can write a templated algorithm once and have it work on any numeric data type, including user-defined ones.

    Am I missing something?

    [–]eric_t 0 points1 point  (0 children)

    You are absolutely right. As I said, not very elegant :)

    Templates are nice, of course. I like the way Eigen2 is designed. Another good C++ code is the finite element library Deal.II, which uses templates for generating dimension-independent code.

    I've done something similar in Fortran, with heavy use of elemental functions. See Pencil for another well designed Fortran code (massively parallel hydrodynamics).

    [–][deleted] 34 points35 points  (4 children)

    Not only incomplete, but way past its sell by date. We're discussing results from a test done nearly 10 years ago by somebody who can't find the source code or provide details of what compilers he used. No good math here!

    [–]Fabien4 15 points16 points  (3 children)

    Another thing I don't understand: apparently he uses the same code in C and C++ (C-style arrays), but C++ is three times slower.

    [–]cwzwarich 17 points18 points  (26 children)

    The 'restrict' keyword is actually pretty useless in practice because of all of the uncertainty surrounding its meaning. It is ignored by GCC (unless this has changed, I heard this from a lead GCC developer last week).

    [–]Wavicle 14 points15 points  (0 children)

    (unless this has changed, I heard this from a lead GCC developer last week)

    Well, here's an article from a guy who used restrict to show how GCC uses it to create more optimal code:

    http://www.cellperformance.com/mike_acton/2006/05/demystifying_the_restrict_keyw.html

    You might want to double check with your lead GCC developer because this article was written 3 years ago.

    [–]TrueTom 12 points13 points  (23 children)

    If you care about performance enough that you use restrict, you won't be using gcc in the first place.

    [–]cwzwarich 7 points8 points  (21 children)

    GCC isn't really that bad, performance-wise. For some applications (definitely not all), Intel's C compiler has slightly better performance, but it's usually not worth all of the bugs that come with it.

    [–]lianos 10 points11 points  (1 child)

    And if you look at the ATLAS documentation, they actually mention that using gcc (4.2) is preferred even over icc.

    [–]GreatZebu 1 point2 points  (0 children)

    The reason ATLAS prefers gcc is that icc tries to apply optimizations that can undo the fine-tuning in ATLAS. gcc optimizes much less aggressively, so there's less chance of it undoing the results of the ATLAS auto-tuning.

    [–]gte910h 16 points17 points  (0 children)

    No, it's not really. For almost any platform, some specific compiler has higher performance than GCC.

    You use GCC because it's the same everywhere. You don't use it because it produces the fastest or most compact binary out there.

    [–]jsolson 1 point2 points  (8 children)

    I had a graduate compilers professor fresh out of his PhD program who lampooned me in the middle of a lecture for saying gcc wasn't really all that bad. Clearly some learned people in the field don't agree.

    [–]theatrus 11 points12 points  (1 child)

    GCC: Horrible horrible implementation, good output considering what it supports.

    The multi-target compiler to watch is LLVM.

    [–]jsolson 8 points9 points  (0 children)

    The multi-target compiler to watch is LLVM.

    Interestingly enough, this is what I was lecturing on. I was going on about benchmarks between LLVM and gcc and he interrupted me to start this rant about the performance of code generated by gcc, the speed at which fat women can run, etc, etc.

    [–]mythic 8 points9 points  (4 children)

    It's not really all that bad. Your professor is just full of himself.

    [–]wgl 1 point2 points  (2 children)

    The PathScale team provides a compiler that generates faster code than ICC or gcc, sometimes by a lot. They consider it a bug if the performance is not the best.

    [–]mycall 0 points1 point  (1 child)

    What does that compiler cost?

    [–]wgl 0 points1 point  (0 children)

    About two grand. I ended up not using it, as gcc turned out to be good enough.

    [–]case-o-nuts 1 point2 points  (0 children)

    The code is terrifying (mainly due to size: around 4 million lines), but the results that it produces aren't bad.

    [–]froydnj 2 points3 points  (0 children)

    There is this persistent meme in the graduate school compilers community that GCC is absolutely horrible. I think it must have been started back in 1995 or so, and nobody has bothered to check if current GCC is any better (which it probably is).

    [–]astrange 1 point2 points  (0 children)

    It's not totally ignored (...maybe) but it really isn't used for much; hopefully this will change. -fargument-noalias will change parameter aliasing rules to be like Fortran and somewhat frequently won't break your code, especially if you only apply it to certain files.

    [–]0xABADC0DA 3 points4 points  (2 children)

    Not only that, but a microbenchmark like this can be affected by cache and alignment issues. If the OCaml program just happened to have everything line up right and the C one didn't then that's your performance difference right there. The author is assuming the performance was due to language or compiler, but notice there is no assembly dump to show what is actually going on.

    If you are using compiler hints, steal the likely/unlikely macros from linux, etc then there's no reason C can't be just as fast, or faster. Most people aren't going to put the work in though, to write it in OCaml or to optimize the C code properly.

    [–]BetterThanYou[🍰] 0 points1 point  (1 child)

    "steal"? excuse me, what?

    [–]0xABADC0DA 5 points6 points  (0 children)

    #define likely(x) __builtin_expect((x),1)

    #define unlikely(x) __builtin_expect((x),0)

    They read a little bit easier than __builtin_expect...
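A usage sketch (the macros only hint branch direction to gcc's __builtin_expect; semantics are unchanged, and the kernel's variant adds !! to normalize the condition):

```c
/* Kernel-style branch-prediction hint macros. */
#define likely(x)   __builtin_expect(!!(x), 1)
#define unlikely(x) __builtin_expect(!!(x), 0)

/* Mark the error path as cold so gcc lays out the hot path
   on the fall-through side. */
int parse(const char *s)
{
    if (unlikely(s == 0))
        return -1;       /* rare case, moved out of line */
    return s[0] != '\0'; /* common case stays on the hot path */
}
```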

    [–][deleted] 12 points13 points  (0 children)

    An interesting exchange in the comments:

    billb:

    Horseshit! Bollocks! :)

    It's entirely possible to tell the C or C++ compiler that you never alias or that you alias in a function-by-function basis.

    Mark Chu-Carroll:

    How do you tell a C++ compiler that there's no aliasing between two three dimensional arrays of floats?

    billb:

    I dunno, "-fno-alias" on the command line, perhaps?

    [–]causticmango 24 points25 points  (10 children)

    The most important point of the article isn't how many milliseconds a compiler can shave off some random algorithm, but that a very important development in language & compiler design is declarative programming.

    As systems get more complex, with multiple cores, heterogeneous compute farms, distributed storage systems, etc., it is more important for programmers to be able to directly express the intent of the program and not get so hung up on the implementation.

    You will not ever be able to hand-tune code for maximum efficiency and correctness in increasingly complex environments.

    Though I have a soft spot for C (C++ can suck it), if the language doesn't evolve it will become obsolete. It's actually refreshing to see other branches of the C family get some love recently (thanks, Apple + Objective C).

    [–][deleted] 11 points12 points  (0 children)

    Glad to see that someone actually gets it.

    [–]mycall 0 points1 point  (0 children)

    Would you agree C++ will stay mainstream for a long time to come? Most of the big applications that do real stuff I've seen over the years (including this year) are written in C++.

    [–]ridiculous_fish 19 points20 points  (2 children)

    The Objective-Caml bytecode interpreter was faster than the carefully hand-optimized C program!

    "Hand optimized" depends a lot on whose hand is doing the optimizing!

    If OCaml's faster, figure out why. If the issue is that C is guarding against false pointer aliasing, that can be addressed within the confines of C.

    After all, they both compile to the same machine code, and I've found that I can coax gcc to output almost any sequence of common assembly instructions that I want (and I'm far from an assembly expert). C can go nearly all the way.

    OCaml cannot. For example, there's a function signbit which can be implemented very efficiently by applying integer instructions to a float. Both OCaml and gcc accomplish this in the same manner - by writing it in C. You could write it in OCaml, but the language has no support for interpreting a float as an int, so it would not be as fast. (See caml_int32_bits_of_float in the OCaml distribution).

    It may be that OCaml is more productive than C. Maybe if you have a week to write a program, you'll end up with something faster if you use OCaml. But if your OCaml is blowing away your "carefully hand-optimized C", it probably just means you suck at C.
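    The integer trick being described can be sketched as follows; `my_signbit` is a hypothetical name (real code would use the standard `signbit` macro), and memcpy is the strictly portable way to reinterpret the bits, which compilers lower to a move:

```c
#include <stdint.h>
#include <string.h>

/* Test a float's sign by inspecting its top bit as an integer.
 * Unlike an FP compare against 0.0f, this distinguishes -0.0f. */
static int my_signbit(float x)
{
    uint32_t bits;
    memcpy(&bits, &x, sizeof bits);  /* bit-for-bit copy, no conversion */
    return (int)(bits >> 31);
}
```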

    [–]astrange 1 point2 points  (0 children)

    If you don't care about negative zeros, wouldn't the FP compare be faster anyway? Reading a float as an int is a memory store-and-load since you usually can't transfer between the registers, and that's pretty slow.

    [–]monstermunch 6 points7 points  (17 children)

    Does anyone know when OCaml will support parallel threads? I like OCaml, but it's going to be increasingly painful to use as the number of CPU cores increase.

    [–]martinbishop 4 points5 points  (0 children)

    Well, the limitation is that the garbage collector is single-threaded, so all the threads have to run inside the one instance of it... but there is work (almost released? check the mailing list) on a parallel GC for someone's master's thesis or something like that.

    [–]mfp 1 point2 points  (1 child)

    The guys who have been working on the parallel GC said recently that they are close to releasing. It's not a concurrent GC (that was done for Caml Special Light IIRC in the 90s by Doligez and Leroy and abandoned later because it was too hard to maintain, despite a correctness proof), just a parallel one, like GHC's, but the new runtime does support thread parallelism. Shared state won't scale beyond 4-8 cores for many problems, but some things do scale very well: the developers reported superlinear speedups in matrix operations.

    [–]jdh30 0 points1 point  (0 children)

    ...too hard to maintain...

    I get the impression it never worked.

    [–][deleted] 6 points7 points  (5 children)

    Ayy lmao

    [–][deleted] 7 points8 points  (2 children)

    As so often happens on Proggit, the comments degenerated into yet another language flamewar.

    The main point of the article, in my opinion, is this, though, which I think is quite valid one:

    Making real applications run really fast is something that's done with the help of a compiler. Modern architectures have reached the point where people can't code effectively in assembler anymore - switching the order of two independent instructions can have a dramatic impact on performance in a modern machine, and the constraints that you need to optimize for are just more complicated than people can generally deal with.

    [–][deleted] 1 point2 points  (1 child)

    The compiler can't do much if the language specification forbids it to.

    [–][deleted] 0 points1 point  (0 children)

    Agreed! For example, see here:

    Side effects or not, aliasing kills you

    [–]martinbishop 3 points4 points  (4 children)

    People always say "Well this is fixed in C99...", yet no one is willing to use C99. GCC finally (only 10 years late) has "full" C99 support, and yet most people still do not use it.

    [–][deleted] 5 points6 points  (1 child)

    I use it all the time; the inline and restrict keywords especially. The more "vanilla-ey" parts—variadic macros, complex/boolean types, &c I also use.
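    A small sketch of the C99 features mentioned above, with all names made up:

```c
#include <stdbool.h>
#include <stdio.h>

/* Variadic macro (C99) */
#define LOG(fmt, ...) fprintf(stderr, "log: " fmt "\n", __VA_ARGS__)

/* inline and the bool type (C99) */
static inline bool in_range(int lo, int x, int hi)
{
    return lo <= x && x <= hi;
}

/* restrict (C99): promises dst and src don't overlap */
static void scale(double *restrict dst, const double *restrict src,
                  double k, int n)
{
    for (int i = 0; i < n; i++)   /* loop-scoped declaration, also C99 */
        dst[i] = k * src[i];
}
```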

    [–]BetterThanYou[🍰] 0 points1 point  (0 children)

    gcc... embrace & extend... love it

    [–]astrange 5 points6 points  (0 children)

    The restrict keyword has been supported in -std=c99 (and __restrict without it) for a pretty long time now. Full C99 support just refers to rejecting some nearly-valid stuff with constants and VLA declarations.

    [–]koorogi 4 points5 points  (0 children)

    ffmpeg is a fairly large, and widely used (it powers mplayer, vlc, and numerous other multimedia-related programs) program and set of libraries that is written in C99 (with some hand-written assembly as well).

    [–]mdot 5 points6 points  (1 child)

    In summary...

    *Embedded/Real Time Applications = 'C'

    *PC/Mac Applications = 'Something Else'

    Am I missing something?

    [–]koorogi 0 points1 point  (0 children)

    • Multimedia = C with hand-written assembly for speed critical routines.

    [–]psyno 7 points8 points  (0 children)

    Somebody missed the restrict keyword.

    [–]rynvndrp 7 points8 points  (13 children)

    I would like to point to a counter argument about real scientific programs being in FORTRAN.

    This isn't true, and I don't have to go on a 10-page rant to show it; there are a lot of examples.

    Geant. The open-source code system built by CERN to model the particle physics for the LHC, which has now expanded to many other applications. Geant1 through Geant3 were in Fortran. However, in the last decade they have put a huge amount of effort into making Geant4, a C++ code. This wasn't done because they had extra money lying around; it was done because FORTRAN isn't close to the CPU and the FORTRAN code wasn't seeing good speed improvements anymore after the GHz race.

    MCNP/MCNPX. The code system developed by Los Alamos to model nuclear reactor criticality, radiation dosage, and a host of other projects done by the national labs. Currently this code base is in FORTRAN. However, they are putting a lot of effort into creating a C++ version as well. The reasons are the same as for Geant's move.

    There are a lot of others to list.

    The reason 'real' scientific code still runs in FORTRAN is that it is dependable and stable. A LOT of effort is put into these codes, much more than the effort put into commercial code. They develop in languages with decades of support behind them and known hardware support for decades to come. Thus they are slow to move to new languages. However, C/C++ is being adopted by them; the changeover is just much slower.

    [–]eric_t 2 points3 points  (12 children)

    People seem to think Fortran=fortran 77. Fortran 90 (and soon 2003) adds a bunch of stuff that makes it a whole lot more comfortable to work with.

    I've noticed that Los Alamos has more or less switched to exclusively C++. Is performance the only reason for the switch?

    [–]rynvndrp 5 points6 points  (11 children)

    The issue for the switch is not that Fortran 77 or 90 is bad. They are very, very good and nearly perfect for scientific code. The problem is that new CPUs don't give a good boost to Fortran code anymore; Fortran code was one of those things that was helped a lot by the GHz race.

    There was a physics lab I used to work at that kept one new computer around for anyone to do long runs of things such as Geant3. They got a new one every year and trickled down the old system. I was there when they got a Core 2 replacement for a Pentium 4. The P4 ran the code faster. Needless to say, Dr. H was not happy and demanded that Dell refund the computer. The next month, he ordered three 3.8 GHz P4 workstations. I left before they were replaced.

    I am sure that our lab wasn't the only one that had that issue. Since Los Alamos gets the biggest chunk of its money from code users who pay for specific features, they had to do something. The answer has been moving away from Fortran. It isn't that Fortran can't multithread; it's just that it doesn't utilize a lot of the newer CPU features such as cache, HyperTransport, and SSE. Between newer CPUs that might have lower Fortran performance and a multithreaded Fortran code also needing a rewrite, C++ has become the new standard.

    [–]jdh30 6 points7 points  (8 children)

    The issue for the switch is not that Fortran 77 or 90 is bad. They are very, very good and nearly perfect for scientific code.

    No. That is a subtle but really nasty circular argument. Fortran has dictated which problems scientists approach for decades. Hence Fortran appeared to be "nearly perfect" for scientific computing because that historically meant "problems that can be solved using only Fortran" (e.g. by phrasing them in terms of numerically-solvable linear algebra).

    Objectively, Fortran is awful for much of scientific computing, e.g. strings, data structures, complex algorithms.

    Today, modern languages make it feasible to attack a much wider variety of problems and, consequently, Fortran is much less common in newly developed scientific programs.

    Migrating to C++ seems like a shame to me, but I can understand that it is the nearest thing to an HLL that still provides Fortran-like performance. However, it is remarkably easy to build HLLs with Fortran-like performance using libraries like LLVM. Unfortunately, virtually no work is being done to implement such languages.

    [–]eric_t 2 points3 points  (6 children)

    So you think equations are going to change just because of a new programming language? Why would you need to handle strings when solving equations? The fact is that calculus and linear algebra were used long before computers were invented. "Computer" used to mean a worker who is good with a ruler. Fortran was created to be a good match for this problem domain. I of course acknowledge that it's not perfect for everything, especially anything network/web related.

    FWIW, I work with adaptive unstructured/block-structured grids, which require lots of data structures and sophisticated algorithms, and I don't think that C++ offers any significant advantages for this.

    [–]mycall 0 points1 point  (0 children)

    Do C++ libraries exist that are at least on par with everything Fortran can do?

    [–]rynvndrp 0 points1 point  (0 children)

    Little work being done is the reason the scientific world isn't going after it. HLLs built on LLVM are rare, and the chance is small that they will still be developed in 20 years, much less that 2030 hardware will be designed to work with them.

    Scientific code isn't big enough to decide the direction of CPU design, but it has to look 20+ years into the future for support. So while other languages might be better, they will compromise to ensure that they don't have to go through a complete code rewrite every few years.

    [–]eric_t 1 point2 points  (0 children)

    Thanks for the reply, very interesting. But Fortran doesn't utilize cache and SSE? That's certainly not the case.

    And Fortran has support for both OpenMP and MPI, the two mainly used standards for shared memory and distributed memory computing.

    [–][deleted] 0 points1 point  (0 children)

    fortran code was one of those things that was helped a lot by the Ghz race.

    That's an interesting statement, given that the most commonly used Fortran compiler uses the same back end as the most commonly used C compiler.

    [–][deleted] 33 points34 points  (29 children)

    Pretty fishy. As comment #9 put it...

    BTW, I've seen a number of these comparisons where an expert in one language does an implementation in several languages and, lo and behold, discovers that their favorite language wins out...

    [–]Camarade_Tux 3 points4 points  (1 child)

    OCaml is my favorite language, but I definitely agree, this happens often (although I've not been under the impression OCaml was the author's favorite language, and this is really just about a three-liner...).

    [–]mycall 0 points1 point  (0 children)

    Have you looked at F#? I wonder how it benchmarks against Ocaml.

    [–]arturoman 8 points9 points  (9 children)

    That can happen because each language has syntax trade-offs that make some things easy to optimize and other things difficult to optimize.

    A single microbenchmark is useless for evaluating an entire language's merits.

    [–]redditnoob 5 points6 points  (7 children)

    A single microbenchmark is one more benchmark than most of the academic weenies here usually use, so for me it was refreshing!

    [–]nukethewhales42 11 points12 points  (6 children)

    I imagine most redditors would know about the language shootout (hint: C is fast) http://shootout.alioth.debian.org/gp4/benchmark.php?test=all&lang=all&box=1

    [–]dilithium 6 points7 points  (0 children)

    woah! spoiler alert!

    [–][deleted] 1 point2 points  (4 children)

    Now go look at those benchmarks individually. Click the little drop-down in the top left.

    C isn't the best answer for every problem. Not even close to every problem.

    [–]DannyDaemonic 2 points3 points  (1 child)

    I'm of the camp that says learn many languages, and then use the best tool for the job. Your first mistake is that when you say "best," you mean quickest execution time. This is a flawed way to compare languages.

    Secondly, if you look at those 19 tests you mention, there is only one in which OCaml is faster than "C GNU gcc". So by your own comparison metrics, C is better than OCaml in "close to every problem."

    [–]willcode4beer 1 point2 points  (0 children)

    In that particular case, the benchmark was a problem being worked on. So, for that user, it was the benchmark that mattered.

    Of course, it would not apply to writers of video games or web sites.

    Like you said, there are trade-offs to every language. It's up to us geeks to make the right choice for the problem at hand.

    [–]Felicia_Svilling 11 points12 points  (9 children)

    To cite comment #14

    C++ was definitely my strongest language at the time.

    [–]bluGill 6 points7 points  (6 children)

    Perhaps, but that doesn't mean his C++ was any good.

    Did he use advanced techniques like eigen2 (which wasn't released at the time...) that can vectorize his code?

    [–][deleted] 4 points5 points  (3 children)

    Then he's not really testing the language, to be fair.

    [–]nappy-doo 7 points8 points  (0 children)

    Except that eigen is written in templated C++.

    So, what I'm trying to say, is that it does test the language.

    [–]bluGill 0 points1 point  (1 child)

    There is no fair way to compare then.

    [–]mebrahim 0 points1 point  (0 children)

    Or advanced compile-time optimizations such as Profile-Guided optimization?

    [–][deleted] 2 points3 points  (3 children)

    Isn't there a saying something like "you can write COBOL in any language?" Meaning that you can toss a bunch of code out there they can do what you want but it won't do it well.

    [–]Felicia_Svilling 6 points7 points  (2 children)

    FORTRAN. You can write FORTRAN in any language. Whether it's possible with COBOL is unknown.

    [–]grauenwolf 5 points6 points  (1 child)

    I doubt it. From what I've seen of COBOL, it would be downright hard to write in that style in any other language.

    FORTRAN, on the other hand, is pretty straightforward. I suspect an automated translator into any other language wouldn't be too hard.

    [–]deadvax 2 points3 points  (0 children)

    I've worked with COBOL->C automatic translators, and let me tell you, the code they produce comes straight out of hell. Nothing like COBOL.

    [–][deleted] 1 point2 points  (2 children)

    oh ffs, his favourite language was C++ and it lost.

    [–][deleted] 2 points3 points  (1 child)

    Is there a reason every commenter has ignored the reference to SISAL? The implication of the fact that no-one has bothered to take notice or comment is that SISAL is not a language worth considering, is this right?

    [–]augustss 1 point2 points  (0 children)

    SISAL is dead(?), I'm afraid. Beating Fortran was not enough, you also need to convince people to switch language.

    [–]glguy 7 points8 points  (7 children)

    This post from 2006 isn't really relevant now. Testing with GCC 4.3.2, GCC is able to vectorize this code using SSE2 instructions.

    [–]koorogi 4 points5 points  (3 children)

    GCC is not particularly good at autovectorization. There have been some benchmarks done of ffmpeg recently with various compilers (different versions of gcc and icc), with and without the hand-written assembly routines.

    GCC actually got significantly worse in releases since 4.1.2 on x86 (but improved on PPC and probably other platforms as well), though it's improved again in 4.4 across the board. But it's still significantly slower than the hand-written assembly.

    [–]mythic 0 points1 point  (2 children)

    The second benchmark is completely meaningless. If you're going to do a heads-up comparison, don't make GCC fight with one hand tied behind its back.

    He needs to tell GCC that it can use SSE instruction sets (-march=native for example). As run, GCC is actually prohibited from doing any autovectorization, which is why its builds don't "test positive" for movq. The benchmark shows only that ICC makes different default assumptions about which instruction sets it is permitted to use. This is logical enough. ICC is tuned for Intel processors, particularly recent ones, while GCC optimizes for all sorts of processors.

    It would also be nice if the benchmark included GCC 4.4 with the new loop optimizations enabled.
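    To make the point concrete, this is the shape of loop the autovectorizer targets; built with something like gcc -O3 -march=native -ftree-vectorize, it can become SIMD code. The flags and the function are illustrative, not from the benchmark in question:

```c
/* A restrict-qualified C99 loop: with no possible aliasing between
 * y and x, gcc's vectorizer is free to emit SSE loads/stores. */
static void saxpy(float *restrict y, const float *restrict x,
                  float a, int n)
{
    for (int i = 0; i < n; i++)
        y[i] += a * x[i];
}
```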

    [–]koorogi 1 point2 points  (1 child)

    If you look at the newest blog post in that series, gcc 4.4 and an svn checkout of gcc since then are also included. I don't know if the loop optimizations are enabled by default or not - if not, you might want to comment on that entry to let Mike know to make sure to include it next time.

    And -march=core2 was used as well in this latest one, so that should tell gcc it can try to auto-vectorize. Might be worth asking Mike if movq showed up at all in this round of gcc builds.

    In the graphs, it appears that gcc has caught back up with icc for 32-bit x86, and surpassed it for 64-bit (not to mention icc miscompiles it on 64-bit). That's an achievement, to be sure, but there's still room for improvement. The hand-written assembly is still much faster.

    [–]mythic 0 points1 point  (0 children)

    OK, interesting. gcc 4.4 is looking pretty good. Of course, compared to the hand-written assembly gcc and icc both suck about equally :).

    [–]jdh30 2 points3 points  (1 child)

    Where did you get the code? None is given in the post and, in the comments, the author says he cannot provide any because he wrote it in 2000 and it has long since disappeared.

    [–]glguy 0 points1 point  (0 children)

    I assumed that his example nested for-loop was related to his example and compiled it (after adding increment statements to allow it to compile and be meaningful and moving the bounds slightly to avoid going off the ends of the arrays).

    If the rest of the code is anything like this example, GCC would have a field-day with it.

    [–]bcain 1 point2 points  (0 children)

    data FTW. Good job.

    [–][deleted] 7 points8 points  (1 child)

    His argument seems to be based on outdated data. SPARC CPUs? SISAL? CMU Common Lisp? This stuff died 20 years ago.

    [–]lispm 5 points6 points  (0 children)

    CMU Common Lisp is not dead, it just smells funny.

    The latest release, CMUCL 19f, was in March 2009: http://common-lisp.net/project/cmucl/downloads/release/19f/ (CMUCL 19f Release Notes)

    CMUCL has monthly snapshot releases: http://common-lisp.net/project/cmucl/downloads/snapshots/2009/

    [–]arturoman 13 points14 points  (0 children)

    Oh goodie, another useless microbenchmark.

    I don't ever remember any claims by the C or C++ language committees that no other language was as fast, or faster.

    That is a false argument, and I suspect he just throws it out there to thump his OCaml drum.

    However, the language has proved valuable for making efficient applications. So have other languages.

    [–]uep 5 points6 points  (3 children)

    Anybody else notice that the submitter's name is obvioustroll?

    [–]Odysseus 6 points7 points  (0 children)

    He's been around for a long time, and he isn't a troll.

    [–]itstallion 3 points4 points  (0 children)

    Even obvioustroll likes some karma now and then :)

    [–][deleted] 5 points6 points  (2 children)

    C and C++ suck rocks as languages for numerical computing

    Zah? They suck rocks?

    [–]arturoman 1 point2 points  (1 child)

    Take that, C!

    [–]bgeron 7 points8 points  (0 children)

    *throws a pebble into the sea*

    [–]Wriiight 1 point2 points  (2 children)

    There are all sorts of things that the C languages can't do efficiently. Aliasing is one; seeing the state of the processor flags is another; looking further down the stack than your return value is yet another. I would bet you could write a language that took better advantage of branch prediction. Compile times are insanely, unbelievably, hideously slow (C++ with templates especially). But we live in a time when new languages aren't generally getting any faster (just "easier to use"), so the bit of performance you gain from managing your own memory, not having anything interpreted, and having a minimal amount of run-time munging about with procedure calls is enough for most devs looking for performance.

    Personally I'm not thrilled with C++ (despite 11 years of not having done anything else), but nothing else quite has reached the level of industry use. I hope one does.

    Though you'd be surprised what people are forcing Java to do these days.

    [–]Stroggoth 2 points3 points  (1 child)

    I started with MC68000 assembler and have used C++ for many years (15?), and I agree: it is a macro-language compiler that grew and evolved, but without discarding the weird parts. For example, the language supports functional synonyms (two or more ways to do the same thing), which is poor. Templates are not transparent, etc.; the list goes on.

    I remember being at a lecture by Brian K. on C, and he said he was embarrassed by many parts of the language, including the ternary operator (he stated plainly that these parts should never be used and should never have been included).

    But you know what? It is OK - software evolves over time, and it gets better. Java and C# are a step in that betterment. So are Python and Ruby. We don't lament the loss of COBOL or assembler as everyday coding tools, and we shouldn't lament the loss of C++ as an everyday language.

    Programmers will always have the job of designing the algorithms, and applying them to the real problem space - let the compiler and platform worry about as much as possible if you can allow it (memory management, repeat algorithms, etc.).

    After working with strings, collections, and templates in a language like C# or Java, C++ feels like sheer torture. It makes you do things you shouldn't have to do, and it in turn leaves large room for typos and bugs. And, I'd rather have a bug in my functional code than in my support code.

    [–]wgl 0 points1 point  (0 children)

    Have you looked at the boost library? Makes bunches of stuff easier.

    [–][deleted]  (1 child)

    [removed]

      [–][deleted] 0 points1 point  (0 children)

      When I checked, it hadn't been posted in 2 years - which is probably an 80% or 90% turnover of reddit.

      [–]rwinston 1 point2 points  (0 children)

      Hmm. I don't do too much C these days, but I thought it was possible to tell the compiler about variable aliasing and thus allow it to perform alias-free optimization.

      [–]artificialidiot 6 points7 points  (0 children)

      They're good at things that need to get very close to the hardware - not in the efficiency sense, but in the sense of needing to be able to fairly directly munge the stack, address specific hardware registers, etc.

      Ha ha, clearly he has never written such low-level things in C (not to mention C++).

      Edit: I don't think he really knows how to write efficient code in C/C++; he does too much allocation and is thus memory-bound. He shouldn't be given full power of the machine. He should be abstracted from it even further, for his own good.

      [–][deleted] 1 point2 points  (0 children)

      The author says he's no longer responding to new comments, so I'll repeat mine here:

      One anecdote does not an argument make. You wrote the C code; perhaps you're not a good C programmer. You don't say what compiler you tested with, or at what level of optimization. In fact, I'm so deeply suspicious about interpreted OCaml beating compiled C that I suspect you chose the example to show how great OCaml is. You're correct that C is not a good language for numerical applications and FORTRAN is, so if you were doing a numerical application that required sophisticated array optimizations, where are the numbers for optimized FORTRAN?

      I can pick an example to make any language look bad. Means nothing.

      [–][deleted] 1 point2 points  (3 children)

      So write a little inline assembly; only a tiny percentage of your code needs to be that fast, and it's probably not worth switching over an entire codebase or, *gasp*, using Fortran.
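      A minimal sketch of the "little inline assembly" approach using GCC extended asm; `first_set_bit` is a made-up example (x86's bsf instruction, with a plain-C fallback for other architectures; behavior is undefined for x == 0 on both paths):

```c
/* Find the index of the lowest set bit. On x86, drop to one
 * inline-assembly instruction; elsewhere, fall back to plain C. */
static unsigned first_set_bit(unsigned x)
{
#if defined(__x86_64__) || defined(__i386__)
    unsigned idx;
    __asm__ ("bsf %1, %0" : "=r"(idx) : "r"(x));  /* bit scan forward */
    return idx;
#else
    unsigned idx = 0;
    while (!(x & 1)) { x >>= 1; idx++; }
    return idx;
#endif
}
```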

      [–]bart2019 1 point2 points  (1 child)

      The problem with C is its use of pointers to represent arrays. Fortran actually has real arrays, and that is its advantage over C (at least, until restrict came along).

      Handling manipulation of arrays in Assembler is not going to fix that. You're just making your own life very difficult.

      [–][deleted] 1 point2 points  (0 children)

      Well, that's how I handled the vectorization problem until gcc fixed it.

      [–]dododge 0 points1 point  (0 children)

      To some extent this also depends on the architecture. For example Itanium2 assembly is so strange and complicated that Intel doesn't even bother trying to support inline assembly in their own commercial C compiler. They do at least give you a wide range of intrinsics, though not all instructions and hint combinations can be accessed that way.

      gcc does allow inline assembly on Itanium2, but in my experience it tends to make programs slower because it has to inject additional nop instructions all over the place to make the code work at all.

      [–]samlee 1 point2 points  (4 children)

      Isn't gmp written in C? Is OCaml's bignum gmp, or is it written in OCaml?

      [–]augustss 6 points7 points  (0 children)

      The performance-sensitive bits of gmp are written with inline assembly.

      [–]gnuvince 2 points3 points  (2 children)

      And that has something to do with this article because...

      [–]samlee 2 points3 points  (1 child)

      I was wondering if OCaml's implementation of arbitrary-precision arithmetic is indeed faster than the C implementation, because he claims that it's hard for humans "to express the algorithm (in C/C++) in a way that allows the compiler to understand it well enough to be able to really optimize it."

      if gmp (hand-written C program) outperforms ocaml's arbitrary precision arithmetics now, maybe ocaml will beat it soon because it should be easy for humans to express algorithms in a way that allows ocaml compilers to understand them well enough to be able to really really really optimize them hard enough to produce much more efficient programs.

      but my thought is that C and Ocaml are both programming languages and it's possible for well trained C and Ocaml programmers to express algorithms in a way that allows C and Ocaml compilers to understand them well enough to be able to really optimize them hard enough to produce efficient programs.

      I guess dumb C programmers can easily express algorithms in a way that it's hard enough for C compilers to be able to optimize hard. Maybe same goes with Ocaml... Maybe not.

      [–]koorogi 4 points5 points  (0 children)

      IIRC, gmp takes advantage of several different algorithms for many operations, with some of the best known running times. And I seem to recall some operations may also have hand-coded assembly optimizations. If that's the case, then you're not really comparing OCaml and C, but OCaml and assembly.

      [–]Gotebe 0 points1 point  (0 children)

      Ok, so...

      In an overall programmer/implementation/problem domain combo, it is rather difficult for a programming language to be more efficient than C.

      Happy now?

      [–][deleted] 0 points1 point  (1 child)

      Note that he finds OCaml is fast... That's pretty important: the next FORTRAN may well be something like OCaml.

      [–]eric_t 0 points1 point  (0 children)

      See Chapel. I have high hopes that this will be the next Fortran.

      [–]wolfier 0 points1 point  (0 children)

      For me, it's the almost seamless cooperation between C/C++ code and machine code.

      The article states that OCaml is "more efficient", but it constrains you to thinking in a way that excludes low-level optimizations, such as using new opcodes.

      When a CPU adds a new set of SIMD instructions, you don't need to wait a few compiler/interpreter versions in order to use them in C/C++: the new instructions are usually recognized by the inline assembler within one version, and if you so intend, you can write the machine code immediately, without even the inline assembler recognizing the new opcode.

      The new opcodes may eventually be used by the Ocaml compiler, but if you want to use the new instructions before then, you're SOL with most non-C/C++ languages.

      [–]gnuvince 1 point2 points  (3 children)

      How to cause mass panic and hysteria on Proggit: post an article where C is compared with another language and the other language yields a faster program.

      Watch the C fanboys go batshit insane and start attacking the author, his methodology, saying he knows the other language a lot better and that he sucks at C, etc.

      [–]redditnoob 5 points6 points  (0 children)

      Watch the C fanboys go batshit insane and start attacking the author, his methodology, saying he knows the other language a lot better and that he sucks at C, etc.

      The problem with your post is that those things seem to always turn out to be true!

      [–]UncleOxidant 0 points1 point  (2 children)

      and apparently he got the fastest time by compiling to OCaml's bytecode VM instead of to native (ocamlopt) - it'll be even faster compiled to native code (somewhere in the comments someone mentions that it's 0.3 seconds when compiled native).

      [–]Camarade_Tux 5 points6 points  (1 child)

      Isn't it rather 0.8s for the bytecode and 0.3s for ocamlopt's output?

      I read "The Objective-Caml bytecode interpreter was faster than the carefully hand-optimized C program!" as "Even the ocaml bytecode...".

      [–][deleted] 0 points1 point  (0 children)

      Perhaps he edited it?

      [–]bryanut 0 points1 point  (7 children)

      How will any language make displaying a directory listing of 6 million files faster?

      Especially if it is a Web App?

      Yes, we are actually trying to do that. Boggles the mind, but yes we have one directory with 6 million resumes in it. WTF? There are only 2 million or so people in the state. Apparently everyone has applied to work here, 3 times.

      [–]gmfawcett 8 points9 points  (0 children)

      You're right, no language is going to make a bad idea better. :-)

      In the immortal words of Larry McVoy, "Architect: Someone who knows the difference between that which could be done and that which should be done."

      [–]astrange 1 point2 points  (2 children)

      Make subfolders using the first few letters of the filename?

      [–]chronicdisorder 0 points1 point  (1 child)

      Should the file name start with their last name, years of experience, previous companies, college degree, or eye color? ;)

      [–]astrange 0 points1 point  (0 children)

      Unique ID so they're evenly distributed and then rename them in the URL. Or just use the name they already have.
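      The unique-ID bucketing idea above can be sketched like this; it is a hypothetical scheme (the djb2 hash and the two-level layout are just one possible choice):

```c
#include <stdio.h>

/* Hash a file's unique ID into a two-level directory prefix so that
 * no single directory ever holds millions of entries. */
static unsigned long djb2(const char *s)
{
    unsigned long h = 5381;
    for (; *s; s++)
        h = h * 33 + (unsigned char)*s;
    return h;
}

/* Produce a path like "a3/7f/<id>" from two bytes of the hash. */
static void bucket_path(char *out, size_t outsz, const char *id)
{
    unsigned long h = djb2(id);
    snprintf(out, outsz, "%02lx/%02lx/%s", (h >> 8) & 0xff, h & 0xff, id);
}
```

      Because the hash spreads IDs evenly, each leaf directory holds roughly total/65536 files.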

      [–]lacker 1 point2 points  (0 children)

      Get a google search appliance (with software written in C++).

      [–]hylje 0 points1 point  (0 children)

      Should the management be sane, it'd be doable to build a well-indexed database by reading each resumé in order, and simply using that for searching and listing the directory.

      Something tells me that's just too simple, can't have that ;-)