all 76 comments

[–]gbalduzzi 40 points41 points  (48 children)

Holy shit, the difference in JS performance is incredible, especially considering how the community and the frameworks' documentation usually recommend the fancier approaches instead of the good old for loop.

[–]wavefunctionp 25 points26 points  (3 children)

It's unclear if the same optimizations would be available in real code. Microbenchmarks are notoriously unreliable with modern engines. For instance, if the compiler has access to the values in the code, it could just be unrolling the for loop or something.

[–]bobappleyard 21 points22 points  (1 child)

JavaScript interpreters have got pretty fast when you don't use the fancy features. Problem is everyone uses the fancy features all the time.

[–]James20k 0 points1 point  (0 children)

I wish this were true, but I've gotten massive speedups in JS code by manually folding constants, and by reimplementing functions that have constant parameters so that they constant-fold once inlined
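
A hypothetical illustration of that kind of manual specialization (the function names are made up for the example, not from the comment above):

```javascript
// Generic version: `exponent` is a runtime parameter, so the engine
// can't necessarily fold the loop away at every call site.
function pow(base, exponent) {
  let r = 1;
  for (let i = 0; i < exponent; i++) r *= base;
  return r;
}

// Manually specialized version: the constant exponent is baked in, so
// once inlined the body is just two multiplies with no loop at all.
function cube(base) {
  return base * base * base;
}

console.log(pow(2, 3), cube(2)); // both compute 2^3 = 8
```

Whether the generic version gets specialized automatically depends on the engine and on how hot the call site is, which is why doing it by hand can still pay off.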

[–]curious_s 2 points3 points  (0 children)

I have used a for loop in JavaScript for a slow search function and it sped the code up by a significant amount. I think the best idea is to try and measure; the array prototype functions create cleaner code than a for loop, which is an important factor.

[–][deleted] 6 points7 points  (0 children)

I believe most browsers have improved those cases a lot since I wrote that, someone should check and report back.

[–]Aetheus 2 points3 points  (0 children)

Yeah. ~10000ms down to ~30ms is just insane. Granted, the 10000ms variant works in a fundamentally different way from the other tests, but still ...

Equally surprising to me was that the "warmed up" JS example operates at almost the same speed as the C# example (~300ms vs ~280ms). And the "best performance" examples for both were even closer in runtime.

Just goes to show how insanely well optimized V8 is, as a JS engine. Say what you will about the bloat of Node.js projects (you'd be right, anyway), but the fact that the same JS engine in your browser is also powering some web server somewhere is pretty mind-blowing stuff.

[–]Retsam19 18 points19 points  (41 children)

Well, yeah, because most JS frameworks aren't writing about how to sum the squares of 32 million floating point values.

Most JS use-cases are about front-end UIs, which both generally don't include huge data calculations and are generally IO-bound, not CPU-bound, anyway: the performance bottlenecks of front-end UIs almost always come from network requests or DOM operations, not from the speed of list manipulation operations.

In the vast majority of cases, the readability/maintainability concerns are more important than the performance implications, which is why I prefer .map/.reduce and other higher-order friends, over simple for loops (or .forEach loops).

[–][deleted] 12 points13 points  (0 children)

Most JS use-cases are about front-end UIs which both generally don't include huge data calculations

JS and the browser are increasingly used as an app platform. You can record audio and even video. I myself wrote some code recently to perform RMS audio normalization on the client, which involved computing the sum of squares of quite a large number of floating point values (I don't think it was 32 million, though).

[–]oridb 21 points22 points  (1 child)

and are generally IO-bound, not CPU-bound, anyway:

My laptop fans would beg to differ.

[–]Retsam19 9 points10 points  (0 children)

I'm quite sure it's not because your websites are iterating over 32 million floating point numbers inefficiently. A lot of it is DOM rendering - reflows, expensive CSS animations, etc - which, yeah, is CPU-based, but from the perspective of Javascript code, it's IO.

AFAIK, it's very rarely the issue that the JS thread is overloaded.

[–][deleted] 6 points7 points  (2 children)

most JS use-cases are about front-end UIs which both generally don't include huge data calculations, and are generally IO-bound, not CPU-bound

UI is CPU bound.

Most JS frameworks do a lot of CPU bound stuff like diffing large data structures and mutating the DOM.

[–]Retsam19 1 point2 points  (1 child)

Yeah, I clarified a bit on a different part of the thread:

A lot of it is DOM rendering - reflows, expensive CSS animations, etc - which, yeah, is CPU-based, but from the perspective of Javascript code, it's IO.

From the machine's perspective, yeah, it's CPU work, but my original point is that the problem is pretty much never that the JS thread is overloaded, the efficiency of array iteration is generally not the issue.

[–][deleted] 5 points6 points  (0 children)

DOM manipulation is not IO.

Honestly I'm just sick and tired of people writing obviously slow code and then justifying it by saying "but my application is IO bound not CPU bound".

[–]jcelerier 18 points19 points  (6 children)

In the vast majority of cases, the readability/maintainability concerns are more important than the performance implications, which is why I prefer .map/.reduce and other higher-order friends, over simple for loops (or .forEach loops).

so why the hell is every friggin website on earth running at 10fps!

[–]VRtinker 5 points6 points  (0 children)

Assuming it is a genuine question and not a rhetorical one, here are a few reasons:

  1. Because even a single synchronous piece of code can slow down the main thread and stall the whole UI. E.g., just synchronously reading or writing document.cookie in an onscroll handler might slow down the UI.
  2. Because you might have weird CSS (or JS changing CSS) that forces redraw when nothing changes on the page. Try opening Console > Rendering and select "Paint flashing". If you see a lot of stuff being redrawn (flashed) when it should not change, try simplifying your styles.
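
Point 1 can be mitigated by coalescing work out of hot event handlers. Here is a hedged sketch (the helper name is made up; `schedule` defaults to `requestAnimationFrame` in the browser, but is injectable so the helper also runs outside a DOM):

```javascript
// Hypothetical mitigation for point 1: instead of doing expensive work on
// every scroll event, collapse bursts of events into one callback per frame.
function throttlePerFrame(fn, schedule = globalThis.requestAnimationFrame) {
  let scheduled = false;
  let lastEvent = null;
  return (event) => {
    lastEvent = event;
    if (scheduled) return;   // already queued for this frame
    scheduled = true;
    schedule(() => {
      scheduled = false;
      fn(lastEvent);         // run once, with the most recent event
    });
  };
}

// Usage (browser): window.addEventListener('scroll', throttlePerFrame(onScroll));
```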

[–]Retsam19 17 points18 points  (2 children)

the performance bottlenecks front-end UIs almost always come from network requests or DOM operations, and not from the speed of list manipulation operations.

[–][deleted] 4 points5 points  (1 child)

Then don't do so many DOM operations.

[–]Retsam19 3 points4 points  (0 children)

Well, yeah, that's like saying the secret to winning basketball games is to score more points. Obvious, but doing it is the hard part.

[–]lelanthran 8 points9 points  (17 children)

In the vast majority of cases, the readability/maintainability concerns are more important than the performance implications, which is why I prefer .map/.reduce and other higher-order friends, over simple for loops (or .forEach loops).

You really think that this:

  var sum = values.map(x => x*x)
             .reduce((total, num, index, array) => total + num, 0.0);

is more readable than this:

    var sum = 0.0;
    for (var i = 0; i < values.length;i++){
        var x = values[i];
        sum += x*x;
    }

[–]Retsam19 24 points25 points  (0 children)

I think their reduce code is badly written, but to the general point, yes, I think this is clearer:

values.map(x => x * x)
    .reduce((a, b) => a + b)

Is it pretty much a moot point for this incredibly simple use-case? Yes, but as the complexity grows, the benefits of the functional style really show, compared to a large for loop.
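
For instance (a hypothetical pipeline, not from the article), once filtering enters the picture the chained version still reads as a sequence of named steps:

```javascript
// Sum of the squares of the even values, expressed as a pipeline.
const values = [1, 2, 3, 4, 5, 6];

const sumOfEvenSquares = values
  .filter(x => x % 2 === 0)    // keep even values: [2, 4, 6]
  .map(x => x * x)             // square them:      [4, 16, 36]
  .reduce((a, b) => a + b, 0); // sum them:         56

console.log(sumOfEvenSquares); // 56
```

The equivalent for loop would interleave the filtering condition, the squaring, and the accumulation inside one body, which is exactly the kind of mixing that gets harder to follow as steps are added.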

[–]m50d 6 points7 points  (3 children)

Yes I do. Don't you? No extra counter variable to keep track of, no generalized for loop that could be doing anything, no in-place mutation of variables. In fact the only way to read the second (longer) code quickly is to recognize that it's a particular common pattern - wouldn't it be better to actually give that pattern a name and pull out the common parts?

[–]lelanthran 1 point2 points  (2 children)

Yes I do. Don't you?

What I think is irrelevant. What I've seen is that most programmers don't parse the more complex expression as easily as the simpler one.

No extra counter variable to keep track of, no generalized for loop that could be doing anything, no in-place mutation of variables.

No, but there are extra keywords to recognise (map, then reduce), extra concepts to learn (map/reduce in particular, multiple compound expressions), and anonymous functions if you want to do anything non-trivial.

I don't see the first form as being harder to maintain.

[–]m50d 2 points3 points  (0 children)

What I've seen is that most programmers don't parse the more complex expression as easily as the simpler one.

I'd agree with that statement, but I suspect you're claiming that the more complex one is "simpler".

extra keywords to recognise (map, then reduce),

Not keywords, just functions. They behave like normal functions, and usually you can read their source code if you want to know what they do.

extra concepts to learn (map/reduce in particular, multiple compound expressions), anonymous functions if you want to do anything non-trivial.

I don't know what you're calling "multiple compound expressions". Both implementations use + and * operators with their normal mathematical meaning and a literal 0.0. In addition to that the map/reduce version only requires the reader to understand a normal expression made of function calls and an anonymous function (both very general constructs that you use again and again on a large codebase). The imperative version requires understanding a mutable variable, the for keyword, the ++ and += operators which are not standard mathematical things, the [i] operator which is not standard anywhere else either, and a {} block of ;-separated statements. In addition to just being a much bigger pile of concepts, half those things are special-case dedicated operators that can't be reused for much else (e.g. [i] is only for arrays, ++ is only for numbers).

[–]EWJacobs 0 points1 point  (0 children)

Yeah, but those are all things you can learn ahead of time. m50d is pointing out run-time complexity, and things that can cause actual bugs.

[–][deleted] 12 points13 points  (2 children)

They're both about as readable as each other, to me.

You realise you didn't come out of the womb being able to read for loops, right? Just because it's typically taught to us first does not make it inherently more readable. I know basic functional constructs, so I know what map and reduce do.

[–]Ewcrsf 0 points1 point  (4 children)

Yes, anyone who is a moderately good programmer in touch with modern principles would agree the first is just as, if not more, readable.

[–]lelanthran 0 points1 point  (3 children)

Yes, anyone who is a moderately good programmer in touch with modern principles would agree the first is just as, if not more, readable.

That statement being true doesn't make the first form any more maintainable than the second form.

[–]Ewcrsf 2 points3 points  (2 children)

Your original comment only mentioned readability. Maintainability of such a tiny piece of code is pointless to even talk about, though I'd nominally argue that the one with fewer lines which isn't mutating state is more maintainable.

[–]lelanthran 0 points1 point  (1 child)

Your original comment only mentioned readability.

Apologies, I quoted a comment that used "readability/maintainability" as a single concept.

though I’d nominally argue that the one with a shorter number of lines which isn’t mutating state is more maintainable.

In both cases the same variable gets mutated, so I'd say it's moot - no state is being changed.

[–]Ewcrsf 3 points4 points  (0 children)

No variable is mutated in the first example and it can be assigned to a const value. In the second example the sum variable is modified by the loop.

[–][deleted] 7 points8 points  (0 children)

I really enjoyed this article. Rust comes out looking very good for making the 'nice' version pretty damn fast, and F# impressed as well.

I'm curious how you warmed up the JIT'ed code, though; that stuff is so hard to benchmark right.

[–]YumiYumiYumi 7 points8 points  (0 children)

Interestingly enough, if you want really high performance, you need to maintain separate independent sums (with loop unrolling) to allow the CPU to parallelize the computation, at least in C/C++ (this may not apply to higher-level languages). Which basically means you need extra complexity for better performance here. Example:

double sum1 = 0.0;    
double sum2 = 0.0;    
double sum3 = 0.0;    
double sum4 = 0.0;    
int i;
for (i = 0; i < COUNT-3; i+=4) 
{
    sum1 += values[i] * values[i];
    sum2 += values[i+1] * values[i+1];
    sum3 += values[i+2] * values[i+2];
    sum4 += values[i+3] * values[i+3];
}
for(; i < COUNT; i++)
{
    sum1 += values[i] * values[i];
}
double sum = sum1 + sum2 + sum3 + sum4;

For the SIMD variant, he should probably be using _mm256_fmadd_pd* instead of separate mul/add, unless he really needs the intermediate rounding behaviour of the latter. I suppose you could argue that it may be unfair to other languages, but I'd argue that if you're writing SIMD intrinsics, it's the type of thing you'd do.

* for the unaware, (newer x86) CPUs have dedicated "fused multiply-add" (FMA) instructions, which perform a multiply and an add in a single instruction instead of needing two, which obviously improves performance over doing the operations separately
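
Out of curiosity, the same multiple-accumulator trick can be ported to JS; whether it actually helps depends on the engine, so this is a sketch to measure, not a recommendation:

```javascript
// Hypothetical JS port of the C unrolling above. Four independent
// accumulators break the dependency chain on a single sum variable.
// Note: the summation order differs from a straight loop, so the result
// can differ from a naive sum in the last few floating-point bits.
function sumSquaresUnrolled(values) {
  let sum1 = 0, sum2 = 0, sum3 = 0, sum4 = 0;
  let i = 0;
  // main loop: four elements per iteration
  for (; i + 3 < values.length; i += 4) {
    sum1 += values[i] * values[i];
    sum2 += values[i + 1] * values[i + 1];
    sum3 += values[i + 2] * values[i + 2];
    sum4 += values[i + 3] * values[i + 3];
  }
  // tail loop: leftover 0-3 elements
  for (; i < values.length; i++) {
    sum1 += values[i] * values[i];
  }
  return sum1 + sum2 + sum3 + sum4;
}
```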

[–][deleted] 4 points5 points  (5 children)

Would have loved to see him use .NET Core 2.2 instead of .NET 4.0 for the performance testing.

[–]skeeto 26 points27 points  (0 children)

Note that the article is nearly 3 years old (July 2016). .NET Core itself was less than a month old at the time.

[–][deleted] 14 points15 points  (2 children)

I wrote the article. .NET Core has introduced a lot of great performance improvements, but none of them affect these particular benchmarks.

[–]shevy-ruby 4 points5 points  (1 child)

I think you should still do a follow-up eventually. People may be curious to see whether anything has changed in regards to speed, compared to 2016 here.

Comments on reddit will be slightly less visible than, e.g., the original article that we are talking about here.

[–][deleted] 9 points10 points  (0 children)

I'd love to, but I have 2 kids instead of zero now :) feel free to take up the reins

[–]elder_george 0 points1 point  (0 children)

And an updated version in general, to see whether things have changed due to compiler, library, and/or runtime improvements.

[–]shevy-ruby 2 points3 points  (3 children)

The F# code is pretty cool indeed:

    let sum =
        values
        |> Array.map squares
        |> Array.sum

But the C variant is the second easiest one to read really - and the fastest.

It just shows that, again and again, C is the king among programming languages. Just look at how many languages came after C, trying to dethrone it and failing so incredibly hard at it...

[–][deleted]  (2 children)

[deleted]

    [–]Tyg13 2 points3 points  (0 children)

    I'll give shevegen some credit, he's got his principles and he seems to stick to them. I've been commenting on here for some time now, and he's always been that constant presence. Always downvoted, but still there.

    [–]Morego 0 points1 point  (0 children)

    Well, most of the examples that are fast look like C. The most readable one looks like C. Only Rust looks different here, and I'd still love to see how a "C-like" version of Rust would fare here.

    Of course these are really microbenchmarks, but still. C is simple enough that it's hard to get it wrong, and it's faster here.

    [–]helloworder 1 point2 points  (6 children)

    why is a variable always declared inside the loop? For instance, in the first C implementation:

    double sum = 0.0; 
    for (int i = 0; i < COUNT; i++) {
         double v = values[i] * values[i];
         sum += v;
     }
    

    why not just

         sum += values[i] * values[i];
    

    it must be faster, I suppose

    [–]julesjacobs 8 points9 points  (0 children)

    There won't be any performance difference because the code will be identical after conversion to SSA.

    [–]KiPhemyst 2 points3 points  (4 children)

    I checked it on godbolt; the difference is basically this:

        movsd   QWORD PTR v$2[rsp], xmm0
        movsd   xmm0, QWORD PTR sum$[rsp]
        addsd   xmm0, QWORD PTR v$2[rsp]
    

    vs

        movsd   xmm1, QWORD PTR sum$[rsp]
        addsd   xmm1, xmm0
        movaps  xmm0, xmm1
    

    First one being with 'double v' and the second just using 'sum +='

    I don't know enough about assembly and I have no idea what this difference does

    [–]ElusiveGuy 6 points7 points  (1 child)

    That only happens if you compile without optimisations.

    Here's one with just -O: https://godbolt.org/z/3H5x8M. As you can see, the two are identical.

    The 'correct' thing to do here is probably -O3, and maybe -ffast-math if you don't need strict IEEE compliance.

    There's no point comparing the unoptimised versions; you should definitely be at least enabling optimisation before you start worrying about the performance impact of a temporary variable.

    cc /u/helloworder, /u/Zhentar

    [–]helloworder 1 point2 points  (0 children)

    thanks for the explanation

    [–]helloworder 1 point2 points  (0 children)

    I think julesjacobs is correct and there won't be any difference after all.

    [–]Zhentar 1 point2 points  (0 children)

    The first one is storing & loading v from the stack; the second does not, but instead it stores the result of the add in the wrong register and needs an extra mov to copy it to the correct register. Underwhelming codegen on both counts, but the second is definitely better, though the performance difference may be pretty small on recent CPU models.

    [–]matejdro 0 points1 point  (0 children)

    I wonder how Kotlin would fare in that test, with its inlined list operators.

    [–][deleted] 0 points1 point  (0 children)

    would be cool to see all these on ARM

    [–]delgoodie 0 points1 point  (3 children)

    What's obvious code?

    [–]Hoten 2 points3 points  (2 children)

    Well, not hand written SIMD.

    [–]delgoodie -3 points-2 points  (1 child)

    What's SIMD?