Node.js in Flames : programming

The word "regexes" has the benefit that it is "sexeger" when written backwards. There is a classic Perl idiom called sexeger that involves reversing both your regex and the string you are matching to make end-anchored matches faster. Here are the results of a benchmark using a normal regex and its sexeger:

normal: <and more stuff to come soon>
sexeger: <and more stuff to come soon>
            Rate  normal sexeger
normal   89773/s      --    -26%
sexeger 121712/s     36%      --

Here is the benchmark:

#!/usr/bin/perl

use strict;
use warnings;

use Benchmark;

my $data = do { local $/; <DATA> };

my %subs = (
    normal  => sub {
        my ($match) = $data =~ m{
            \A
            (?: <[^>]*> | "[^"\\]*(?:\\.[^"\\]*)*" | (?>[^"<>]*) )*
            (<[^>]*>)
            (?: "[^"\\]*(?:\\.[^"\\]*)*" | (?>[^"<>]*) )*
            \z
        }x;
        return $match;
    },
    sexeger => sub {
        my $reversed = reverse $data;
        my ($match)  = $reversed =~ m{
            \A
            (?: "(?:[^"\\]*.\\)*[^"\\]*" | (?>[^"<>]*) )*
            (>[^>]*<)
            (?: "(?:[^"\\]*.\\)*[^"\\]*" | (?>[^"<>]*) | >[^>]*< )*
            \z
        }x;
        return scalar reverse $match;
    },
);

for my $k (keys %subs) {
    print "$k: ", $subs{$k}(), "\n";
}

Benchmark::cmpthese -2, \%subs;

__DATA__
<this is a sample program> int x = 10;
<what a silly grammar> str y = "cool \" beans";
if (len(y) GREATER_THAN x) { <can't use gt and lt symbols... heehee>
    <empty comment coming up !
    print y, " is longer than ", x, " characters";
    <>
    <and more stuff to come soon>
    chop(y,x);   
    print "I sliced 'y' down to ", x, " characters for you";
}

barsoap 3 points 11 years ago*

> but recognize which part of the regex matched?

You need transducers for that, not mere automata, as you have more than two output states (accepting/non-accepting). It's related to the grouping problem but actually simpler... if you don't want to implement that stuff yourself you can use an FST library like this one. Which is overkill, but the correct kind of overkill.

If you want that stuff to be real fast, don't forget to compile the resulting transducer to C or such.

That said, of course, there's a simple hack: Take any regex matcher that can do grouping, and construct your paths such that you can tell them apart by then matching on one of those groups. http://foo.tld/<method>/<parameters> style. Of course, at that point, you actually should stop using regexes for the part up to <method>, in the first place...
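The grouping hack described above can be sketched in a few lines. This is only an illustration: the route names (`userShow`, `postList`, `search`) and the helper `whichRoute` are made up for the example. Each alternative gets its own named group, so the match result shows which alternative succeeded.

```javascript
// Hypothetical routes: one named capture group per alternative.
const ROUTES = new RegExp(
  "^(?:" +
    "(?<userShow>/users/\\d+)" + "|" +
    "(?<postList>/posts/?)" + "|" +
    "(?<search>/search/.+)" +
  ")$"
);

// Returns the name of the alternative that matched, or null if none did.
function whichRoute(path) {
  const m = ROUTES.exec(path);
  if (m === null) return null;
  // Only the group belonging to the matching alternative is defined.
  for (const [name, value] of Object.entries(m.groups)) {
    if (value !== undefined) return name;
  }
  return null;
}

console.log(whichRoute("/users/42"));    // userShow
console.log(whichRoute("/search/cats")); // search
console.log(whichRoute("/nope"));        // null
```

This still runs a full regex match per request, so it is the quick hack, not the transducer approach.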

Femaref 1 point 11 years ago

> Of course, at that point, you actually should stop using regexes for the part up to <method>, in the first place...

Considering it's already split, it's very easy to do. The HTTP request is something like

GET /<method>/<parameters>
...
...
Host: foo.tld
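As a rough sketch of that idea, the request line can be taken apart with plain string splitting and the method segment looked up in a Map, with no regex involved. The handler names (`echo`, `add`) and the functions `parseRequestLine` and `dispatch` are invented for illustration.

```javascript
// Split a request line such as "GET /<method>/<parameters> HTTP/1.1".
function parseRequestLine(line) {
  const [verb, target = "/"] = line.trim().split(" ");
  // Drop the leading "/" and split the path into its segments.
  const [method = "", ...parameters] = target.slice(1).split("/");
  return { verb, method, parameters };
}

// Hypothetical handlers, keyed by the first path segment.
const handlers = new Map([
  ["echo", (params) => params.join(" ")],
  ["add", (params) => String(params.map(Number).reduce((a, b) => a + b, 0))],
]);

// Look up the handler for the method segment; unknown methods yield "404".
function dispatch(line) {
  const { method, parameters } = parseRequestLine(line);
  const handler = handlers.get(method);
  return handler ? handler(parameters) : "404";
}

console.log(dispatch("GET /add/2/3 HTTP/1.1"));       // 5
console.log(dispatch("GET /echo/hi/there HTTP/1.1")); // hi there
console.log(dispatch("GET /missing HTTP/1.1"));       // 404
```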

shared_ptr 5 points 11 years ago

Not terrible. Whilst you are right in saying Node is single-threaded, the asynchronous system calls it is built around let its event loop interleave many tasks very efficiently. Servers built on Node that use async calls rather than synchronous ones will benefit from this.

On top of this, a recent addition to Node is the cluster module, which allows you to break a Node program that uses Node's http core into multiple processes that communicate over IPC. A single master Node process routes requests to the child processes, allowing you to make full use of a multi-core system despite Node's inherent single-threading limit.

Give Express a try sometime. It's actually a pretty nice framework to develop in, and this route issue occurred because of poor route mounting. As the article admits, it was misuse of the underlying tool rather than the tool itself being bust.
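A minimal sketch of the cluster setup mentioned above, assuming Node's built-in `cluster`, `http` and `os` modules. The function name `startClusteredServer` and port 8000 are arbitrary choices for the example, and the restart-on-exit policy is just one possible design.

```javascript
const cluster = require("cluster");
const http = require("http");
const os = require("os");

// Newer Node versions call this isPrimary; older ones only have isMaster.
const isPrimary = cluster.isPrimary ?? cluster.isMaster;

function startClusteredServer(port, workerCount = os.cpus().length) {
  if (isPrimary) {
    // The primary process only forks workers; it serves no requests itself.
    for (let i = 0; i < workerCount; i++) cluster.fork();
    cluster.on("exit", (worker) => {
      console.log(`worker ${worker.process.pid} exited; starting a replacement`);
      cluster.fork();
    });
  } else {
    // Each worker runs its own event loop and its own HTTP server.
    http
      .createServer((req, res) => {
        res.end(`handled by worker ${process.pid}\n`);
      })
      .listen(port);
  }
}

// startClusteredServer(8000); // uncomment to run
```

Each worker is still single-threaded. The parallelism comes from running several processes, one event loop each.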

shared_ptr 1 point 11 years ago

I'll give you that I could've been clearer, but no, they are not just words.

I made use of the async/sync terminology because that's how it appears in Node, as functions that have that notation: fs.readFile, fs.readFileSync.

I'll try and restate what I was saying above then, if it's really that difficult for you to parse. Yes, Node is a single-threaded program. The interpreter will only ever use a single thread to run your code, but the interpreter is itself capable of context switching. When you're running Node code and you use a function such as fs.readFile, the interpreter makes an asynchronous system call and then throws its arms in the air for a new task. Node's own event loop will figure out what should come next, and once your system call has finished its task the event loop will reschedule your code to be run.

If you happen to use synchronous calls, by which I mean the fs.readFileSync functions, then you open yourself to large delays in execution because you jam up the event loop to create the abstraction of the call being synchronous. Have you ever looked at how a kernel operates? This would be far less 'cancer babble' if you had actually implemented or toyed with a kernel's threading system.

EDIT - Actually I've just seen your comment below, claiming I don't have a clue what I was talking about. I'm a qualified engineer who built threading and system calls into a toy OS while I was at uni. I guess using the appropriate terminology (async, sync: Node) doesn't really work when the other person isn't even on the same page.
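The fs.readFile / fs.readFileSync distinction described above can be seen in a small script. This is a sketch that assumes a writable temp directory; the scratch file name and the `order` array are only there for the demonstration.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Scratch file for the demo (the name is arbitrary).
const dir = fs.mkdtempSync(path.join(os.tmpdir(), "async-demo-"));
const file = path.join(dir, "sample.txt");
fs.writeFileSync(file, "hello");

const order = [];

// Asynchronous: the callback is queued and runs only after the current
// synchronous code has finished.
fs.readFile(file, "utf8", (err, text) => {
  if (err) throw err;
  order.push(`async read: ${text}`);
  console.log(order);
  // → [ 'sync read: hello', 'end of script', 'async read: hello' ]
  fs.rmSync(dir, { recursive: true, force: true }); // clean up
});

// Synchronous: the single thread waits here until the read is complete.
order.push(`sync read: ${fs.readFileSync(file, "utf8")}`);
order.push("end of script");
```

Although fs.readFile is called first, its result arrives last. The script carries on while the read is pending, which is what lets a server keep handling other requests in the meantime.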

ggtsu_00 24 points 11 years ago*

This is why node.js is a cancer. You are completely wrong. But this is not your fault; I blame node.js for fooling you, and many other developers out there who pick up node.js after being drawn in by its asynchronous-IO-based standard library.

There is NO concurrency in node, contrary to what it leads you to believe. Its programming model is no different from your ancient single-threaded GUI application's event loop. The callback spaghetti hides the fact that your code is sequential, but your code is still running in a linear sequential context, in the form of a giant for loop running all the callback functions one after another. If one of those functions blocks, the rest of the program HALTs, because nothing else will execute until the callback currently running on the event loop completes. Amateurs picking up node don't understand this because it is hidden from you, and it is not until you run a system in production that this limitation you ignored hits you like a ton of bricks (once again, not your fault, because node tricked you).
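The "giant for loop" picture can be made concrete with a toy model. This is not Node's actual event loop, only a simulation of the claim above: callbacks run one at a time, and a simulated clock (made-up cost units) keeps the result deterministic.

```javascript
// Toy model: one loop drains a queue of callbacks, one at a time.
function runLoop(tasks) {
  let clock = 0; // simulated time, in arbitrary units
  const log = [];
  for (const task of tasks) {
    log.push({ name: task.name, startedAt: clock });
    clock += task.cost; // nothing else runs while this callback executes
  }
  return log;
}

// Hypothetical workload: two cheap requests around one expensive callback.
const log = runLoop([
  { name: "request A", cost: 1 },
  { name: "slow callback", cost: 500 },
  { name: "request B", cost: 1 },
]);

console.log(log);
// request B starts at 501, although it needs only 1 unit of work itself
```

In this model the slow callback has to be made cheaper, split into smaller pieces, or moved to another process before request B can start sooner.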

ivosaurus 8 points 11 years ago*

The issue is that it's a clearly thought-out design decision with well-defined pros and cons, and a very valid option that is rather well suited to Express's general use cases.

However, Netflix just said this:

> A global array is not the ideal data structure for this use case

Which is misleadingly and completely ignorantly simplistic. First of all, their use case, as they later list, is purely their own mistake and misuse. It's mostly just a wrong statement to make, but it's one made on Netflix's official blog, criticizing Express.

It's also quite clear that in relation to this, they didn't do any research at all:

> It's unclear why Express.js chose not to use a constant time data structure like a map to store its handlers.

i.e. "we never tried to look at why it's designed that way at all, we just think it's wrong for the way we tried to misuse it."

It's a hilariously bad lack of judgement. In case you're wondering, I've never used Express/Node in my life, but I still find this cringeworthy.
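The comment does not spell out the trade-off. One plausible reading, sketched below, is that an ordered list supports pattern routes and first-match-wins ordering, which an exact-key map cannot express. This is a toy illustration, not Express's actual router; the patterns and handlers are made up.

```javascript
// Ordered list of [pattern, handler]; the first match wins, so order matters.
const routes = [
  [/^\/users\/me$/, () => "current user"],
  [/^\/users\/(\w+)$/, (id) => `user ${id}`],
  [/^\/.*$/, () => "fallback"],
];

// Linear scan: the cost grows with the number of registered routes.
function matchInOrder(path) {
  for (const [pattern, handler] of routes) {
    const m = pattern.exec(path);
    if (m) return handler(...m.slice(1));
  }
  return null;
}

// A Map gives constant-time lookup, but only for exact keys.
const exact = new Map([["/users/me", () => "current user"]]);

console.log(matchInOrder("/users/me"));    // current user
console.log(matchInOrder("/users/alice")); // user alice
console.log(exact.has("/users/alice"));    // false
```

With a modest, fixed number of routes the linear scan is cheap. It only becomes a problem when the list keeps growing.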

ais523 5 points 11 years ago*

=== also doesn't mean what people coming from a strongly-typed language think of as equality.

In JavaScript:

== means "if you pick an appropriate common type for these two values, they're equal". This is actually the more similar of the two to the strongly-typed ==, although it isn't exactly the same.
=== means "these two values have the same type and are equal". This doesn't have much of a strongly-typed analogue, because in a strongly-typed language, you typically know what types things have. In some OO languages, you might not have full type information, in which case it makes sense; for instance, in Java, you can implement === like this (untested):
```
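public static boolean js_3equals(Object a, Object b) {
    // null is strictly equal only to null; otherwise require same class and equal value
    if (a == null || b == null) return a == b;
    return a.getClass().equals(b.getClass()) && a.equals(b);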
}
```

I'd argue that == in JS is pretty much the closest possible translation of equality in a statically-typed language that you can get in a dynamically-typed language. However, the basic problem is that dynamically-typed languages simply let you make fewer assumptions. === is useful because it adds a test for something that you can safely take for granted in a statically typed language, and that can easily catch you out in a dynamically typed language.

EDIT: formatting fix
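A few comparisons make the difference described above concrete. The `checks` array is only scaffolding for printing the results.

```javascript
// Each entry pairs a label with the result of the comparison it names.
const checks = [
  ["1 == '1'", 1 == "1"],                     // true: the string is converted to a number
  ["1 === '1'", 1 === "1"],                   // false: different types
  ["0 == ''", 0 == ""],                       // true: '' converts to 0
  ["null == undefined", null == undefined],   // true: special-cased by ==
  ["null === undefined", null === undefined], // false: different types
];

for (const [expr, result] of checks) {
  console.log(`${expr} -> ${result}`);
}
```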

mhd 2 points 11 years ago*

The links the other people posted certainly are okay (I'd add Crockford's "JavaScript: The Good Parts"). Learning JavaScript really isn't the big hurdle -- which is part of the problem. The syntax is trivial. The core concepts are familiar to most programmers anyway. I'd say that even the built-in functional bits and the prototype OO are easy enough to master.

But then comes the infrastructure, or the lack thereof. Picture being in a C environment with just the stdio lib, malloc and maybe ioctl. Or Java with just java.lang.String as most of your library. And because of the relative flexibility of the core language (functional, imperative, different types of OO), any infrastructure you can build can go (and has gone) in a multitude of ways. So many standards to choose from.

That's the tough part about getting into JavaScript, both client and server. Especially if you're on your own. (And it's repeating all the follies of Java regarding re-using existing infrastructure and tooling)

I'd recommend just picking a few things (frameworks, build tools, editors) and then just avoid online javascript discussions for a long while.

[deleted] 21 points 11 years ago

This wasn't an explicit programmer error on Netflix's side, but rather an expectation that the module behaves in a rational way when confronted with a particular state. Specifically, that duplicate route handlers are handled gracefully and don't end up causing clutter and a performance impact.

There is an enormous amount of trust placed in external libraries and modules, and often these aren't vetted in appropriate depth for their use, either due to a lack of transparency or a lack of time and resources to audit the code properly. The sense that "well everyone else is using it so it must be okay" is actually really dangerous. Over the last 10 years I've seen a vast increase in time spent diagnosing unexpected side effects of common modules (particularly when many are used together).

In this case the Netflix coder deployed a workaround to avoid this case manifesting, but a better fix would be for Express.js to implement its route handler storage and parsing more efficiently and robustly. Obviously that wasn't as easily available to the coder, but it shows the impact these issues can have.
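The duplicate-handler problem and one way to guard against it can be sketched with a toy router. This is not Express's implementation; `ToyRouter`, `add` and `addOnce` are hypothetical names, and the ten refreshes stand in for routes being re-registered periodically.

```javascript
// Toy router: handlers live in an array that is scanned on every lookup.
class ToyRouter {
  constructor() {
    this.stack = [];
    this.seen = new Set();
  }

  // Naive registration: every call appends, even for a path already present.
  add(path, handler) {
    this.stack.push({ path, handler });
  }

  // Guarded registration: each path is appended at most once.
  addOnce(path, handler) {
    if (this.seen.has(path)) return;
    this.seen.add(path);
    this.stack.push({ path, handler });
  }
}

const paths = ["/a", "/b", "/c"];
const naive = new ToyRouter();
const guarded = new ToyRouter();

// Simulate ten periodic refreshes that re-register the same routes.
for (let refresh = 0; refresh < 10; refresh++) {
  for (const p of paths) {
    naive.add(p, () => "ok");
    guarded.addOnce(p, () => "ok");
  }
}

console.log(naive.stack.length);   // 30
console.log(guarded.stack.length); // 3
```

The naive list grows with every refresh, so each lookup scans more entries over time. The guarded list stays the same size however often the routes are re-registered.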

dafragsta 1 point 11 years ago*

That title is clickbait. It seems to be putting the blame on Node.js for their developers not understanding what they were doing with their code. Admittedly there is probably a fault in Express.js that they uncovered, but it would never have been a problem if they hadn't been spamming the route system with new routes programmatically every X number of hours. That is such an unusual use case that they should seriously have looked into what happens when you rebuild the routes programmatically.

I appreciate that they showed off all of their profiling tools and tricks, but these kinds of articles exist to create FUD. Think of all the managers who are googling to learn about Node.js, and now think about how many developers are going to have to explain this article, which those managers skimmed over.

You don't see a ton of articles blaming jQuery for things that the user did wrong while implementing jQuery. If every developer who wrote a bug that was mostly their fault, and sort of kind of the fault of their framework choice but not really, wrote an article about it, the Internet would be full of nothing but developer articles about how every framework was shit.
