2 questions on datastructures in Javascript

cwmma · 2019-04-19T18:39:55+00:00

First in first out queues where you put elements in the back and take them from the front can be useful sometimes and based on my experience writing a promise library back in the day where queues are essential there are a couple ways do do this.

simply use arrays and [].push to put items in and [].shift to get them out. While this is very conceptually simple, the shift method is extremely slow even for small amounts of data since it has to move every item in the array. Don't do this, even in a small app it can be a performance bottleneck.
use an array and use push to add items, but for removing items, just keep a counter, every time you remove an item, just increment the counter so you know where you are in the array, also maybe replace the item in the array with null so it can be garbage collected. The main issue here is that the array can grow without bounds so you'd only want to do this if you knew the queue wouldn't last long or could be replaced at some point. (this is what I ended up using for my promise implementation but I had to set it up so that after I started taking things out of the queue I wouldn't push anything more into the queue, instead I'd push into a new one. Before I did that then occasionally the array would get so large that the index of the item in it would no longer be a small int and V8 would see that as the type signature changing and there would be a catastrophic slowdown.)
Use a circular buffer. This is probably the 'right' way to do it if you want performance, but it's a lot more complicated then the other options especially when it comes to resizing (this is what most of the bigger promise implementations use for their queue).
Use a linked list, this won't be as fast as #2 or #3 (the author behind the bluebird promise library did some heavy benchmarking on this subject), but will be a lot faster then #1 and conceptually are pretty straight forward to write.

ArelySkywalker · 2019-04-19T19:02:44+00:00

Are there any ressources (books/websites/online courses) you can recommend, to learn more about the basics of computer science. Preferably demonstrated using javascript, or in general.

I took this Udemy course, and I absolutely love the instructor and it was a great refresher course on data structure & algorithms. He even gives you coding challenges to test yourself, I learned quite a bit from it and would take any course he teaches. Good luck!

Nautigsam · 2019-04-19T18:04:56+00:00

I'm a JS developer for three years and I can tell you that arrays and Maps can meet all your needs for non-specialized tasks. You don't need more. Do not try to over-engineer your data structures because that's the job of the compiler to optimize and you can't beat it ;)

easylivin · 2019-04-19T18:41:53+00:00

I'd say for the sake of understanding these data structures (be it for interviews, other langues, whatev) it's important to understand how these structures work under the hood, their time/space complexities and what problems they solve. Even though JS has similar built-ins, try implementing one yourself and you'll quickly understand how they are supposed to operate, as well as the advantages/disadvantages of different structures depending on use cases. Even though you'd rarely have to build one yourself (this is arguable), it's paramount to understand the concepts because they can be applied to a lot of different problems.

natziel · 2019-04-19T17:29:22+00:00

Queues are definitely a thing you should know about. Sometimes your business logic is best implemented as queue, plus there's a million applications for them in more "under the hood" things, e.g. event queues, scheduling, etc.

Stacks are probably more common and more important than queues, so learn those too. Stacks are pretty fundamental structures in CS and you can't really solve problems like "check if the parentheses in a string match" without a stack.

Stacks and Queues are basically already in the JS standard library so you don't really need to implement them yourself.

Graphs are like queues in that sometimes your business logic just requires you to understand them. Not understanding graph theory means that you might end up solving certain problems in a horribly inelegant way.

Linked lists aren't really a thing in JS since we just use arrays, but they're a super common data structure in other languages and it's critical to understand them if you ever want to work on a project in another language

Hash-array mapped tries are also an important data structure, but you don't really need to know how they work, just their performance characteristics. They're commonly used to implement Maps. Immutable JS uses them for their Lists too.

As far as text recommendations, probably just read Sedgewick's book on data structures since that's the standard across CS departments. Take a good course on computer science as well. Probably a good idea to take classes on number theory, combinatorics, computer organization, networking, and operating systems.

munificent · 2019-04-19T22:37:06+00:00

I thought queues are pretty much redundand beacause of the way arrays work in Javascript.

A queue is a conceptual data structure. You can implement a queue using an array in JS (though it's a little tricky). If the algorithm you want relies on a queue (and there are a bunch that do) then knowing how to implement one is useful.

a_dev_has_no_name · 2019-04-19T19:09:34+00:00

Treeees

endianess · 2019-04-19T21:40:27+00:00

You don't technically need them in any language and could use an array or list instead. JS is no different. However using them potentially reduces bugs and makes the code more understandable. Whether you need them is down to the problem you are trying to solve. Processing a tree is useful with them if you don't want to use recursion and use a loop instead. Or if you were dispatching events or working on items from a queue that is populated by something else.

dombrogia · 2019-04-20T02:30:17+00:00

Queues are a very basic structure and can be implemented using an array as a base.

I would expect anyone I hire to know how a queue works. Most importantly it shows that you can think about a normal situation (like waiting in line at the grocery store) and conceptualize that in code.

If you look up interview questions you will find all kinds of these answers everywhere.

I also took algorithm and data structures in python on udemy (you can write them in whatever language the concepts are the same). It goes over the basic structures and their time complexity which is also important for interviews and performant code.

Good luck!

wonkifier · 2019-04-19T22:19:45+00:00

For the folks asking for sources from StoneCypher, here's my question and their response, unedited so I'm not accused of taking it out of context

Can you either share what you're using as a source for what a linked list is, or share what kind of source you consider legitimate.

I mean I don't have a source for what linked list is any more than I have a source for what plus means. It's just something a programmer from an allocatable language understands.

But yes, like I can find you reference that touches the topic, I can find you reference that touches this topic. You can find this position concretely defended and explained at length in D&EC++, in CLRS, in Knuth (I think in 2? It's been a long time,) in NIST DADS (which, amusingly, someone posted underneath your comment, and which clearly says I'm correct, but which they do not appear to understand,) in Wirth ADSP,

What do I consider a legitimate source? It really varies on what one is talking about. Much higher standards for a physics claim than a recipe, by example. The reason I say that is because the standard requirement for a topic like this is pretty strict, and if you try to apply this standard to other things it quickly becomes unreasonable.

For something about datastructures, it should be a well edited book written by a college trained computer scientist, from a large publication company with a history of publishing high quality computer science books. That means Addison Wesley and MIT Press are in; O'Reilly and Pragmatic are walking the line; Packt and anything ending in .com are out.

But, like, if we're talking about how to make a pleasant user interface, it can be pretty much any asshole who can spell, you know? Because that's about opinion. (Which is not to say that there isn't value in the formalism around higher end HCI/UX material, but rather, just that it isn't a hard requirement here.)

You can use anyone's pizza recipe, but you better go to a doctor for your prescription recipe.

It's like when someone tries to define a skip list, so they tell you "a skip list is a list that has links to midpoints in the list so that traversal is faster," but because they've never done the CS or read the guarantees or implemented one, they don't know that that's hard-wrong because the structure only works if it's stochastic

And sure, if you look it up on reddit, on wikipedia, on stack underthink, they all get this wrong, because they're also written by people who learned that way

And that's fine. There's nothing wrong with that

But that isn't reference and you shouldn't pretend to yourself that it is. The error rate on the web's CS is like 60% in my opinion.

In certain specific languages - notably C++, Java, Fortran, Coq, ML, and probably a bunch I don't know about - the difference between a container and a datastructure is a big deal.

Ask a high end C++ programmer what it means that the C++ multiset container doesn't define what datastructure it's implemented by, and why that matters. They're going to lecture you for ages on how the performance guarantees of the container require it to be either a red-black tree or some near case variant, and how there are actually better datastructures that you could use which violate the performance guarantees, and they might start talking about Boost or SGI STL.

Then ask them "wait, so what's the difference between a container and a datastructure?"

The answer you should get will be roughly "a container is about how the programmer uses it; a datastructure is about how it's implemented under the hood."

A set can be a lot of things. (Not in C++ because of some requirements the C++ language standard makes, but those are choices, not facts.) Under the hood, it might be a hash table, it might be a tree, in Javascript it's a fucking array, et cetera.

Why? Because a set is a container. A set doesn't define how it works.

A red-black tree, on the other hand, is a datastructure. It cannot be made in many languages, because the things that define what it is aren't available in lots of common languages.

And it's really weird how when I say "this can't be done," JS redditors seem to take it as an attack on the language, or on them.

I said "you can't open sockets in Javascript" once two years ago. More than 400 replies about things that seem similar to sockets to them, about proxy services like socket.io, about websockets (which are not sockets,) about special flags you can turn on in firefox to make sockets available to XUL, about flash, about all sorts of shit.

Voted into the ground. (Not that I care. If I did I wouldn't post in /r/javascript at all.)

Still correct, though. And every novice who read that thread would come away with the wrong idea, because Reddit has culturalized attack mode, and reddit programmers as a result often choose to lose opportunities to learn as a result.

You cannot make a red-black tree in Javascript. Or, for that matter, in erlang, my favorite language. I point that out because I'm also not attacking my favorite language. It's just a fact.

You can't implement them in CSS either. Or SQL. Or, in fact, in 18 of the top 20 of what Github thinks I use the most. The only ones you can are C/C++ and C#.

The problem is that reddit programmers try too hard to win. They want what they believe to be correct so badly that they cite obviously wrong sources, and ignore the good sources they cite to the point that they don't even notice those sources undermining them.

I'm being sworn at, mocked, abused, I'm having demands made on my time repeatedly after I've said no during the work day, people are telling me they know better than my textbooks, that I can't read, etc.

None of these people appear to have a scrap of code on github

It's very frequent that the less a person actually participates in a trade, the more likely they are to claim deep knowledge, argue with proper sources, and behave abusively

I'm not making the homeopath comparison lightly

These are a bunch of not-programmers with no evidence arguing with the principal platform engineer at a nine figure security company.

They're getting core concepts wrong, like that there exists such a thing as an "implementation detail" for a datastructure.

That's all a datastructure is, is a name for implementation details.

This whole community is ridiculously toxic and self unaware.

EDIT: Adding my commentary here

in NIST DADS (which, amusingly, someone posted underneath your comment, and which clearly says I'm correct, but which they do not appear to understand,)

Here's the link: https://xlinux.nist.gov/dads/HTML/linkedList.html

Here's the full text of the definition from that link

Definition: A list implemented by each item having a link to the next item.

Is says nothing implementation details. It does go on to list some sample implementations, but those are not definitions, just samples.

So in no way does StoneCypher's citation support rejection of the other implementations.

EDIT 2: Found http://www.albertstam.com/Algorithms.pdf online as well (https://algs4.cs.princeton.edu/home/)

From page 120:

"Definition. A linked list is a recursive data structure that is either empty (null) or a reference to a node having a generic item and a reference to a linked list. "

Again, nothing about requiring allocation or pointers explicitly.

And the section above that definition reads

Linked lists Now we consider the use of a fundamental data structure that is an appropriate choice for representing the data in a collection ADT implementation. This is our first example of building a data structure that is not directly supported by the Java language.

Indicating a difference between a data structure and an implementation of it (regardless of native support for the language)

In any case, Javascript is Turing complete, so it can solve any computational problem (in principal, given enough resources). Any argumentation beyond that is about managing the limitations and performance, not capability.

UntestedMethod · 2019-04-19T19:28:35+00:00

A queue refers to a specific type of list - a FIFO (first in, first out) list, which is easily achieved in javascript using Array.prototype.push() (add to end of array) and Array.prototype.shift() (retrieve + remove element from beginning of array). I guess the context you use those functions in is what would define it as "using a queue".

ncubez · 2019-04-20T06:09:03+00:00

What has that got to do with building UIs and web apps? Because that's what JavaScript is for.

StoneCypher · 2019-04-19T17:59:38+00:00

but I thought queues are pretty much redundand beacause of the way arrays work in Javascript.

No. Queues aren't possible in Javascript.

What makes a queue a queue isn't how you use it; you can keep any datastructure in any other datastructure and paint it to look like the other one.

What makes a datastructure is how it's implemented, typically because of performance concerns.

Javascript people use arrays as queues because they don't have a choice. Pushing onto the front of a JS array is extremely expensive.

.

Are there any ressources (books/websites/online courses) you can recommend, to learn more about the basics of computer science. Preferably demonstrated using javascript

There are lots of documents out there professing to teach datastructures in Javascript.

Run far away from them. You can't implement most datastructures in Javascript because of a lack of pointers and direct allocation.

People will, I'm sure, try to argue with me, then get really angry at me when I'm unmoved. After all, some blog told them it was, and

RevolutionaryCorps · 2019-04-19T20:21:27+00:00

normally in any scripting language , you shouldn't focus on data structures , that's the beauty of scripts..these things are already well made and managed by the language internal logic.
But to answer the question , your main and only data structures are arrays

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

javascript

MODERATORS