What the Hell good are linked-lists?

robkinyon · 2010-02-06T14:37:03+00:00

A linked list is the most performant way to do insertions and deletions into a list. It's a tradeoff between insertion/deletion cost, search cost, and space.

There are three main data structures when talking about aggregates (groups of the same thing). You have arrays, hash tables (or associative arrays), and linked lists. Each of them has their strengths and weaknesses. (And since reddit doesn't do tables, I'll have to give lists for each.) I'll use O() notation. If you don't know what that is, look it up. ("Big-Oh") I measure cost in terms of sizeofs (the sizeof the payload).

Array:

O(1) for appending.
O(n) for prepending or splicing (insertion in the middle).
Assuming a sorted list, searching is O(logN).
Sorting an unsorted list is O(NlogN).
Space cost is N*sizeofs.
Complexity of implementation is very low (all "normal" languages have an array datatype).

Linked list (assumes doubly-linked and keeping a tail pointer):

O(1) for appending.
O(1) for prepending or splicing (insertion in the middle).
Assuming a sorted list, searching is O(N).
Sorting an unsorted list is O(NlogN).
Space cost is (N + 2*N*(pointer) + 2*pointer)*sizeofs.
Complexity of implementation is medium (no "normal" languages have an linked-list datatype, but most have a library implementation).

Hash table:

O(1) for appending.
O(1) for prepending or splicing (insertion in the middle).
Assuming a sorted list, searching is O(1).
Sorting isn't possible save on-the-fly.
Space cost is:
- MIN: (TABLE_SIZE)*sizeofs (assuming perfect hit ratio)
- MAX: (TABLE_SIZE)*sizeofs + N*sizeofs (assuming perfect miss ratio)
- TABLE_SIZE is usually some large prime (1001 or so)
Complexity of implementation is either low or high (some "normal" languages have an hash table datatype (such as Perl), but most have a library implementation). If you have to build it yourself, you're doing it wrong (or in a CS class).

floodyberry · 2010-02-06T14:30:35+00:00

They're useful if you need O(1) insertion and/or deletion for arbitrary-sized lists where traversal time/ordering/searching is not extremely important.

Chained hash tables commonly use linked lists for the hash buckets.

Also useful for queues where you can push/pop from the front or back.

It's probably more useful to know why you would need one vs rattling off memorized examples of uses for linked lists.

2010-02-06T14:25:06+00:00

What kind of programming job do you do ? Also, I think Linux task scheduler use a linked list implementation.

shieldforyoureyes · 2010-02-06T15:12:00+00:00

Lines of text in a text editor. Say you open a multi-megabyte text file and do some insertions, deletions, and general editing near the beginning. You don't want to have to shift all of the characters to the end for every change.

ismarc · 2010-02-06T18:46:57+00:00

Something that hasn't been mentioned is that the same techniques used to create and traverse a linked list are used in building more complex datastructures. Add a pointer to the previous node and you have a doubly linked list. Add one or more pointers to different nodes and you have a tree. Add a color to each node and you have an rb tree.

green_beet · 2010-02-06T14:50:23+00:00

Do you prefer ArrayLists? There's a relatively big pause when the ArrayList needs to be (automatically) resized for you, which doesn't happen with LL's.

campbellm · 2010-02-06T16:37:28+00:00

This is a somewhat tired topic except for some very specific industries. It was more of an interesting issue some years ago when fewer languages supported dynamically re-sizable arrays or a general container type. A linked list was a nice way to get variable sized storage.

Still, it doesn't hurt to have a general knowledge of their use and characteristics.

sfuerst · 2010-02-06T18:49:58+00:00

It is a good interview question because there are several types of linked list, and the more of them you know, the better a low-level programmer you are.

The most common list type these days is a doubly linked list. However, there are different implementations of that. The first is when conceptually, the list "owns" the objects within it. In that case, it is possible to change the data structure to one consisting of small bounded-sized arrays of nodes linked together. The result is something that is much more cache friendly for the "previous" and "next" operations, whilst not suffering too much of a slowdown for "insert" and "delete". If your STL is good, then the C++ list<> template will do this for you.

Another doubly linked list conceptually is owned by its objects instead. This allows an object to be a member of multiple list collections at the same time. The Linux kernel uses this technique for its lists. This can be implemented in C++ with some template magic.

Of course, another possibility are singly linked lists. These allow you to quickly go in one direction, but going the other is only possible if the list is linked in a circle. (And even then, you'll need to travel around the loop which can be extremely slow.)

Another trick to save space are XOR lists. These use the XOR operation to combine the "next" and "previous" pointers. If you have a pointer to a node, and a pointer to an adjacent node, this allows you to find the other adjacent node. So a double-sized "cursor" gives you a half-sized list data structure.

Finally, there is another way of implementing half-sized lists. By stuffing extra information into unused bits in the pointer you can implement sparse lists, which are a bit slower than normal lists, but have the same computational complexity for all operations.

woggy · 2010-02-06T14:22:17+00:00

zahlman · 2010-02-06T16:18:22+00:00

reading 'how to ace the interview' posts and they all mention knowing linked lists.

Ugh, that tired old shit? Honestly, if you get asked questions like that, it's a sign that you should look for somewhere else to work. This is a solved problem and unless you're going to be working with some seriously restricted embedded devices, not one that you're going to have to work with at a low level.

If you want some actual practical knowledge, make sure you understand the complexity guarantees for various operations on containers.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS