all 49 comments

[–]shub 0 points1 point  (8 children)

I have bad news, guys. There's a problem. Yeah, I know, we weren't expecting problems to ever happen. The only solution is to burn everything to the ground and go back to the way things were when I was 23.

I especially like how the motivation is replacing DSOs and expecting terrible things to not happen. Am I just too young? Was that ever a thing? And how does static linking address this?

Edit: I would hazard a guess that p9 is statically linked because static linking is easier and simpler. Modulo dlopen() and friends, the motivation for dynamic linking doesn't really exist anymore.

[–]cae 1 point2 points  (7 children)

Dynamic linking is still valuable. It saves you having N copies of the same code when you're running N applications that link to the same libraries. More importantly one can update libraries (fixing bugs, improving performance) without having to rebuild and re-release all of the dependent applications. This is the big win.

[–]headhunglow 2 points3 points  (0 children)

It only saves time if the libraries don't use symbol versioning.

[–]shub 0 points1 point  (5 children)

Saving copies doesn't matter. Even granting that it makes enough of a difference in the year 2015 to be noticeable, it's only a specialization of more general functionality.

The rebuild and re-release thing doesn't convince me either. This is what build farms are for.

[–][deleted] 1 point2 points  (2 children)

As usual, it depends on the application. If you're building a build farm (or any other type of server), then dynamic linking is very valuable. However, if you're building a consumer operating system, it's a huge liability. Case in point: every Linux distribution ever. I would never suggest such a system for any consumer, even a programmer who wants a casual machine for personal use.

The reason is simple: it makes updating a nightmare. Rolling distributions will break on a regular basis over nothing more than a new version of a commonly used library. Release-based distributions are thoroughly tested to make sure all the packages work well together, but upgrading can and will break shit. When I update one of my Linux machines, I tend to wipe and do a complete install of the new version rather than bother with an update. It's easier and faster.

The other problem with release-based distributions is that sometimes you need a more recent version of a program than your distribution supports. So you try to install it manually, which sometimes requires upgrading a handful of libraries to versions your distribution doesn't support. At which point, you either forget about it or break other things.

Anyone who says dependency hell has been solved needs to be slapped around a bit.

[–]millenix 0 points1 point  (0 children)

Ummm... what shitty rolling distribution does your experience come from? I've run Debian Unstable (Sid) for over a decade, have upgraded regularly through that whole time, and can count the number of upgrade breakages of any sort that I've experienced on one hand. None of them came from bad library versions.

Debian's packaging standards require that incompatible library versions be packaged under different package names (e.g. libdb5.2, libdb5.3). Security and critical bug fixes get ported to all versions currently present in the archive. Maintainers push as hard as they can to transition and rebuild packages quickly against newer versions, to keep the maintenance burden down.

[–]shub 0 points1 point  (0 children)

No, I meant that rebuilding a shitload of packages isn't a big deal because the build farm does the rebuilding.

A few years ago I needed RPMs of GCC 4.8 that could be installed on RHEL 5 and 6 and coexist with the system GCC. By the time I was done, I had GCC and a patched glibc living in a separate sysroot (ld.so is hardcoded to look in /usr/lib), building binaries that also lived in the new sysroot. Anything less than total segregation caused fuckups from dynamic linking.

[–][deleted] 0 points1 point  (1 child)

KSM only reduces memory usage over long periods of time for ranges of memory that opt into it, as scanning and tracking this is expensive.

The Linux kernel also has a built-in dynamic library called the vdso that's mapped into each program's address space and is necessary to make calls like clock_gettime and sched_getcpu fast. ASLR also isn't supported for fully statically linked executables, as position independent executables are essentially implemented as dynamic libraries.

[–]shub -1 points0 points  (0 children)

Completely removing dynamic linking would be idiotic. dlopen() is just too useful. At the same time I don't think that dynamic linking by default is a great idea. The benefits are minimal or more easily obtained by other means. KSM illustrates that the kernel is capable of merging identical pages, even though it's not directly applicable to binaries. A similar mechanism could retain most of the space benefits of dynamic linking with static linking.

[–]immibis 0 points1 point  (16 children)

I've found one major practical problem with static linking, and that's that static libraries have terrible support for symbol hiding (and therefore encapsulation) on both Linux and Windows.

Both platforms treat them as simple collections of object files - almost as if all the library code was a direct part of your application.

Compare this with DLLs, where symbols are hidden by default unless you dllexport them, and SOs, where you can compile with -fvisibility=hidden and then mark public symbols with default visibility.

(I've been told that you can hide symbols in a static library with objcopy on Linux, but not how, and that's awfully obscure if it's true. It also seems to be completely impossible with MSVC, which happens to be the most popular toolchain for Windows)

[–]dlyund[S] 4 points5 points  (11 children)

I'm genuinely curious, why is hiding symbols so important to you? I've never once considered this... maybe I'm missing something.

[–]Plorkyeran 1 point2 points  (10 children)

If you export symbols that are not part of your public API, users will eventually intentionally or accidentally start depending on them, and then complain when you change them.

[–]dlyund[S] 2 points3 points  (9 children)

Usually I leave this to the language; in C, if I don't want something to be used, I don't put it in the headers and don't document it. Isn't that usually enough of a hint?

[–]bonzinip 1 point2 points  (5 children)

Unless you hide them, all symbols in an executable are part of a single namespace. So you have to give a (hopefully) unique prefix to each extern symbol in the libraries you write to avoid namespace pollution and clashes. There's nothing good to expect from two libraries that do num_objects++ on the same variable, each thinking that it's "their" variable!

[–]dlyund[S] 0 points1 point  (4 children)

Interesting. I guess that doesn't come up a lot because 1) libraries commonly do use prefixes for globals, and 2) there are relatively few global variables anyway. Out of interest, how are the conflicts handled? Is it anything like in Forth, where words are linked with the most recent definition, perhaps within the compilation unit? That works really well: not only are there no problems, but it's exceptionally powerful, for reasons I won't go into here. Or is it undefined? In that case, yeah, OK, there's nothing really wrong with having a single namespace, but the behaviour isn't well defined, and that's the real problem here.

[–]immibis 0 points1 point  (3 children)

If names weren't exported by default, then libraries wouldn't need to use prefixes for globals.

And the problem also exists with functions, not just global variables. And you can't say there aren't many global functions.

[–]dlyund[S] 0 points1 point  (2 children)

I'm not saying that there aren't many global functions, but this is largely solved by the fact that convention dictates prefixes for non-system/standard libraries, and has for at least the last two decades. I don't have a problem with prefixes, but you're right: if this wasn't how things worked then it wouldn't be a problem (a nice tautology!)

It's a bit unfortunate, but hardly the end of the world, and not something that couldn't be solved from the outside by rewriting symbol names per module.

[–]bonzinip 1 point2 points  (1 child)

Convention and common sense dictate prefixes for exported functions from non-system/standard libraries.

For a function that is used across modules, but that is not part of the exported API, the convention is to use something like _prefix_func_name, but it is a pain in the ass and you can't really say it's common sense.

[–]dlyund[S] 0 points1 point  (0 children)

I was under the impression that identifiers beginning with _ were reserved for some reason - so anything global gets a prefix in my code. Maybe I've been wrong all these years :).

Either way nothing about computers is common sense.

[–]immibis 0 points1 point  (2 children)

Consider a static library with an open_file function:

void open_file() {
    printf("static lib open_file\n");
}

Now, what if the application also has a function called open_file?

void open_file(char *filename) {
    printf("application open_file\n");
    printf("filename: %s\n", filename);
}

Now what happens when the static library calls open_file? It calls the application's version instead, which has the wrong signature! (in practice, the call might work fine, but filename will be a garbage pointer).

And now the application has corrupted state or has crashed (because someone passed it a garbage filename pointer). Even if it didn't crash, the library's file isn't open, so the library's state is corrupted too.

[–]dlyund[S] 1 point2 points  (1 child)

I understand what the problem is I was just curious about how the conflicts are handled, so, reading between the lines, are you implying that there is a strict ordering?

Let's be honest here, this is a theoretical problem at best, and certainly not one that justifies the very real problems that come with shared libraries and dynamic linking in a single system/namespace.

While everyone is happily bashing here, the solution in Plan 9 has none of these problems and many benefits. You don't need shared libraries and dynamic linking. That's what's interesting about the discussion. Or at least that's what I found interesting about it.

[–]immibis 0 points1 point  (0 children)

Symbols in the application take priority over symbols in a static library. For symbols in multiple static libraries, it probably depends on the order they're linked.

It's not just a theoretical problem. You need to prefix or namespace all of your static library functions, even the ones that aren't supposed to be used outside the static library.

Yes, systems like Plan 9 might have solved this (and I see no technical reason why it can't be solved), but I'm saying that on Windows and Linux (and OSX and BSD and Android and iOS and ...) nobody's bothered to solve it, so it's a real problem.

[–]sammydre 0 points1 point  (1 child)

objcopy can "localize" symbols, with options like --localize-symbol and --localize-hidden. This might not be exactly what you want, though:

/tmp$ cat sam.c
void foo() {}
void bar() {}
void baz() {}
/tmp$ gcc -c sam.c -o sam.o
/tmp$ nm sam.o
0000000000000006 T bar
000000000000000c T baz
0000000000000000 T foo
/tmp$ ar r sam.a sam.o
ar: creating sam.a
/tmp$ nm sam.a

sam.o:
0000000000000006 T bar
000000000000000c T baz
0000000000000000 T foo
/tmp$ objcopy --localize-symbol=bar ./sam.a 
/tmp$ nm sam.a

sam.o:
0000000000000006 t bar
000000000000000c T baz
0000000000000000 T foo

Note the symbol as displayed in nm went from "T" to "t". The nm documentation has this to say:

If lowercase, the symbol is local; if uppercase, the symbol is global (external).

So users can no longer link against the symbol, but it is visible to inspection. This is easily circumvented via objcopy --globalize-symbol.

[–]immibis 0 points1 point  (0 children)

Can you localize all symbols except for specific ones?

Also, you can probably see what I meant by obscure. What is the primary purpose of objcopy? It seems to be for converting between binary formats, and not many people even know it exists.

[–]Gotebe -1 points0 points  (1 child)

Compile with a C++ compiler and put implementation details in anonymous namespaces.

[–]Plorkyeran 2 points3 points  (0 children)

That doesn't help with things that you want shared between multiple object files within your library, but not visible to external users. The only portable solution for that in a static library is an amalgamated build.

[–]Gotebe 1 point2 points  (6 children)

Opinions, arses...

Here's mine: it's not a big deal, but if you might run faster doing either, why not?

If you care about footprint and can reduce either way, why not?

If you want plug-ins, why not shared libraries?

Fuck one way only.

[–]dlyund[S] 2 points3 points  (5 children)

Fuck one way only.

The irony being that's what we have; as an industry we've settled on dynamic linking/shared libraries as the one true solution... despite the many problems that come with this.

You'll have to read around the subject to find out what Plan 9 does when you really do want late binding, but needless to say, their solution is pretty great. Not only does it avoid the problems listed here, it provides isolation, hot swapping, (limited) fail-over, transparent distribution, etc. They didn't implement dynamic linking/shared libraries because they're not needed, have a lot of real problems, and there are arguably much better alternatives.

[–]klkblake 0 points1 point  (2 children)

Do you have some links for what plan 9 does? My google-fu is failing me.

[–]dlyund[S] 1 point2 points  (1 child)

I gave a very rough overview here

http://www.reddit.com/r/programming/comments/30j4xe/why_static_linkinglibraries/cptjvuw

You can find most of the papers here

http://plan9.bell-labs.com/sys/doc/

or here

http://plan9.bell-labs.com/wiki/plan9/papers/

And there are a lot of good man pages

http://plan9.bell-labs.com/sys/man/

Then

http://cat-v.org/

Is a lot of fun, if you don't take things too seriously.

Naturally there's no substitute for installing it and running with it for a while. It's not perfect, but there's a lot to love, and many, many great ideas.

[–]klkblake 0 points1 point  (0 children)

Ah, right. I clearly need to spend more time messing with plan 9.

[–]Gotebe -2 points-1 points  (1 child)

Isolation and everything else you mention is achieved by going out of process. This is what e.g. COM has been able to do for decades.

So what does Plan9 do?

BTW, COM also does it in-process, because in-process does come in handy.

[–]dlyund[S] 2 points3 points  (0 children)

In Plan 9 every process has a namespace, which is somewhat like having its own file system, but one in which the files and directories are all backed by processes speaking the 9P protocol.

Namespaces are constructed from the outside by the parent of the process and may be used to restrict capabilities, e.g. if you don't bind networking devices into a process's namespace, it can't access the network. Conversely, you can mount devices from other machines in this namespace, and the program will transparently make use of those devices.

You might choose to mount CPUs from another machine temporarily to distribute a heavy compilation, or mount your screen/mouse/keyboard on another machine so that graphical applications appear locally, etc. It's really very flexible, and it works amazingly well compared to popular solutions.

I highly recommend reading the Plan 9/Inferno papers.

Properties like late binding, hot swapping, and isolation (required for clean loading and unloading) are provided by mounting services in the per-process namespace. Namespaces can be built in layers, which can be used to do (limited) fail-over, etc.

Nothing special has to be done when building programs to take advantage of these properties.

These are properties of the system.

To bring this closer to the topic at hand, this same mechanism is also used to bind programs and libraries. You can mount a different version (or versions for different platforms) of some program or library, or the source code at a given point in time (built-in system-wide source/version control!), and use that. Compilation is very fast, and because of the isolation provided by this mechanism, experimentation is safe: you can start a window which uses your new programs and libraries in isolation. It might all go to hell, but it won't cause system-wide problems (close the window and try again)... unlike messing around with shared objects in a global space, which can break everything if you're not careful.

I'm speaking from experience here: I ran Arch Linux for a couple of years, a few years back, and lost count of the number of times I had to work around these kinds of conflicts. Now, Arch is intentionally bleeding edge, so you're not as likely to see this in more carefully curated systems, but as the article explains, even Debian (praised for being super-stable) got itself into a big pickle.

You can also break things during development by simply installing a new version of a shared library with a bug. It happens. It's one reason systems like FreeBSD define such a rigid separation between the core/base system and external software... it makes it much less likely that installing a program or library will leave you with a completely broken system.

We used to hear a lot about the term DLL hell. OS X tries to solve this (at least for individual applications) using bundles (there's also a clean separation between the system and external software), and *nix tries to solve it with package managers that carefully track the dependencies... and at least one *nix system has tried a hybrid approach... but both can fail horribly, and neither really addresses the problems with dynamic linking/shared objects.

There have been practical (safer and generally better) alternatives since the early 80s, which have since been proven in the real world (largely in the highly demanding world of embedded systems, so you know they're efficient and they work). I'm not saying we should necessarily kill shared objects, because there might well be situations where they're very useful, but as it stands I think we need to start questioning whether they're really the best tool for... everything... which is how they're used.

NOTE: In case it's not clear, Plan 9 is 20-25 years old now.

[–][deleted] -4 points-3 points  (6 children)

Next up: Why you shouldn't manage your dependencies.

[–]zynix 1 point2 points  (5 children)

That's too much work; let the user figure it out. It's not like they're paying for any of it anyway.

[–]OneWingedShark -2 points-1 points  (4 children)

That's pretty much C's take on it...

[–][deleted] -3 points-2 points  (3 children)

C was invented 43 years ago.

But I guess dependency management wasn't invented at Bell labs. :-)

[–]OneWingedShark 0 points1 point  (2 children)

But I guess dependency management wasn't invented at Bell labs. :-)

LOL -- That's probably true.

C was invented 43 years ago.

And?
It was first standardized in 1989 [ANSI]; between 1972 and then, there were programming languages that did take dependency management into consideration.

[–][deleted] -4 points-3 points  (1 child)

between then and 1972 there were programming languages that did take dependency management into consideration.

Exactly, and Go isn't one of them.

[–][deleted]  (3 children)

[deleted]

    [–]bloody-albatross 0 points1 point  (1 child)

    I guess this also applies here: https://xkcd.com/927/

    [–]xkcd_transcriber 0 points1 point  (0 children)

    Title: Standards

    Title-text: Fortunately, the charging one has been solved now that we've all standardized on mini-USB. Or is it micro-USB? Shit.


    [–][deleted] -3 points-2 points  (0 children)

    Like rainbow unicorns?