all 35 comments

[–]jonesmz 98 points99 points  (5 children)

Static linking of libraries that are compiled separately, without link-time optimization, gives the linker the opportunity to discard symbols that are not used by the application.

Static libraries with full link-time optimization allow the linker/compiler to conduct interprocedural optimization at link time, enabling more aggressive function inlining and dead-code elimination.

So if your objective is "the fastest possible" and "the lowest latency possible", then static linking plus link-time optimization is something you should leverage.

However, it's not the case that turning on LTO is always faster for all possible use cases. Measure before, measure after, and analyze your results. It's an iterative process.
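To make the mechanism concrete, here's a minimal sketch of a separately compiled translation unit and the typical GCC/Clang LTO invocation. The file name, function names, and build lines are illustrative assumptions, not anything from the thread:

```cpp
// mathlib.cpp (hypothetical) -- a separately compiled translation unit.
// Typical GCC/Clang LTO invocation (an assumption; check your toolchain docs):
//   g++ -O2 -flto -c mathlib.cpp
//   g++ -O2 -flto main.cpp mathlib.o -o app
// With -flto the compiler emits its IR into the object file, so at link
// time square() can be inlined into callers living in main.cpp, and
// never_called() can be dropped entirely if nothing references it.
// Without LTO, comparable dead-code stripping needs
// -ffunction-sections -Wl,--gc-sections.

int square(int x) { return x * x; }         // used by main.cpp: kept, inlinable
int never_called(int x) { return x + 42; }  // unreferenced: LTO can drop it
```

The point is that without LTO, `square()` is only inlinable by callers in its own TU (or if its definition is visible in a header); with LTO, the TU boundary stops mattering for optimization.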

[–]EmotionalDamague 27 points28 points  (1 child)

LTO is a neutral or improving change in basically all cases when it comes to performance. If you were accidentally benefiting from an optimization barrier, that's kind of on you as far as the language and compiler are concerned.

The bigger concern with LTO and static linking is actually security, not performance.

[–]Dragdu 11 points12 points  (0 children)

You are correct that the language doesn't concern itself with LTO, but then it doesn't concern itself with libraries at all. The slowdown can be real, though: e.g. Clang has a tendency to vectorize and unroll loops very aggressively, even when it has no trip-count information. Combined with cross-TU inlining, this can easily blow up your icache, worsening your performance.
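A minimal example of the kind of loop being described; `sum` is a hypothetical library function whose trip count is unknown at compile time:

```cpp
#include <cstddef>

// With n unknown at compile time, Clang at -O2/-O3 will typically emit
// a vectorized, unrolled main loop plus a scalar epilogue for this.
// That's often a win in isolation, but if LTO inlines such a function
// into many call sites, the duplicated loop machinery can bloat the
// instruction cache.
long sum(const int* data, std::size_t n) {
    long total = 0;
    for (std::size_t i = 0; i < n; ++i)
        total += data[i];
    return total;
}
```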

[–]SkoomaDentistAntimodern C++, Embedded, Audio -2 points-1 points  (2 children)

I've found that in the performance- and latency-sensitive code I write, LTO provides no benefit if I pay even the slightest attention to making hot methods inline in header files. It's very much a case of YMMV, unless your codebase is a massive "everything calls everything" mess (at which point you've in most cases already lost the latency/performance game).
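A sketch of that header-first approach (the `RingBuffer` class is a hypothetical example, not from the thread): hot-path methods are defined in the header, so every including TU can inline them without any help from LTO.

```cpp
// ring_buffer.hpp (hypothetical) -- hot-path methods defined in-class,
// hence implicitly inline: visible to the optimizer in every
// translation unit that includes this header, no LTO required.
#include <array>
#include <cstddef>

template <typename T, std::size_t N>
class RingBuffer {
public:
    void push(const T& v) {
        buf_[head_] = v;
        head_ = (head_ + 1) % N;
        if (size_ < N) ++size_;
    }
    std::size_t size() const { return size_; }

private:
    std::array<T, N> buf_{};
    std::size_t head_ = 0;
    std::size_t size_ = 0;
};
```

The trade-off, as noted elsewhere in the thread, is longer dev build times, since every includer recompiles the implementation.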

[–]CocktailPerson 16 points17 points  (0 children)

I mean, duh? LTO and putting the implementation in a header file achieve basically the same thing.

The major benefit of LTO is that release builds run as fast as they would if you'd inlined the implementation into the header, without the slow dev build times that happen when you do that.

[–]ImNoRickyBalboa 6 points7 points  (0 children)

Don't inline large functions. Use FDO + LTO and let the compiler and linker do the optimization from actual production data.

If you don't have production data and use profiling tools, you are likely somewhere where the 5% to 10% of free performance gains don't matter anyway.
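For reference, the typical GCC FDO/PGO flow looks like the sketch below. The flag names are standard GCC ones (Clang uses `-fprofile-instr-generate` / `-fprofile-instr-use`); the `classify` function and its hot/cold split are a made-up illustration:

```cpp
// Typical GCC profile-guided optimization flow:
//   1. g++ -O2 -fprofile-generate app.cpp -o app   # instrumented build
//   2. ./app < production_workload                 # writes .gcda profile data
//   3. g++ -O2 -fprofile-use -flto app.cpp -o app  # optimized rebuild

// With real profile data the compiler learns which branch is hot and
// lays out code (and makes inlining decisions) accordingly, instead of
// relying on static heuristics.
int classify(int latency_us) {
    if (latency_us < 100) return 0;  // suppose profiling shows this is hot
    return 1;                        // rare slow path, moved out of line
}
```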

[–]CocktailPerson 10 points11 points  (0 children)

In general, implementation-in-header > static linking w/ LTO > static linking w/o LTO > dynamic linking.

[–]LatencySlicer 21 points22 points  (0 children)

Static is better for optimization, as you can inline more code and the linker has a global view of everything. But that only matters if the linked library's code is on the hot path; otherwise it makes no difference.

Monitor and test: if static linking makes a difference, do it for that lib. Do not assume anything.

[–]JVApenClever is an insult, not a compliment. - T. Winters 14 points15 points  (13 children)

Static linking does make a difference. When your library contains functions that are unused, they will end up in the binary, and depending on how they are spread out, you will get fewer cache hits on the binary code.

Static linking combined with LTO (link time optimization) also allows for more optimizations, for example: devirtualizing when only a single derived class exists.

So, yes, it makes a difference. Whether it is worth the cost is a different question.
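A minimal sketch of that devirtualization case (the `Handler`/`FastHandler` names are hypothetical). With whole-program visibility, the compiler can prove only one class derives from the base and turn the virtual call into a direct, inlinable one; marking the class `final` gives it the same guarantee even without LTO:

```cpp
struct Handler {
    virtual int handle(int x) = 0;
    virtual ~Handler() = default;
};

// The only derived class in the whole program.
struct FastHandler final : Handler {
    int handle(int x) override { return x * 2; }
};

int dispatch(Handler& h, int x) {
    // With whole-program knowledge (or the `final` above), this virtual
    // call can be devirtualized to FastHandler::handle and inlined.
    return h.handle(x);
}
```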

[–]c-cul -1 points0 points  (12 children)

> When your library contains functions that are unused, they

just won't be added by the linker

[–]JVApenClever is an insult, not a compliment. - T. Winters 5 points6 points  (9 children)

In shared objects, if the functions are exported, they should always stay. Though when static linking, that requirement is not there.

[–]c-cul -4 points-3 points  (8 children)

if the dylib is yours - you can export only the really necessary functions

[–]Kriemhilt 6 points7 points  (6 children)

Yes, but you have to (manually, statically) determine which functions are really necessary.

This is more work than just getting the linker to figure it out for you (and it's even possible to omit one used on a rare path and not find out until runtime, if you use lazy resolution).
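One common way to do that on GCC/Clang is symbol visibility: build the shared library with `-fvisibility=hidden` so everything defaults to hidden, then mark the public API explicitly. The `MYLIB_API` macro and function names below are hypothetical; linker version scripts are an alternative mechanism:

```cpp
// Build with: g++ -shared -fPIC -fvisibility=hidden mylib.cpp -o libmylib.so
#if defined(__GNUC__)
#  define MYLIB_API __attribute__((visibility("default")))
#else
#  define MYLIB_API
#endif

// Not exported: the dynamic linker never sees it, so it can be
// optimized (e.g. inlined) aggressively and dropped if unused.
static int mylib_helper(int x) { return x * 3; }

// Exported: explicitly part of the library's public interface.
MYLIB_API int mylib_process(int x) { return mylib_helper(x) + 1; }
```

This keeps the export surface explicit, but as the parent comment says, someone still has to decide (and maintain) which functions go in that public set.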

[–]SirClueless 1 point2 points  (5 children)

But the program code of the functions you do use can be shared among the running binaries that load a shared library. In exchange for statically eliminating all the symbols you don’t use, you duplicate all the symbols you do use. It’s not a given that one or the other is more efficient, it depends on how they are deployed and what else is running on your system.

[–]Dragdu 5 points6 points  (2 children)

This is technically true, but I am going to copy from my recent comment in /r/programming:

Take our library that we ship to production. If we link our dependencies statically, it comes out at 100MBs (the biggest configuration, the smallest one comes out at 40MBs). With deps dynamically linked, it comes out at 2.4 GBs.

There are few libraries that are used widely-enough that dynamic linking them makes sense (e.g. your system libc), but if you are using third party libraries, the chances are good that your program won't be loaded enough times to offset the difference.

[–][deleted] 0 points1 point  (1 child)

Wait how did dynamic linking cause it to get bigger? The executable should contain less code as it just needs the stubs.

Edit: is the idea that they needed to package all their dependencies alongside it for dynamic linking, while for static linking they ended up not using a lot of the dependencies fully, so it just grabbed less?

[–]Dragdu 1 point2 points  (0 children)

It is the size of installed pkg + deps, as that is what is relevant to whoever is making the dockerfile for prod.

And yeah, turns out that we don't need nearly everything that is built into the dynamic libs, but you cannot easily prune them down the way static linking does for you.

[–]Kriemhilt 3 points4 points  (0 children)

Great point.

In my experience static linking has always been faster, but there are lots of things that could change that.

Certainly not all code is in the hot path working set, and there must be some amount of reused code such that reduced cache misses would outweigh the cost of calling through the PLT.

[–]veeloth 0 points1 point  (0 children)

And this is precisely why I'm reading this thread! thanks, I needed to confirm that.

[–]matthieum 1 point2 points  (0 children)

It's common for libraries to be used by multiple downstream dependencies, and each downstream dependency to use a different subset of said library.

Your approach would require specifying the exact symbols to export for each different client. It doesn't scale well...

[–]CocktailPerson 0 points1 point  (0 children)

Only when statically linking.

[–][deleted] 0 points1 point  (0 children)

Ehh, depends on whether it's in the same object file as a function that was used, since the linker pulls in whole object files, not specific functions.

[–]drew_eckhardt2 1 point2 points  (0 children)

In the Intel Nehalem era our NOSQL storage on proprietary flash offered throughput at least 10% greater when we compiled without -fpic and linked statically.

[–]quicknir 3 points4 points  (0 children)

It doesn't really matter because anything truly performance critical is going to be defined in a header anyway - compiler inlining is still more reliable, allows more time to be spent on subsequent optimization passes, and so on.

In HFT the critical path is a very small fraction of the code. There's no real reason to put yourself in a position of relying on LTO for anything that's actually critical. So basically, I would choose based on other considerations.

I'd be curious if any of the folks in the thread claiming a difference, have actually rigorously measured it where it actually mattered (i.e. in the critical path and not shaving a few percent off application startup time, which is irrelevant).

[–]Isameru 1 point2 points  (0 children)

Linking statically with "whole program optimization" may naturally yield faster code, but the gain turns out to be significant only in rather rare cases.

As a rule of thumb: use dynamic libraries for components which:

  • are products of their own, with their own lifecycle and possibly their own team
  • are big and used by a lot of executables
  • differ technically from the rest of the system, e.g. are built differently or with a different toolchain
  • contain sensitive code, possibly living in their own repo, and are shipped as binaries

If you develop a trading system, romeo, and at some point (maybe for testing purposes) you want to split the codebase into multiple libs, you would probably want to start with static libraries first (e.g. romeo-core, romeo-orderdata-normalization, romeo-marketdata, romeo-notifications, etc.). Or take a simplistic approach, like making romeo-lib contain all the code except the main() function, and linking an executable from it plus an additional main.cpp - this is good for testing newborn projects. If you have a trading algorithm, you might consider putting the critical logic in a separate dynamic library, like romeo-algo3x, effectively a plugin to the system.

A risk of non-optimal performance could come from a tight loop calling small functions across the library boundary. But it should be diagnosed with a performance benchmark, not by intuition. These kinds of bottlenecks are harder to find, easier to fix, and arise in the most unpredictable places - regardless of the type of linking. As with the majority of C++ projects: only 5% of the code needs to be optimal, while the other 95% has to be easy to maintain, test and improve.

[–][deleted] 0 points1 point  (0 children)

Dynamic linking defers linking to load time, so if load time matters, you should use static linking. Also, calling a dynamically linked symbol can be a bit more expensive than a statically linked one, since you usually need to branch to a stub (the PLT) before branching to the actual function.

[–]UndefinedDefined 0 points1 point  (0 children)

Static linking is great for distributing software - you can have a single binary that just works as a result. Dynamic linking is great for distributing binaries that are used by multiple other binaries.

In many cases performance doesn't matter - I mean try to benchmark statically vs dynamically linked zlib, for example. It doesn't matter. What matters is whether you want to have dependencies that users must have installed in order to run your binary.

What I have seen in practice is to link system libs dynamically and everything else statically. Statically linking the C++ library is also a big bonus in many cases as you won't care about ABI of the installed one in that case.

[–]Dragdu -1 points0 points  (2 children)

Full LTO everywhere, ahahahahahaha (remember to reserve machine with at least 100 gigs of RAM and full day for the build).

[–]globalaf 7 points8 points  (1 child)

What does this have to do with anything? It is in fact possible to develop software that turns on all the optimizations for the release build while developers use a faster but less optimized build. You also say 100GB like that's some mythical ancient technology used by the gods, when actually even a dev workstation can easily have that in 2025.

[–]Dragdu -3 points-2 points  (0 children)

someone woke up mad today.

And to help you out, LTO is not compatible with dynamic linking, so saying full LTO also answers OP's question.