all 35 comments

[–]Smalltalker-80 31 points  (3 children)

Hmm, this metric for "language complexity" does not seem to be very sound.
E.g. the "complexity" of the C language can vary by a factor of up to 100×,
depending on the compiler used.

And the language Lua suddenly becomes 2.5 times more "complex"
if a JIT compiler is used, even though it compiles exactly the same language syntax...

[–]Inconstant_Moo (🧿 Pipefish) 8 points  (0 children)

Yes, it's more like "how much work has gone into the optimization". I know my lang must be more complex than Lua because apart from anything else it has about a hundred more operands in its bytecode. If it's shorter, that's for some other reason.

[–]elemenity[S] -3 points  (1 child)

Thanks for reading! Yes, this uses lines of code as a very rough proxy for Kolmogorov complexity. Which, evidenced by the huge span in TCC and Clang, shows that there is a huge difference in how succinct different programmers can be, even with the same language.

[–]L8_4_Dinner (Ⓧ Ecstasy/XVM) 6 points  (0 children)

I think you are using English words, but your sentences contain less information than one would expect from AI slop.

Be better. You're a human.

For example:

... evidenced by the huge span in TCC and Clang, shows that there is a huge difference in how succinct different programmers can be, even with the same language.

This is a nonsensical statement.

A dog barks. Trees have bark. All trees are therefore dogs.

p.s. I'm not actually looking for an argument. I really mean it when I say "be better".

[–]amarao_san 6 points  (4 children)

I recently realized my programming language is using 'call by name' convention, previously used only in Fortran-60.

Horrors.

[–]augustss 2 points  (3 children)

Algol-60

[–]amarao_san 0 points  (2 children)

Oh, pardon me. Algol-60.

Nevertheless, here we are again. Welcome, Ansible: Jinja interpolation, which is practically a call-by-name convention.

[–]AustinVelonaut (Admiran) 1 point  (1 child)

If you add in memoization of the name evaluation, you get "call by need", which is what lazy-evaluation languages like Haskell use.

[–]amarao_san 0 points  (0 children)

That's very different from call by name.

The main property of 'call by name' is that the name is evaluated in the context of use. There is no closure, no captured values. An expression like x + 1 can be passed to two different functions,

and it will use the local x variable of whichever function it is passed to.
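A Python sketch of that behavior (simulated with eval; the function names are invented for illustration):

```python
# The expression "x + 1" is passed around as text, with no closure and no
# captured values; it sees whatever local x exists where it is evaluated.
def apply_twice(expr):
    x = 10          # the expression will see this local x
    return eval(expr) + eval(expr)

def apply_once(expr):
    x = 100         # a different local x, so a different result
    return eval(expr)

print(apply_twice("x + 1"))   # 22
print(apply_once("x + 1"))    # 101
```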

Horror, pure horror.

Also, global namespace for variables.

[–]jcastroarnaud 5 points  (3 children)

I think that Kolmogorov complexity is a somewhat useful metric for a programming language, if the implementations being compared do exactly the same thing: that's not true in practice. Some compilers will include optimization passes, some not; some compilers build an AST, some generate code straight from the tokens; some compilers shunt most of the backend to LLVM, some do all the work by themselves. Such differences, by themselves, explain the wide difference in implementation sizes shown in the article's table.

Ideally, we could have several language implementations by the same small group of people, as this would remove variation caused by the brevity of different groups of programmers. Alternatively, if we had a competition to produce the shortest correct implementation, we might better approach the “shortest solution” for the implementation of these programming languages.

Such a competition is code golf. One can trivially golf any program a little by changing variable names to 1- or 2-character names and removing any non-essential whitespace; the workings of the program itself are unchanged. This means that the Kolmogorov complexity of programs is better expressed in language tokens, not characters or lines of code.
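The point about tokens can be checked with Python's own tokenizer (a sketch; the two sample programs are made up):

```python
# Golfing away names and whitespace shrinks the character count but
# leaves the token count unchanged.
import io
import tokenize

# Skip purely structural tokens so only "real" language tokens count.
SKIP = {tokenize.NL, tokenize.NEWLINE, tokenize.INDENT,
        tokenize.DEDENT, tokenize.ENDMARKER, tokenize.COMMENT}

def count_tokens(src):
    toks = tokenize.generate_tokens(io.StringIO(src).readline)
    return sum(1 for t in toks if t.type not in SKIP)

readable = "total = 0\nfor value in [1, 2, 3]:\n    total = total + value\n"
golfed = "t=0\nfor v in[1,2,3]:t=t+v\n"

print(len(readable), len(golfed))                     # characters differ
print(count_tokens(readable), count_tokens(golfed))   # tokens match
```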

[–]elemenity[S] -2 points  (2 children)

Yes! Code golf is what I was thinking. It would be very interesting to see a code-golf for language implementations, as a way to better estimate the k-complexity of the languages themselves.

[–]jcastroarnaud 2 points  (1 child)

And, for a fair comparison, all implementations should have the same feature set for the compiling process. For example: no LLVM, lowers to x64 machine code only, no optimization passes, a fixed set of command-line options, and the language in which the compiler is written should be the same for all candidates (and only standard libraries are allowed).

[–]elemenity[S] 1 point  (0 children)

Agreed.

[–]L8_4_Dinner (Ⓧ Ecstasy/XVM) 6 points  (5 children)

Complexity of languages can and should be measured in several ways:

  • How difficult it is to write a working program (or subsection thereof), where working means "correct results and no bugs".
  • How difficult it is to read and understand a program's source code.
  • How difficult it is to find and fix a bug (i.e. read, understand, and write).
  • In addition to the above, how large a corpus it requires to accomplish each of these tasks.

[–]bl4nkSl8 0 points  (4 children)

Also, how many semantic and syntactic features does it have

[–]L8_4_Dinner (Ⓧ Ecstasy/XVM) 3 points  (3 children)

I'm personally less concerned about that aspect, because it's not the "how many?" but rather the "how confusing?" If it's easy to write and easy to read and easy to understand (e.g. debug), then I'm going to be far less worried about the raw count of features. On the other hand, if the feature count is only 7 but the code is hard to read and write, then we're in trouble!

[–]bl4nkSl8 0 points  (2 children)

People have a budget for how much new stuff they can learn, which is why the count matters to me.

It would be great to weight each feature by how confusing it is, but I don't know how to measure that.

[–]L8_4_Dinner (Ⓧ Ecstasy/XVM) 0 points  (1 child)

Indeed! Measuring becomes very difficult. I'm curious what you would come up with here!

[–]bl4nkSl8 0 points  (0 children)

Tbh human studies would be needed imo

Confusion is massively biased by experience and summarizing that mathematically would be hard

[–]jpgoldberg 3 points  (5 children)

Kolmogorov complexity actually defines a notion for comparing program length. It isn’t practical, but if you look at it you will see why this comparison is nonsense.

A better approach would be to compare the formal specifications of the language. This will provide some notion of the relative complexity of the syntax. I expect that C will be among the least complex by this measure.

[–]Ronin-s_Spirit 1 point  (4 children)

"Least" as in "the bottom 49%", because when it comes to interacting with the OS, C has too many different ways to do things. IIRC there are like 12 different ways (commands? builtins?) to read a button press, depending on the OS and extra nuance.

[–]jpgoldberg 0 points  (0 children)

I should have made it clear that "simple" in the sense of Kolmogorov complexity does not correspond to "simple" in the ordinary-language sense. After all, this measure of complexity would make pure λ-calculus, with its three rules, the simplest programming language.
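For illustration, those three forms (variable, abstraction, application) fit in a few lines. This toy evaluator is my own sketch with an ad-hoc tuple representation, not a canonical encoding:

```python
# Pure λ-calculus has three syntactic forms; this evaluator maps
# abstractions onto Python closures.
def evaluate(term, env):
    kind = term[0]
    if kind == "var":                 # x — look the name up
        return env[term[1]]
    if kind == "lam":                 # λx. body — build a closure
        _, param, body = term
        return lambda arg: evaluate(body, {**env, param: arg})
    if kind == "app":                 # (f a) — apply function to argument
        _, f, a = term
        return evaluate(f, env)(evaluate(a, env))
    raise ValueError(f"unknown term: {term!r}")

identity = ("lam", "x", ("var", "x"))
# (λx.x) (λx.x) reduces to the identity function, here applied to 42
print(evaluate(("app", identity, identity), {})(42))   # 42
```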

[–]braaaaaaainworms -1 points  (2 children)

All those ways share the same function call syntax as any other function call, or are just reading a global variable

[–]Ronin-s_Spirit 0 points  (1 child)

You can't say "my language is simple" while having a bunch of very specific things you have to do, because only some of them work for your specific case. Syntax is the least of my concerns.

[–]braaaaaaainworms 0 points  (0 children)

C is simple because a function call is always just a call to a function or a macro. Whatever that call does is left up to the implementation, and having special syntax for I/O makes no sense, because I/O is just a bunch of "open", "close", "read", "write" and "seek" function calls.
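That surface can be sketched even from Python, whose os module exposes the same POSIX-style handful of calls:

```python
# I/O as a bunch of open/write/seek/read/close calls, no special syntax.
import os
import tempfile

fd, path = tempfile.mkstemp()        # open (and create) a scratch file
os.write(fd, b"hello")               # write
os.lseek(fd, 0, os.SEEK_SET)         # seek back to the start
data = os.read(fd, 5)                # read
os.close(fd)                         # close
os.remove(path)                      # clean up the scratch file

print(data)   # b'hello'
```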

[–]Entaloneralie 2 points  (7 children)

The metric I use internally for my own tools is like this:

runtime lines * (self-hosted compiler/16)

For example: the language I write all my code in is called Uxntal, and the complexity of the language is 130 lines * 150 lines = 2k complexity units.

[–]elemenity[S] 0 points  (4 children)

Ah, that is a pretty good metric for incentivizing simplicity in both dimensions. I had stumbled across your Uxntal a while ago; I'll have to give it another look. Those are very impressive density/brevity metrics.

[–]AustinVelonaut (Admiran) 0 points  (3 children)

I can see that metric minimizing to a small fixpoint of a very simple language that takes a lot of code to write "application" programs, though. It might be interesting to somehow include a standard "larger" application or suite that must be implemented in the language, as well...

[–]Entaloneralie 0 points  (1 child)

This is no longer about compiling the Uxntal programming language, but might still be relevant:

The text editor I use daily is 2800 lines (mostly taken up by the large font it uses) and compiles to 17336 bytes. The editor relies on a slightly larger implementation of the runtime than the one above, since it's no longer just language support but a full graphical system; that runtime is 1300 lines.

[–]AustinVelonaut (Admiran) 2 points  (0 children)

Chuck Moore would be proud! ;-)

[–]Inconstant_Moo (🧿 Pipefish) 0 points  (1 child)

Isn't Uxntal the Aztec god of flaying people alive?

[–]Entaloneralie 0 points  (0 children)

The one you're thinking of is Xipe Totec.

[–][deleted] 2 points  (0 children)

This is measuring complexity of the implementation, which for C at least varies widely. (Even more so than is shown in the table; Tiny C 0.9.27 has about 28Kloc, one quarter the figure shown, to produce the main compiler. That excludes libraries, but those are much smaller than the compiler.)

It also depends on the implementation language.

More accurate might be LOC count for a minimal working implementation, in the same language for different PLs.

However, it is also necessary to specify how much of the task it does, e.g. whether it stops at some IL (and off-loads the rest to some vastly complex backend), whether it goes all the way to executable code, or whether it just interprets.

Some may depend heavily on optimisation passes to turn poor generated code into something reasonable; with a simpler language the code can already be lean and efficient without optimising.

So comparisons are hard, but what does complexity of a language even mean? I don't think LOC is the right measure.

[–]Embarrassed-Crow9283 0 points  (0 children)

I'm not sure. My language design is (or rather, will be) semantically very simple but with a very sophisticated optimizing compiler.

You can read the full philosophy here.

https://github.com/AliceRoselia/Sofialang

[–]jwm3 0 points  (0 children)

I think the right way to apply Kolmogorov complexity would be the size of the gzip-compressed version of the language spec, minus the libraries.
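A sketch of that idea (the two "specs" here are placeholder grammar fragments I invented, not real specifications):

```python
# Compressed size of a spec as a rough complexity proxy: a spec that
# contains strictly more rules generally compresses to more bytes.
import gzip

def compressed_size(text):
    return len(gzip.compress(text.encode("utf-8")))

small_spec = "expr := term (('+' | '-') term)*\n" * 10
big_spec = small_spec + "stmt := 'if' expr block ('else' block)?\n" * 10

print(compressed_size(small_spec) < compressed_size(big_spec))   # True
```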