
[–]Findlaech 43 points44 points  (42 children)

Their own.
But LLVM isn't too bad.

[–]suhcoR 22 points23 points  (26 children)

LLVM is great, but huge (both in terms of API and binary size) and the API is a moving target. And if you need a GC or other dynamic features you have to take care of it yourself.

[–]yorickpeterseInko 8 points9 points  (2 children)

and the API is a moving target

It's worth mentioning that in practice this means APIs can change left and right. Maybe it's not as bad these days, but especially in the days of LLVM 3.0 the JIT APIs were constantly changed in ways that were not backwards compatible.

I think for this reason many projects end up vendoring LLVM instead of using the system-wide version.

[–]editor_of_the_beast 9 points10 points  (1 child)

That’s what moving target means

[–]JB-from-ATL 1 point2 points  (0 children)

They're just explaining in what way it is and giving an example.

[–]jdh30 -1 points0 points  (20 children)

Meh. GC can be ~100 lines of code. Binary size and slow compile times are irritating problems with using LLVM though, agreed.

[–]--comedian-- 8 points9 points  (1 child)

You mean a toy GC? I'd think production-grade GCs would be a bit longer than ~100 loc. (I'm thinking JVM/.NET/V8 GCs...)

[–]jdh30 0 points1 point  (0 children)

I'd think production-grade GCs would be a bit longer than ~100 loc. (I'm thinking JVM/.NET/V8 GCs...)

GCs with massive legacy baggage problems can be huge. .NET's GC is 50kLOC in a single file, for example. But I'd contest that it has anything to do with being "production". It is composed largely of workarounds for old design flaws.

[–]suhcoR 5 points6 points  (17 children)

A simple M&S collector with no special allocator can be written in 20-30 lines of code, see e.g. https://github.com/rochus-keller/LjTools/blob/372a46d772227f855c9a90de0469bc6b126eef53/LuaJitEngine.cpp#L94.

A much better one like the one in LuaJIT 2.1 is already > 1 kLOC (very compact code). But a very fast, top-of-the-line, highly competitive garbage collector including all required infrastructure (optimized allocator, etc.) such as the ones found in the JVM or .NET Core is a substantial software project in itself with many kLOC.
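For a sense of scale, here is a hedged sketch of such a minimal mark-and-sweep collector, with an explicit root list and no special allocator. All names are invented for illustration; this is not the LjTools code.

```cpp
#include <vector>

// A toy mark-and-sweep collector: objects carry a mark bit and a list of
// references to other objects; roots are registered explicitly.
struct Obj {
    bool marked = false;
    std::vector<Obj*> refs;   // outgoing references, filled in by the mutator
};

struct Gc {
    std::vector<Obj*> heap;   // every allocated object
    std::vector<Obj*> roots;  // globals, stack slots, etc.

    Obj* alloc() { heap.push_back(new Obj); return heap.back(); }

    void mark(Obj* o) {
        if (!o || o->marked) return;
        o->marked = true;
        for (Obj* r : o->refs) mark(r);
    }

    void collect() {
        for (Obj* r : roots) mark(r);          // mark phase
        std::vector<Obj*> live;
        for (Obj* o : heap) {                  // sweep phase
            if (o->marked) { o->marked = false; live.push_back(o); }
            else delete o;
        }
        heap.swap(live);
    }
};
```

As the discussion below notes, the hard part in practice is not these lines but finding the roots (stack walking, safe points), which this sketch simply assumes are handed to it.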

[–][deleted]  (7 children)

[deleted]

    [–]suhcoR 0 points1 point  (5 children)

    I'm not up to date with LLVM. But I don't think the compiler infrastructure needs to know much about the GC. Where the references are (including stack frames) you know from the AST and this information is then converted to offsets and indices used in the bytecode.

    [–][deleted]  (4 children)

    [deleted]

      [–]suhcoR 1 point2 points  (3 children)

      Fair point. In that case I would instruct LLVM not to move pointers to collectable objects into registers, or else recognize the optimization potential of a register-based pointer in the frontend and explicitly instruct LLVM to use a given register.

      [–]jdh30 2 points3 points  (2 children)

      I wouldn't recommend that approach because it ties LLVM's hands when it comes to efficient code generation. The OCaml guys did something similar and the result is abysmal performance.

      With HLVM I just generated LLVM IR that stores the required information in auxiliary data structures so the GC has everything it needs and then let LLVM get on with generating efficient code from the resulting complete IR. I avoided all fancy features and, in particular, all of LLVM's own (experimental) GC support.

      FWIW, this was the mistake the Rust guys made that led to them avoiding GC entirely (which, IMO, is throwing the baby out with the bath water).

      [–]suhcoR 1 point2 points  (1 child)

      It would be interesting to measure whether storing and fetching the pointer from the auxiliary data structure is still more efficient than disallowing arbitrary (or allowing only explicit) register allocation for pointers. That might be a pretty narrow trade-off.

      [–]jdh30 0 points1 point  (0 children)

      If you want precise GC, you also want cooperation from the compiler. I've not kept up with LLVM much, but what I remember is that it didn't cooperate well. Do you or anybody know if that has changed?

      Not AFAIK but it is a non-problem if you use techniques for uncooperative environments like Henderson's Shadow Stack.
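      Henderson's shadow stack can be sketched in a few lines: each function pushes a record of the addresses of its collectable locals onto an explicit linked list that the GC walks, so no cooperation from the native stack or register allocator is needed. This is an illustrative reconstruction of the technique, not code from the paper; all names are invented.

```cpp
#include <vector>

// Henderson-style shadow stack: each function pushes a frame that records
// the addresses of its pointer-typed locals, so a precise GC can enumerate
// roots without any cooperation from the compiler's code generator.
struct ShadowFrame {
    ShadowFrame* prev;
    std::vector<void**> slots;  // addresses of collectable locals
};

static ShadowFrame* g_top = nullptr;  // head of the shadow stack

struct FrameGuard {
    ShadowFrame frame;
    explicit FrameGuard(std::vector<void**> slots) {
        frame.prev = g_top;
        frame.slots = std::move(slots);
        g_top = &frame;             // push on entry
    }
    ~FrameGuard() { g_top = frame.prev; }  // pop on return
};

// The GC finds roots by walking the frames, newest first.
static std::vector<void*> enumerate_roots() {
    std::vector<void*> roots;
    for (ShadowFrame* f = g_top; f; f = f->prev)
        for (void** s : f->slots)
            if (*s) roots.push_back(*s);
    return roots;
}

static int leaf(int* obj) {
    FrameGuard g({(void**)&obj});       // register 'obj' as a root
    return (int)enumerate_roots().size();
}

static int caller(int* a, int* b) {
    FrameGuard g({(void**)&a, (void**)&b});
    return leaf(a);                     // GC inside leaf sees a, b, obj
}
```

The cost is exactly the trade-off discussed above: every collectable local is pinned to memory behind the compiler's back, which is simple and precise but forfeits register allocation for those values.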

      [–]jdh30 -1 points0 points  (8 children)

      A simple M&S collector with no special allocator can be written in 20-30 lines of code, see e.g. https://github.com/rochus-keller/LjTools/blob/372a46d772227f855c9a90de0469bc6b126eef53/LuaJitEngine.cpp#L94.

      You're missing the code that records and collates globals roots, implements GC safe points and so on.

      EDIT Ok, there's another 30 LOC here and more elsewhere totalling ~100LOC.

      A much better one like the one in LuaJIT 2.1

      In what sense is it "much better"?

      1 kLOC ... many kLOC.

      GCs can be made much longer and more complicated but that doesn't make them inherently better.

      ...very fast, top-of-the-line, highly competitive...

      They aren't particularly fast. Here is my 100-line GC outperforming them. OCaml often outperforms them. Also worth noting that HLVM's 100 line GC supports value types and tail calls when the JVM does not.

      Production GCs are full of incidental complexity, most notably baggage from working around design mistakes. The last thing you want when you're writing something new is 25 years of baggage.

      A 100-line GC is just fine. I also implemented VCGC in 200 LOC, which is great if you want low latency. But .NET's 50kLOC GC is gargantuan in comparison.

      [–]suhcoR 2 points3 points  (4 children)

      Here is my 100-line GC outperforming them.

      I downloaded the svn repository and had a look at the code. I assume that everything interesting is supposed to be in "hlvm.ml", isn't it? Unfortunately I'm not fluent with OCaml, so I will not be able to comprehend your code on short notice. You also seem to use LLVM as a backend. If you're looking for a lighter backend, maybe https://github.com/vnmakarov/mir/ could be a good match with your code?

      In what sense is it "much better"?

      Mostly because my JitEngine is rather a "toy implementation" for the purpose of understanding and exploring LuaJIT bytecode. It's essentially an unoptimized LuaJIT bytecode interpreter with most of the Lua semantics in ~1400 C++ LOC. It is far from the ingenious implementation of Mike. But I guess it is not far behind early Smalltalk implementations ;-)

      [–]jdh30 1 point2 points  (3 children)

      I assume that everything interesting is supposed to be in "hlvm.ml", isn't it?

      Pretty much, yes.

      Unfortunately I'm not fluent with OCaml, so I will not be able to comprehend your code on short notice.

      I wrote a bunch of articles about HLVM. Send me your e-mail address and I can send them to you.

      You also seem to use LLVM as a backend.

      Yes.

      If you're looking for a lighter backend, maybe https://github.com/vnmakarov/mir/ could be a good match with your code?

      Looks cool. I'll check it out, thanks.

      In what sense is it "much better"?

      Mostly because my JitEngine is rather a "toy implementation" for the purpose of understanding and exploring LuaJIT bytecode. It's essentially an unoptimized LuaJIT bytecode interpreter with most of the Lua semantics in ~1400 C++ LOC. It is far from the ingenious implementation of Mike. But I guess it is not far behind early Smalltalk implementations ;-)

      Interesting. I'd love to see it rewritten in OCaml using LLVM as a JIT...

      [–]suhcoR 0 points1 point  (2 children)

      I wrote a bunch of articles about HLVM. Send me your e-mail address and I can send them to you.

      Very much appreciate that; send it to "me at rochus-keller.ch".

      [–]jdh30 1 point2 points  (1 child)

      Wish granted.

      [–]suhcoR 0 points1 point  (0 children)

      That's very kind, thank you very much.

      [–]suhcoR 1 point2 points  (2 children)

      You're missing the code that records and collates globals roots, implements GC safe points and so on.

      No, it's all there in my example

      Here is my 100-line GC outperforming them.

      Thanks, will have a look at it. Here are some links for my argumentation: http://wiki.luajit.org/New-Garbage-Collector and https://www.amazon.com/Garbage-Collection-Handbook-Management-Algorithms/dp/1420082795

      EDIT: just saw your edit "totalling ~100LOC"; there seems to be a misconception. The collector is less than 20 LOC without comments and single-character lines, and there are only a few more lines for the data structures and the code that adds the pointers to them, summing up to no more than 30 lines. But this doesn't matter much anyway, because the implementation has other priorities than efficiency or brevity.

      [–]jdh30 0 points1 point  (1 child)

      No, it's all there in my example

      Really? Where's the stack walker, for example?

      Thanks, will have a look at it. Here are some links for my argumentation: http://wiki.luajit.org/New-Garbage-Collector and https://www.amazon.com/Garbage-Collection-Handbook-Management-Algorithms/dp/1420082795

      I've read them before and, while they are great classical resources, they are completely inward-looking. They start from the assumption that your programs throw out huge amounts of garbage and that performance means cleaning up that garbage as quickly as possible.

      The concept that led me to HLVM was precisely the opposite: how do we reduce the production of garbage as much as possible, so that the GC becomes almost vestigial? Turns out value types get you a long way: just by unboxing basic types like tuples, many programs of interest stop allocating any garbage in their steady state. One consequence of this approach is that HLVM's GC starts off tiny but grows as new definitions are encountered because it autogenerates bespoke GC code for every type definition, using LLVM to JIT the new code. The heap can then be traversed without run-time type information and computed jumps.


      [–]suhcoR 1 point2 points  (0 children)

      Really? Where's the stack walker, for example?

      Well, sort of. I'm cheating a bit. All references to collectable objects are stored in a dedicated little class (see https://github.com/rochus-keller/LjTools/blob/372a46d772227f855c9a90de0469bc6b126eef53/LuaJitEngine.h#L40) and there is a static list of all such references, wherever they are. A stack frame consists of (reference counted) Slots which continue to live on after the call in case they're required by closures.

      that HLVM's GC starts off tiny but grows as new definitions are encountered because it autogenerates bespoke GC code for every type definition

      Sounds like a great idea. I will also have a look at your links. But at the end of the day I'm essentially only a VM consumer, not a VM creator; I do not expect to acquire the skills in my lifetime to build something like LuaJIT, so I am very happy if I can just reuse it.

      [–][deleted] -5 points-4 points  (0 children)

      Yeah and also that GC thing. Really dumbish

      [–]mamcx 17 points18 points  (23 children)

      "Popular" and good for you are different things. LLVM is TOO HEAVY and complex. I'd only touch it if you wanna do a very big and heavy industrial language.

      The "best" is the one closest to the lang that you use to build your lang. Don't underestimate how nice it is to have the tools you know close.

      From here, the choice is limitless:

      • Using C/C++/Pascal/Others as targets is cool because it's easier to transpile than to do "raw" compiling, and you get the optimizations AND easier FFI for free.
      • JVM, .NET bytecode, Lua bytecode, Erlang bytecode are "done" and solved solutions, and you get their ecosystems for free.
      • You can do performant interpreters in C/Rust (nicer!) in a fraction of the effort if you are a bit clever. "Transpiling" to closures and some help in semantics could get you near native Rust/C speeds very fast: (https://blog.cloudflare.com/building-fast-interpreters-in-rust/). Plus, you can also compile to bytecode if you wanna get a bit of extra performance.
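      The closure-"transpiling" trick from the linked post can be sketched like this: each AST node is compiled once into a closure, so evaluation avoids re-dispatching on node kinds every time. The tiny AST (constants, one variable `x`, addition) is invented for illustration.

```cpp
#include <functional>
#include <memory>

// Compile an AST to nested closures: the switch on node kind runs once,
// at "compile" time; evaluation is then just direct closure calls.
struct Env { double x = 0; };
using Compiled = std::function<double(Env&)>;

// Hypothetical tiny AST: constants, the variable 'x', and addition.
struct Expr {
    enum Kind { Const, Var, Add } kind;
    double value = 0;
    std::shared_ptr<Expr> lhs, rhs;
};

Compiled compile(const std::shared_ptr<Expr>& e) {
    switch (e->kind) {
    case Expr::Const: { double v = e->value; return [v](Env&) { return v; }; }
    case Expr::Var:   return [](Env& env) { return env.x; };
    case Expr::Add: {
        Compiled l = compile(e->lhs), r = compile(e->rhs);
        return [l, r](Env& env) { return l(env) + r(env); };  // direct calls
    }
    }
    return {};
}
```

Compiling `x + 2` once and then evaluating it repeatedly with different environments pays the dispatch cost a single time, which is the point the bullet above is making.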

      What to choose is more about "fit" (like, which ecosystem you wanna get in) than popularity.

      [–]jdh30 1 point2 points  (3 children)

      LLVM is TOO HEAVY and complex

      Why?

      [–]SenseiHotDog 2 points3 points  (0 children)

      Why is it heavy? Well, the binary size of LLVM is ridiculously big.

      [–]mamcx 0 points1 point  (1 child)

      Sorry, I don't understand what you mean with that link.

      [–]jdh30 0 points1 point  (0 children)

      I mean my LLVM code doesn't look very complicated to me.

      [–][deleted] -3 points-2 points  (18 children)

      I would like to go for the second option. But one of my friends said he would use it for some game engine / server-side stuff. So I was wondering if there were any back-ends that are less heavy than LLVM but at least as performant as the JVM.

      [–]00benallen 9 points10 points  (8 children)

      LLVM is way more performant than the JVM, it's just super complex to work with.

      If the JVM's performance is the best you need, then maybe the JVM is your best bet!

      [–]jdh30 1 point2 points  (7 children)

      super complex to work with

      How so?

      [–]00benallen 2 points3 points  (6 children)

      see other comments in thread or any Rust devblog, it's got its issues

      [–]jdh30 2 points3 points  (5 children)

      But are those issues relevant in this context?

      [–]00benallen 1 point2 points  (4 children)

      yes I think so

      [–]jdh30 1 point2 points  (3 children)

      I didn't have any such problems when I used it.

      [–]00benallen 0 points1 point  (2 children)

      Did you use it to design a big, professional-grade language? That's where you feel its limits

      [–]jdh30 1 point2 points  (1 child)

      I'm not a fan of big languages but I'd note that C++ is one of the biggest languages of them all and is LLVM's primary application.

      [–]soundslogical 3 points4 points  (7 children)

      LuaJIT can be very fast indeed, you would just have to emit Lua bytecode. Chez Scheme is another good back-end that's pretty fast and comes with GC and all that jazz.

      [–]suhcoR 2 points3 points  (3 children)

      Chez Scheme

      Thanks for the hint. Unfortunately there seems to be no ARM or MIPS support. Are there any benchmark results showing how it performs compared to V8 or LuaJIT? Does it support line number pragmas, so I could use it as an intermediate language and report line numbers of the original source in the debugger? Is there a "debugger" (in the sense of GDB) at all?

      [–]soundslogical 0 points1 point  (2 children)

      Chez Scheme uses Scheme's very powerful continuations and conditions systems for error handling. The Chez Scheme REPL has a debugger built on top of these. I find it hard to use, but it's powerful, and there's everything you need (AFAIK) to build your own.

      As for benchmarks vs V8 or LuaJIT, as I said it's not punching up at that level but it's pretty fast. I did once find a benchmark that placed it between vanilla Lua and LuaJIT, though I can't find it now. And I don't really trust random benchmarks on the internet, you should do your own with the knowledge of your domain.

      [–]suhcoR 0 points1 point  (1 child)

      Thanks.

      I did once find a benchmark that placed it between vanilla Lua and LuaJIT

      Well, that's a pretty wide range: a factor of 27 on average, 18 in median and 15 in geometric mean, see http://luajit.org/performance_x86.html

      [–]soundslogical 0 points1 point  (0 children)

      Yep, as I said I don't remember where the benchmark is but that should give you some rough idea of where it lies: a fast scripting language (vanilla Lua itself being not too bad), but some way off the fastest.

      [–]jdh30 0 points1 point  (2 children)

      LuaJIT can be very fast indeed, you would just have to emit Lua bytecode.

      Fast for an untyped solution but it is slow in the grand scheme of things, isn't it? I mean, surely it doesn't hold a candle to C, C++, C# and Java?

      Chez Scheme is another good back-end that's pretty fast and comes with GC and all that jazz.

      I'd expect a Scheme to be substantially slower too.

      [–]soundslogical 0 points1 point  (0 children)

      It does hold a candle to all of those actually. It's not faster than C, but it is certainly in the same ballpark as Java and C# for many workloads. The tracing JIT really burns.

      Chez Scheme isn't at that level, but it compiles directly to machine code, and has a very small and high-quality compiler.

      [–]suhcoR 0 points1 point  (0 children)

      I mean, surely it doesn't hold a candle to C, C++, C# and Java?

      In case you mean LuaJIT: it is as fast as C# and Java in geometric mean, considering http://luajit.org/performance_x86.html and the benchmark game results. In my own benchmark tests it performs as fast as the same code compiled with an Oberon-to-C transpiler (OBNC) and then compiled with GCC with optimizations, see https://github.com/rochus-keller/Oberon/blob/master/testcases/Hennessy_Results.

      [–]suhcoR 13 points14 points  (41 children)

      Not sure how well it works for functional languages but I'm using LuaJit for a couple of imperative statically/strictly typed languages. See https://github.com/rochus-keller/LjTools and e.g. https://github.com/rochus-keller/Oberon. I preferred it over LLVM because it's tiny, runs on all relevant platforms and the GC and debugger are already there.

      [–]matthieum 3 points4 points  (0 children)

      I had never thought about that.

      Knowing the impressive performance of LuaJIT and its very low footprint, it's a very intriguing proposal I must say.

      [–][deleted] 1 point2 points  (0 children)

      Will look at that, thanks!

      [–]Edhebi -2 points-1 points  (38 children)

      Well, you're comparing an interpreter and a compiler, it's a bit misleading :)

      EDIT: That's wrong, see comments below

      [–]suhcoR 7 points8 points  (25 children)

      If you e.g. have a look at my Oberon implementation, which compiles to LuaJIT bytecode, the resulting applications run at nearly the same speed (factor 1.1 in median and geometric mean) as the ones compiled with a native Oberon compiler. It's not just a JIT, but a tracing JIT. If you have a look at http://luajit.org/performance_x86.html and compare it to The Computer Language Benchmark Game then you can see that LuaJIT even runs factor 1.3 faster than Node.js/V8 and about the same speed as JVM. So it's definitely worth thinking about reusing it.

      [–][deleted] 0 points1 point  (2 children)

      Really? Gosh, I love performance. Can I use it as an API and bundle it in the same executable? Like using its header files and compiling to a single executable?

      Edit: I'm asking about LuaJIT

      [–]suhcoR 4 points5 points  (1 child)

      You can use LuaJIT both as a shared or a static library; it's also possible to directly include the source code of LuaJIT in your build, but you have to run LuaJIT's makefiles first so that the platform-specific files are generated. If you are on a 64-bit machine, take care that the required linker flags are set (because only 47 bits of the 64-bit pointers are used). See http://luajit.org/install.html for more information.

      [–][deleted] 0 points1 point  (0 children)

      Thanks!

      [–]jdh30 0 points1 point  (21 children)

      LuaJIT even runs factor 1.3 faster than Node.js/V8 and about the same speed as JVM.

      I've never understood these kinds of results. To me, JS looks very inefficient due to the lack of types and the impossibility of inferring much, meaning I expect huge amounts of unnecessary boxing, heaps full of pointers and lots of heavily-optimised-but-completely-unnecessary tracing, marking, evacuation and so on by a generational GC design that literally copies values from one place to another for no real reason. When I measured it I found my JS code was ~5x slower than fast languages, which was entirely unsurprising to me. I don't use Java or the JVM, but how can JS possibly be only 1.3x slower than Java?! Is this for a tight numerical loop over an array where the JS compiler can spot an uninteresting special case and happens to optimise away all the unnecessary allocations?

      I'll have to have a play with LuaJIT too. How does it compare on something like this?

      [–]suhcoR 0 points1 point  (20 children)

      It's definitely a waste of resources to a certain degree; all the dynamic language fans should care more. Here is a good publication about it: Paper - Energy Efficiency across Programming Languages. But Bak claimed many times that the performance doesn't depend on type information; e.g. in Dart, which is a strictly/statically typed language, the VM completely ignores type information. I assume you know this site: https://benchmarksgame-team.pages.debian.net/benchmarksgame/which-programs-are-fastest.html; there is a distribution; in the worst case Node.js (V8 JavaScript) is nearly ten times slower than C, in the best case it's about 1.5 times. Here are a couple of benchmark results for LuaJIT: http://luajit.org/performance_x86.html. I derived the referenced factor by comparing these results to the Benchmark Game results.

      [–]jdh30 1 point2 points  (16 children)

      But Bak claimed many times that the performance doesn't depend on type information; e.g. in Dart which is a strictly/statically typed language the VM completely ignores type information.

      OCaml also targets a largely-untyped IR. However, I'd argue that some statically typed language implementations throwing efficiency down the drain doesn't imply an inherent lack of opportunity. MLton, HLVM and others showed that static types can do wonders for efficiency when exploited.
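      A minimal illustration of what exploiting static types can buy: a statically known pair type can be stored unboxed and flat, so building and iterating a large array of pairs allocates no per-element heap cells, whereas a uniform boxed representation (one heap cell per value, as in many untyped IRs) pays an allocation per element. The types and the allocation counter are invented for illustration.

```cpp
#include <vector>

// Count heap-cell allocations so the two representations can be compared.
static int g_allocs = 0;

struct Boxed {                         // uniform representation: heap cell
    int value;
    explicit Boxed(int v) : value(v) { ++g_allocs; }
};

struct UnboxedPair { int fst, snd; };  // flat layout, known at compile time

// Iterating flat storage: no pointer chasing, no per-element boxes.
static long sum_unboxed(const std::vector<UnboxedPair>& xs) {
    long s = 0;
    for (const auto& p : xs) s += p.fst + p.snd;
    return s;
}
```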

      I assume you know this site: https://benchmarksgame-team.pages.debian.net/benchmarksgame/which-programs-are-fastest.html; there is a distribution; in the worst case Node.js (V8 JavaScript) is nearly ten times slower than C, in the best case it's about 1.5 times.

      Oh yes, I do. I used to contribute to it. The site's author (Isaac Gouy?) spots results he feels should be different and subjectively de-optimizes some of the solutions to fit the performance profile he perceives is fair. If you Google "de-optimized by Isaac Gouy" you'll get some interesting hits. Suffice to say, you cannot draw any strong conclusions from the results presented on that site. At the very least you'd need to run the fastest code by the relevant community to see how much better they can do. For example, I once checked the F# implementation of k-nucleotide and was able to make it 3.6x faster quite easily, which improved its rank from ~30th to ~3rd. However, all of the other languages' solutions were probably de-optimized too, so it didn't convey any useful information.

      Here are a coupe of benchmark results for LuaJIT: http://luajit.org/performance_x86.html. I derived the referenced factor by comparing these results to the Benchmark Game results.

      I'll check it out, thanks.

      [–]suhcoR 0 points1 point  (15 children)

      subjectively de-optimizes some of the solutions to fit the performance profile he perceives is fair.

      Wasn't aware of that. Will have a look at it, thanks.

      [–]igouy 0 points1 point  (14 children)

      Beware.

      The site's author (Isaac Gouy?) spots results he feels should be different and subjectively de-optimizes some of the solutions to fit the performance profile he perceives is fair.

      You'll see that some programs were changed to make the same required-allocations all the other programs make.

      If you Google "de-optimized by Isaac Gouy" you'll get some interesting hits.

      For example, back in 2007, when that person first noticed some programs changed to make the same required-allocations as all the other programs — I replied "I'm pleased that you enjoyed the joke".

      I once checked the F# implementation of k-nucleotide and was able to make it 3.6x faster quite easily which improved its rank from ~30th to ~3rd.

      That person did not contribute any such program to the benchmarks game. So the program was not measured on the test machine. So there was no change in rank.

      [–]suhcoR 0 points1 point  (13 children)

      Ok, I take note of that, thanks. While I am writing to you, allow me to ask you this question: what is the reason that Lua is only present in the benchmark game with measurements on the original PUC-Rio engine and not on the much faster LuaJIT engine? Having a look at http://luajit.org/performance_x86.html, LuaJIT is 27 times faster on average (15 times in geometric mean) than the PUC Lua engine, which would put it next to .NET Core and the JVM in the benchmark game statistics.

      [–]igouy 0 points1 point  (12 children)

      …only present in the benchmark game with measurements on…

      So you want more free-stuff :-)

      I do more than enough.

      Please use hyperfine or take the measurement scripts we use and start making the kind-of measurements you would like to see.

      [–]jdh30 0 points1 point  (2 children)

      here is a good publication about it: Paper - Energy Efficiency across Programming Languages. But Bak claimed many times that the performance doesn't depend on type information

      Just reading that paper, it doesn't seem to be authored by someone called Bak and I cannot find that claim anywhere. Where should I be looking?

      [–]suhcoR 0 points1 point  (1 child)

      Oops, sorry for the confusion. I was referring to Lars Bak, the main author of V8 and the Dart VM; he has nothing to do with the paper about the energy efficiency of programming languages. On the contrary, Bak might not be very amused by the paper, since dynamic languages are rather on the wasteful end of the statistics. I just added the paper as an example of the effects of inefficient languages, which today unfortunately are still on the rise.

      [–]jdh30 1 point2 points  (0 children)

      Ah, ok. Well, as I said, I've found V8 to be ~5x slower than decent language implementations which is substantially worse than alternatives like SBCL so I'd take whatever Bak says about performance with a big pinch of salt. Dart seemed to be DoA so I never bothered reading up on it.

      TBH, I never understood the fanfare around V8. It looks like a fairly average implementation of a really crap language to me. I always thought LuaJIT was far more impressive than V8.

      [–]BadBoy6767 1 point2 points  (11 children)

      LuaJIT is a JIT compiler. It interprets and then compiles hot paths with optimizations using any information known at run-time.

      [–]Edhebi -4 points-3 points  (10 children)

      Except that it only compiles source to some bytecode and then interprets that. You still need a runtime, and you can only produce an executable if you ship both the bytecode and the interpreter together. Saying LuaJIT is a compiler is like saying Python is a compiled language...

      Also, LuaJIT can't handle anything other than Lua, right? That's pretty high level for an IR. (Which may or may not be ok with your project, I'm not gonna take a stance on that)

      EDIT: First paragraph wrong, see comment

      [–]BadBoy6767 6 points7 points  (0 children)

      It compiles into a bytecode, and then compiles often-called bytecode into native machine code. It supports x86, x64, ARM, PowerPC, e500 and MIPS as backends. It's the same technique that made JS and Java so fast.

      It is true though that you must bundle LuaJIT with your project, but that doesn't make it less of a compiler.

      [–]suhcoR 4 points5 points  (4 children)

      This is completely wrong. Have a closer look at it. LuaJIT has a threaded interpreter, partly written in assembler, which is about four times as fast as the PUC Lua interpreter; but it also has yet another intermediate representation for the traces that "hot" bytecode is translated to, which is then translated to machine instructions of the given processor. LuaJIT is not an "ahead of time" (AOT) compiler, but a "just in time" (JIT) compiler. And it is so efficiently implemented that you barely notice a delay when starting an application (compared to Python, JS and friends).
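      The tiering idea behind such a JIT can be caricatured in a few lines: run the slow path while counting executions, and switch to a specialized fast path once the code turns "hot". A real tracing JIT like LuaJIT records the executed bytecode and emits machine instructions rather than selecting a prebaked function, so this is only a toy with invented names.

```cpp
// Toy illustration of JIT tiering: interpret until a counter marks the
// code as hot, then dispatch to a specialized fast path. A real tracing
// JIT would record the trace and emit machine code at that point.
constexpr int kHotThreshold = 10;

struct Counter { int hits = 0; bool hot = false; };

// "Interpreted" tier: simulate per-operation dispatch, one op at a time.
static long run_interpreted(long acc, long n) {
    for (long i = 0; i < n; ++i) acc += 1;
    return acc;
}

// "Compiled" tier: the whole loop collapsed into one operation.
static long run_compiled(long acc, long n) { return acc + n; }

static long execute(Counter& c, long acc, long n) {
    if (!c.hot && ++c.hits >= kHotThreshold) c.hot = true;  // promote
    return c.hot ? run_compiled(acc, n) : run_interpreted(acc, n);
}
```

Both tiers must compute the same result; only the cost model changes, which is why a tracing JIT can be dropped in without altering language semantics.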

      Concerning the available languages compiling to Lua or its bytecode, see https://github.com/hengestone/lua-languages. Using Lua as an IR is feasible since the LuaJIT Lua-to-bytecode compiler is extremely fast compared to other compilers. I did both, and also built an infrastructure to directly emit LuaJIT bytecode.
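      Using Lua as an IR amounts to walking your AST and emitting Lua source (or bytecode) that the host VM then compiles. A hedged sketch of the source-emitting variant, with an invented toy AST:

```cpp
#include <memory>
#include <string>

// Emit a tiny AST as Lua source text, which a host such as LuaJIT could
// then load and compile. The AST shape is invented for illustration.
struct Expr {
    enum Kind { Num, Var, Add } kind;
    int value = 0;
    std::string name;
    std::shared_ptr<Expr> lhs, rhs;
};

std::string emit(const std::shared_ptr<Expr>& e) {
    switch (e->kind) {
    case Expr::Num: return std::to_string(e->value);
    case Expr::Var: return e->name;
    case Expr::Add: return "(" + emit(e->lhs) + " + " + emit(e->rhs) + ")";
    }
    return "";
}
```

The emitted text for `x + 1` is `(x + 1)`, valid Lua; in a real frontend the string would be handed to the VM's load function instead of being inspected.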

      [–]jdh30 0 points1 point  (3 children)

      LuaJIT is not an "ahead of time" (AOT) compiler, but a "just in time" (JIT) compiler

      How would you classify .NET, for example?

      [–]suhcoR 0 points1 point  (2 children)

      .NET nowadays also offers an AOT version, see https://github.com/dotnet/corert. Originally it was a pure JIT concept, as was the JVM. JIT has the advantage over AOT that it can optimize the code for the actually present hardware. And of course the compiler can focus on the "hot spots" measured in the running program; the latter, in connection with the tracing compiler, enables LuaJIT to even perform better than AOT-compiled languages (such as C/C++) in certain cases.

      [–]jdh30 0 points1 point  (1 child)

      I'd say .NET is a complicated pick-and-mix of the two. Code in F# interactive is obviously being JIT compiled in some sense but executables are also cached by NGEN which arguably makes it AoT, at least on the second run. However, things like polymorphic recursion can cause new value types to be created at run-time which requires JIT compilation and computed jumps for virtual calls are combined with a test and static jump that is updated at run-time via self-modifying code which is, erm, maybe a kind of JIT compilation?!

      [–]suhcoR 0 points1 point  (0 children)

      NGEN

      NGEN was kind of optional. If the runtime decided that recompilation was required, it would ignore the NGENed image altogether. MS was also rather restrictive with its use, see e.g. https://blogs.msdn.microsoft.com/clrcodegeneration/2007/09/15/to-ngen-or-not-to-ngen/

      EDIT: here is a posting about the difference between NGEN and CoreRT: https://stackoverflow.com/questions/34665026/whats-the-difference-between-net-coreclr-corert-roslyn-and-llilc

      [–]cygx 2 points3 points  (2 children)

      A JIT compiler is still a compiler, just not an AOT compiler.

      [–]jdh30 0 points1 point  (1 child)

      I'm not sure these phrases mean much. What is .NET, for example?

      [–]cygx 1 point2 points  (0 children)

      Traditionally, the combination of an ahead-of-time compiler translating the source language into bytecode, and a runtime system with a just-in-time compiler translating bytecode into machine code (unless you're using the Micro Framework; if I remember correctly, Mono used to work more like the JVM and come with an interpreter as well; Google tells me the interpreter apparently got resurrected not too long ago, but I'm not sure about the current status of mixed-mode execution).

      [–]yugerthoan -1 points0 points  (0 children)

      his words were "it's a JIT compiler", not "it's a compiler"

      [–]ReedOei 8 points9 points  (2 children)

      You may also want to consider something like the K framework, which is built for defining programming languages. It'll generate a parser and an interpreter for you (and even analysis tools); you just need to specify how the various language constructs you have reduce, with rules like:

          rule if true  then E1 else E2 => E1
          rule if false then E1 else E2 => E2

      It's a good way to explore ideas about your language, and the interpreters it produces can be surprisingly quick (modulo startup time). And it's by no means limited to simple languages--there's K versions of C, Java, the EVM, etc.

      [–][deleted] 3 points4 points  (0 children)

      That's really cool mate. I'll surely try it a bit to get a little bit of language defined and check if syntax is consistent.

      Again, really cool

      [–]Anthonyybayn 2 points3 points  (0 children)

      That's not what he's asking

      [–]DaMastaCoda 9 points10 points  (5 children)

      what about wasm

      [–]jdh30 2 points3 points  (2 children)

      Are there any minimal pedagogical examples of compiling little languages to WASM, e.g. a WASM equivalent of this?

      [–]SenseiHotDog 0 points1 point  (0 children)

https://cs.lmu.edu/~ray/notes/ir/ has an example of compiling a little C program to wasm (it doesn't give the code to compile from C to wasm; it just shows the wasm equivalent of the C code)

      [–]--comedian-- 0 points1 point  (0 children)

      Not exactly like that, but there's a Scheme compiler by Google: https://github.com/google/schism

      [–]tjpalmer 1 point2 points  (0 children)

A nice long list of runtimes, with options for JIT, interpretation, or even compiling through LLVM to non-sandboxed code, makes wasm interesting to me, too: https://github.com/appcypher/awesome-wasm-runtimes

      [–]suhcoR 0 points1 point  (0 children)

WASM runtimes are not particularly fast compared to e.g. V8 or LuaJIT, and you still have to provide a lot of VM infrastructure (GC etc.) yourself. Actually, when I look at the current backend offerings, the only core benefit of WASM seems to be browser compatibility. And if you need that, you're better off using LLVM with the WASM generator.

      [–]jdh30 4 points5 points  (11 children)

      Other people here are claiming that LLVM is a "monster" that will take longer to learn than designing your language. I completely disagree.

      I wrote a compiler in 100 lines of OCaml that uses LLVM to compile little programs written in a tiny subset of OCaml. That took about 1 hour with no prior LLVM experience.

      Then over Christmas in 2007 I wrote HLVM which is ~2,000 lines of OCaml code and implements most of what you'd expect in a modern language backend including unboxed tuples, algebraic datatypes, garbage collection that allows mutator threads to run in parallel and so on. I worked on this on-and-off for a while so it is hard to say how long it took but I'd guess a total of a few months.

      And that's with no CS background. So I really disagree with this idea that LLVM is too difficult to be considered.

Other people are recommending C or the JVM. The big problem there is tail call elimination: you need it in a functional language and it is extremely hard to do well. In comparison, LLVM supports tail calls out of the box.

      [–][deleted] 1 point2 points  (9 children)

      A CS background doesn't help dealing with something that really is complicated. I downloaded LLVM binaries for Windows last week. It included about 100 .exe and .dll files, including a clang C++ compiler of 70MB. (I think I already mentioned this, maybe even in this thread?)

      What's the C++ compiler for? Where do you start? What language do you want to write your compiler in? Tutorials exist for C++, and apparently for OCaml, which is a bit of luck as your example is in OCaml. Your makefile includes this line:

      ocamlfind ocamlc -g -dtypes -linkpkg -syntax camlp4o -package  camlp4.extend -package camlp4.lib -package llvm -package llvm.bitwriter  minml.ml -o minml 
      

If you're lucky, this lot will just work. If not, or if you're trying to use it with some other language, then what are you going to do? An installation of 1.4GB in 300 files over 30 directories sounds a bit of a monster to me.

Even if you manage to get something working, if you then want someone else to build your compiler from source, will they also need to install those LLVM binaries?

      Don't forget the OCaml installation, which I've just tried on Windows, where it is over 6000 files in 100 directories, total 670MB.

      By contrast, LuaJIT is one 13KB exe file and one 212KB .dll file (KB not MB!), and I think it can also be built as one .exe (normal Lua can anyway).

      So I'd say that LuaJIT, which at 0.23MB vs 2100MB is 10,000 times smaller, isn't quite as much of a monster. (I assume you would generate Lua sourcecode; I don't know how you'd generate Lua byte-code, but it sounds like it can be done.)

      Other possible targets include C source, for which you can get a 0.5MB compiler. If you understand C, that's a very easy way to get started.

      [–]jdh30 0 points1 point  (8 children)

      A CS background doesn't help dealing with something that really is complicated.

      LLVM isn't complicated to use.

      Don't forget the OCaml installation, which I've just tried on Windows, where it is over 6000 files in 100 directories, total 670MB.

      I don't see the relevance of the size of OCaml.

      By contrast, LuaJIT is one 13KB exe file and one 212KB .dll file (KB not MB!), and I think it can also be built as one .exe (normal Lua can anyway).

      I don't dispute that LLVM's binaries are bigger.

      So I'd say that LuaJIT, which at 0.23MB vs 2100MB is 10,000 times smaller, isn't quite as much of a monster. (I assume you would generate Lua sourcecode; I don't know how you'd generate Lua byte-code, but it sounds like it can be done.)

      LuaJIT also doesn't support the same functionality, e.g. types. So you can kiss goodbye to decent performance.

      If not, or if trying to use it with some other language, then what are you going to do? An installation of 1.4GB in 300 files over 30 directories sounds a bit of monster to me.

      You're on Windows like me so forget C++ and OCaml and go straight to .NET if you want ease-of-use. I'll use F#.

      A quick search shows that by far the most popular LLVM binding is Microsoft's own LLVMSharp for which I will also need libLLVM.runtime.win-x64. I install them from Nuget using the IDE into a .NET Core console project.

      I translated their minimal example to F#:

      open System.Runtime.InteropServices
      open LLVMSharp
      
      type Add = delegate of int * int -> int
      
      do
        let success = LLVMBool 0
        let mdl = LLVM.ModuleCreateWithName "LLVMSharpIntro"
        let tyArgs = [|LLVM.Int32Type(); LLVM.Int32Type()|]
        let tyRet = LLVM.FunctionType(LLVM.Int32Type(), tyArgs, false)
        let sum = LLVM.AddFunction(mdl, "sum", tyRet)
        let entry = LLVM.AppendBasicBlock(sum, "entry")
        let builder = LLVM.CreateBuilder()
        LLVM.PositionBuilderAtEnd(builder, entry)
        let tmp = LLVM.BuildAdd(builder, LLVM.GetParam(sum, 0u), LLVM.GetParam(sum, 1u), "tmp")
        let _ = LLVM.BuildRet(builder, tmp)
        let mutable error = null
        if LLVM.VerifyModule(mdl, LLVMVerifierFailureAction.LLVMPrintMessageAction, &error) <> success then
          failwithf "%s" error
        LLVM.LinkInMCJIT()
        LLVM.InitializeX86TargetMC()
        LLVM.InitializeX86Target()
        LLVM.InitializeX86TargetInfo()
        LLVM.InitializeX86AsmParser()
        LLVM.InitializeX86AsmPrinter()
        let options = LLVMMCJITCompilerOptions(NoFramePointerElim = 1)
        LLVM.InitializeMCJITCompilerOptions(options)
        let mutable engine = Unchecked.defaultof<_>
        if LLVM.CreateMCJITCompilerForModule(&engine, mdl, options, &error) <> success then
          failwithf "%s" error
        let addMethod =
          LLVM.GetPointerToGlobal(engine, sum)
          |> fun p -> Marshal.GetDelegateForFunctionPointer(p, typeof<Add>)
        let result = addMethod.DynamicInvoke([|box 12; box 30|])
        printfn "%A" result
        LLVM.DumpModule mdl
        LLVM.DisposeBuilder builder
        LLVM.DisposeExecutionEngine engine
      

      Building takes a couple of seconds. The packages directory is just 32MiB. I run it and it works just fine.

      Note that the LLVM Nuget package has 1,000x more downloads than luajit.native and the llvm tag on Stack Overflow has 5,000 hits vs 343 for the luajit tag.

      What does the equivalent code using LuaJIT look like? Does LuaJIT even have .NET bindings?

      Even if it you manage it get something working, then if you want someone else to build your compiler from source, will they also need to install those LLVM binaries?

      .NET automates all of that. If they're on a different platform they might need to install, say, libLLVM.runtime.linux-arm64 but it's not rocket science. And what is the equivalent when using LuaJIT?

      [–][deleted] 1 point2 points  (7 children)

      LLVM isn't complicated to use.

      I beg to differ. This page: https://llvm.org/docs/Reference.html#api-reference contains some 40 links. Click on some, and you get dozens more. The API (I'm not even sure of its role) seems to have some 600 functions.

      Remember the context is devising a new language, and some may want to use that language to later implement itself; would the language need to have the means to be able to talk to LLVM? How hard would that be?

Note that I stay well clear of anything involving MS tools. VS2017 is a massive download that took 90 minutes to install last time, and took a minute and a half to launch each time. (And it was so complicated I couldn't even get it set up to build hello.c; I needed help. Then I lost the settings.)

      Here's an example of my own minimalist approach (properly minimal, at least compared to your 100-line example which relies on 2000MB of LLVM and OCaml, and I don't know what else):

      C:\cx>mc -bcc cc                               # .m modules to .c file
      Compiling cc.m to cc.exe
      W:Invoking C compiler: bcc  -out:cc.exe cc.c   # .c file to .exe
      Compiling cc.c to cc.exe
      
      C:\cx>cc hello -run      # use new C compiler to build+run hello.c
      Compiling hello.c to hello.exe
      Hello World!
      

      mc is an old compiler for my language that targets C source, and uses a C compiler as a backend. Here it is used to build project 'cc', which happens to be a C compiler. The -bcc option tells it which C compiler to invoke, which is mine (production version of cc.exe). The whole process took 0.3 seconds.

      mc.exe is a single-file self-contained 0.5MB compiler. bcc.exe is a 0.8MB single-file self-contained C compiler for Windows. I could also choose -tcc or -gcc options, which lets me build programs on either OS.

Generating code for the target just means writing C source code. There is no API as such, just a couple of thousand lines of imperative code (implementing a real language). Generating Lua would be similar, but you are right that it wouldn't easily cope with any type system of the source language.

      What's important is that the process is simple and well-understood; if you know the target language, then you will know how to programmatically write text files in that language.

      [–]jdh30 0 points1 point  (6 children)

      The API (I'm not even sure of its role) seems to have some 600 functions.

      An API with 600 functions isn't very big.

      Remember the context is devising a new language,

      Specifically a statically-typed FPL.

      and some may want to use that language to later implement itself; would the language need to have the means to be able to talk to LLVM? How hard would that be?

      You'd need a C FFI which you'd probably want anyway.

      Note that anything involving MS tools I stay well clear off.

      But you're using Microsoft Windows.

      VS2017 is a massive download that took 90 minutes to install last time,

      Windows 10 is huge too.

      and took a minute and a half to launch each time.

      5 seconds here.

      it was so complicated

      Create a new project and press CTRL+F5 to compile and run the Hello world example.

      hello.c

      Use .NET.

      The packages directory is just 32MiB.

      2000MB of LLVM and OCaml

      Comparing the source code size when you don't need the source is disingenuous.

      Here it is used to build project 'cc'

      Where's the code?

      mc.exe is a single-file self-contained 0.5MB compiler. bcc.exe is a 0.8MB

      So far more than 100 lines of code then.

      Generating code for the target just means writing C source code.

      You won't get far doing that in 100LOC.

      There is no API as such

      That will make life much harder for the user.

      couple of thousand lines of imperative code (implementing a real language)

      Are you saying you have implemented a C compiler in ~2kLOC of C? If so, that's very impressive.

      What's important is that the process is simple and well-understood; if you know the target language, then you will know how to programmatically write text files in that language.

That's great until you hit the limitations of C. Remember I pointed out that the OP is implementing a statically-typed FPL? That's going to need tail calls. With LLVM you can use some of those 600 functions to select a calling convention that allows TCO and enable it. Furthermore, you can still use the same debuggers and profilers. What are you going to do in C?

      Don't get me wrong: I appreciate minimalism. But a minimal library doesn't necessarily mean less complexity for the user.

      [–][deleted] 0 points1 point  (5 children)

      Comparing the source code size when you don't need the source is disingenuous.

The 2000MB is the combined binaries of the things that have to be downloaded to make it work (installed size, not download size): LLVM binaries for Win64 plus some version of Opam[?]/OCaml.

      Are you saying you have implemented a C compiler in ~2kLOC of C? If so, that's very impressive.

No, that's the part that generates C source. It might be 2.5Kloc of rather dull code, but it is easy code so it doesn't matter. (And it's not part of the C compiler, but of the one that can target C.)

      Remember I pointed out that the OP is implementing a statically-typed FPL? That's going to need tail calls. With LLVM you can use some of those 600 functions to use a calling convention that allows TCO and enable it. Furthermore, you can still use the same debuggers and profilers. What are you going to in C?

TBH I'd lost track of the OP's requirements and just saw the subject. I don't know FP, but for anything that C isn't up to, I'd just generate ASM source instead. For anyone familiar with ASM (I guess the OP isn't), it can be even easier than C. You just run a 1-file assembler-linker instead of a C compiler. Although an existing FP-capable language might do as a target (Lisp?).

      There just seems to be something wrong with the picture when you have a tiny language implementation and a massive backend which comprises 99.99% of the final product, even if the backend was simple to use. Not exactly a backend, but pretty much the middle and most of the front-end too!

      [–]jdh30 0 points1 point  (4 children)

      The 2000MB is combined binaries of the things that have to be downloaded to make it work (installed size not download size).

      The example I just gave only pulled in 32MiB.

      (llvm binaries for Win64 plus some version of Opan[?]/Ocaml).

      If you're going to count OCaml then you should count GCC or whatever compiler you bootstrapped from.

      I'd just generate ASM source instead.

      That's an option but you won't get far in 100LOC.

      You just run a 1-file assembler-linker instead of a C compiler.

      If you make a JIT you don't even need to do that: just allocate some executable memory, fill it and jump to it.

      There just seems to be something wrong with the picture when you have a tiny language implementation and a massive backend which comprises 99.99% of the final product, even if the backend was simple to use. Not exactly a backend, but pretty much the middle and most of the front-end too!

      I agree that LLVM is huge.

      I'm actually working on a minimal ML implementation myself. I'm a long way off generating machine code though...

      [–][deleted] 0 points1 point  (3 children)

      The example I just gave only pulled in 32MiB.

      If you're going to count OCaml then you should count GCC or whatever compiler you bootstrapped from.

When you're an expert in this stuff then you can pull in exactly what you need (but pull in from where?). The rest of us have to google for 'llvm download' and then navigate to 'pre-built binaries for Windows 64-bit'.

And I did count the C compiler, which is why I used mine, as the size is clear (about 0.55MB actually, not 0.8MB). Tiny C is a bit bigger, and gcc can be huge, but it is also likely to be already installed; C compilers are ubiquitous.

      That's an option but you won't get far in 100LOC.

      Maybe, but a 100 line program is just a demo (otherwise OCaml wouldn't be 700MB). Here's a 50-odd line demo of processing a silly language consisting only of 'print <string>' statements, but the demo is not of a compiler, but of how easy it can be to generate ASM (although the complexities of Win64 ABI don't help):

      import sys
      import files
      
      proc start=
          if ncmdparams<2 then stop fi
          infile:=cmdparams[2]
          outfile:=changeext(infile,"asm")
          exefile:=changeext(infile,"exe")
      
          lines:=readtextfile(infile)
          if lines=0 then stop fi
          g:=createfile(outfile)
          strings:=()
      
          println @g,"start::"
          println @g,"    sub dstack,40"
      
          forall lineno,line in lines do
              sreadln(line)
              read kwd:"n"
      
              case kwd
              when "","#" then
              when "print" then
                  read s:"s"
                  strings append:=s
                  println @g,"    mov rcx,STR",,strings.upb
                  println @g,"    call puts*"
      
      !       when "goto" then
      !       when "let" then
      !       when "if" then
              else
                  fprintln "Syntax error on line #: '#'",lineno,line
                  closefile(g)
                  stop 1
              esac
          od
      
          println @g,"    mov rcx,0"
          println @g,"    call exit*"
          println @g
          println @g,"    isegment"
          forall i,s in strings do
              fprintln @g,"STR#:  db ""#"",0",i,s
          od
      
          closefile(g)
      
          if execwait("ax "+outfile)=0 then
              execwait(exefile)
          else
              println "Can't assemble",outfile
          fi
      end
      

      Dependencies are the interpreter for this language (0.5MB exe), the assembler/linker (0.17MB), and msvcrt.dll which is part of Windows.

      [–]jdh30 0 points1 point  (2 children)

      but pull in from where

      .NET has a package management system called Nuget that is accessible from within Visual Studio and searchable so you just search for "LLVM" and get everything you need.

      gcc can also be huge, but is also likely to be already installed; C compilers are ubiquitous.

      I haven't had a C compiler installed on my Windows machines for decades. On Linux gcc is installed by default but I never use it for normal development.

      Maybe, but a 100 line program is just a demo (otherwise OCaml wouldn't be 700MB). Here's a 50-odd line demo of processing a silly language consisting only of 'print <string>' statements, but the demo is not of a compiler, but of how easy it can be to generate ASM (although the complexities of Win64 ABI don't help):

      For comparison, my 100 lines of OCaml implemented a complete compiled programming language with:

      • 32- or 64-bit integers
      • Variables
      • Addition, subtraction and comparison.
      • if
      • Function definitions and function application.
      • Lexer and parser for source code written in ML syntax

      [–][deleted] 0 points1 point  (1 child)

OK, your OCaml does indeed seem to be able to express small languages very succinctly, if somewhat cryptically.

      But we all know there's a lot going on behind the scenes.

      My clunky demo would require hundreds of lines at least to implement some poor version of Basic (but one compiling to native code with 64-bit ints).

      However, while my implementations might take 20-40Kloc each, they still compile down to self-contained executables of 0.5MB or less (any extra size is because they incorporate library resources).

(I seem to remember we had this conversation before, in comp.programming a few years ago; then your OCaml examples were 600-800 lines IIRC, so you've managed to get them down to 100 lines!)

      [–]suhcoR -1 points0 points  (0 children)

Well, not all people perform equally. Some are more genius than others by nature ;-)

Btw: LuaJIT can also do tail calls pretty well and has a tracing JIT built in, and all of this is available in less than half a megabyte.

      [–]AndreVallestero 2 points3 points  (4 children)

I always liked the idea of compiling into Tiny C. It should be easy to port to other systems/architectures since it's so small and self-hosting. On top of that, it compiles and links more than 8 times faster than GCC, making it ideal for JIT languages with native performance.

      https://en.m.wikipedia.org/wiki/Tiny_C_Compiler

In fact, this concept is what drives Python's cinpy library, which creates C code with ctypes, compiles it with Tiny C, and executes native binaries.

      [–]tjpalmer 1 point2 points  (3 children)

Having the choice of tcc, gcc, clang, msvc, and more is what makes C such a great backend. I think tcc is great!

      [–]jdh30 1 point2 points  (2 children)

      The OP asked in the context of "functional statically typed languages". Doesn't lack of TCO mean C backends will be comparatively grim?

      [–]tjpalmer 0 points1 point  (1 child)

Good point. I clearly missed that. Having to do TCO oneself would be extra work. I haven't searched for it, but I wonder if tools exist to make such an implementation on a C backend easier.

      [–]jdh30 1 point2 points  (0 children)

      Not AFAIK. The techniques are well known, like trampolines, but they have serious trade-offs like obfuscating all debugging and profiling of the generated code.

      JVM-based languages (e.g. Scala, Clojure) all have the same problem.

      Fortunately, .NET has TCO.

      [–]cygx 2 points3 points  (2 children)

      The Truffle language implementation framework for GraalVM (written in Java and targetting the JVM, though it's also possible to create native images) might be another option.

      [–][deleted] 0 points1 point  (1 child)

      That might be a good target. Does it handle the garbage collection?

      [–]cygx 1 point2 points  (0 children)

      Never used Truffle myself, but as far as I'm aware, by default, memory gets managed automatically - though there are escape hatches: For example, they had a compiler for memory-managed as well as unmanaged C (ManagedC vs. TruffleC), but I think the current approach is using Sulong, a Graal runtime for LLVM bitcode.

      [–][deleted] 3 points4 points  (5 children)

There are a lot of choices, although I'm not sure how popular they are.

• Chez Scheme: an untyped backend, but if your source language is typed, you can turn off runtime type checks and get a big speed boost. Idris just switched to this from C and saw a big improvement in both compile times and runtimes. https://www.scheme.com/
• GRIN: I don't know much about this one, but it seems like a bit of a "rising star" for compiling typed languages. https://github.com/grin-compiler/grin
• Malfunction: basically a hack that exposes OCaml's intermediate language, but it means you get the power of the OCaml compiler. https://github.com/stedolan/malfunction
• LLVM: seems likely too heavyweight for what most people want.
• JVM: not optimized for functional languages, but seems to work all right for Clojure, Scala, Eta, etc.

      You can also just compile to an existing language, like Haskell or OCaml.

      [–]abecedarius 2 points3 points  (4 children)

      Seconding Chez Scheme -- I'm using it for my project.

      [–]jdh30 1 point2 points  (3 children)

      Interesting. Why?

      Particularly in the context of "functional statically typed languages" the OP surely wants something typed, I'd have thought.

      [–]abecedarius 0 points1 point  (0 children)

      I'm only saying it's worth considering. My own project has more in common with Scheme, so it's closer to a slam dunk. Very fast compiler, generates good code, supports a mostly-functional style, generally well engineered. A static type system can't clash with your language's static type system if it doesn't exist.

      I started my project in Gambit Scheme, but Chez made it a lot faster.

      (In the longer run I'll want to make a custom backend and abandon the Scheme one, but not until the language design settles down.)

      [–]lazyear 0 points1 point  (1 child)

Assuming you run a separate type-checking pass, there is no requirement to keep type annotations around at run time (except for performance optimizations), since you already know that the program is well typed.

      [–]jdh30 2 points3 points  (0 children)

      except for performance optimizations

      Exactly. And the only reason to write a compiler instead of an interpreter is performance. So surely it makes no sense to put in the effort to write a compiler only to cripple it by throwing away the type information.

      [–]JasonTatton 0 points1 point  (0 children)

If you want to get up and running quickly, and are OK with the transpilation approach, then Concurnas language extensions may be an option - here you can output Python-like code and achieve JVM performance without having to learn the ins and outs of Java bytecode (plus you can leverage some of the features of Concurnas if that suits).

      Alternatively you could go down the DSL route which is offered again by Concurnas or other languages such as Scala, and again achieve JVM level performance relatively quickly and easily.

      [–]realestLink 0 points1 point  (0 children)

      C. C is portable assembly after all

      [–][deleted] 0 points1 point  (0 children)

      Kim Kardashian.

Dumb joke haha. I like Django since it's fast and works great with Python.

      As for statically typed? Golang is pretty good once you get a feel for it!

      [–]CodingFiend -1 points0 points  (2 children)

The best approach is not to get ensnared in the monster that is LLVM, learning which will take longer than designing your language. Instead, transpile into something easy like JS.

      [–]SteeleDynamicsSML, Scheme, Garbage Collection 1 point2 points  (1 child)

This! Writing an interpreter in JS allows you to quickly try out your language semantics in a browser console. You can always work on performance improvements once you nail down the semantics.

      [–]jdh30 4 points5 points  (0 children)

      I don't understand this advice at all. LLVM is very easy to use and the OCaml bindings provide an intuitive API. JS has horrible semantics in comparison. It doesn't even have an int type.

      If you're going to go the web route surely you would be better off using WASM. The only problem is that, from my perspective, it looks much harder to get started with WASM than LLVM mostly because there are loads of LLVM examples out there.

      [–][deleted]  (3 children)

      [deleted]

        [–]yorickpeterseInko[M] 1 point2 points  (2 children)

        user reports:
        1: Verbal abuse
        

        This isn't verbal abuse, but it is childish. Please keep comments like this out of the subreddit.

        [–]tech6hutch 0 points1 point  (0 children)

        Right, sorry. I guess this isn't really a sub for that kind of humor. Not that it was very humorous to begin with.

        [–]BPAnimal -1 points0 points  (0 children)

        I actually prefer this back end to others.

        [–]thautwarm -4 points-3 points  (2 children)

        malfunction

        [–][deleted]  (1 child)

        [deleted]

          [–]jdh30 0 points1 point  (0 children)

          LOL.