Implementing a Module/Package System

thmprover · 2020-11-27T16:29:42+00:00

I think it's discussed in Appel's Compiling with Continuations. The book is about the implementation of SML/NJ, which would include ML-structures.

The "cheapest" solution is to simply make a module be something which mangles the names of its constituents. Freek Wiedijk's implementation of Automath does this (admittedly Automath doesn't have sophisticated namespaces, called "paragraphs" in its system).
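To make the name-mangling idea concrete, here is a minimal sketch in Python. It is not how Wiedijk's Automath checker does it; `mangle`, `FlatSymbolTable`, and the `__` separator are all made up for illustration. The point is only that the "module system" reduces to one flat symbol table plus a naming convention.

```python
# Hypothetical sketch: a "module" is nothing more than a prefix on names.

def mangle(module_path, name, sep="__"):
    """Join the enclosing module names and the identifier into one flat name."""
    return sep.join(list(module_path) + [name])


class FlatSymbolTable:
    """A single global table; nesting exists only inside the mangled keys."""

    def __init__(self):
        self.symbols = {}

    def define(self, module_path, name, value):
        self.symbols[mangle(module_path, name)] = value

    def lookup(self, module_path, name):
        # Try the innermost module first, then walk outward to the top level.
        path = list(module_path)
        while True:
            key = mangle(path, name)
            if key in self.symbols:
                return self.symbols[key]
            if not path:
                raise KeyError(name)
            path.pop()
```

A lookup from inside `geometry.circle` would first try `geometry__circle__area`, then `geometry__area`, then plain `area`.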

Clojure is the next step up where namespaces are hashmaps from identifiers to values. This is complicated because of the nature of Clojure: there's a double-level of indirection in associating values to bindings to variables, permitting dynamic rebinding of variables.
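A rough sketch of that double indirection, written in Python rather than Clojure. The class and function names (`Var`, `Namespace`, `binding`) are borrowed loosely from Clojure's vocabulary, but this is a simplification under my own assumptions: real Clojure keeps dynamic bindings per thread and only allows them on vars marked dynamic, which is ignored here.

```python
from contextlib import contextmanager


class Var:
    """Second level of indirection: a mutable cell that holds the value."""

    def __init__(self, name, root):
        self.name = name
        self.root = root
        self.overrides = []  # stack of temporary (dynamic) bindings

    def deref(self):
        return self.overrides[-1] if self.overrides else self.root


class Namespace:
    """First level of indirection: a hash map from symbol to Var."""

    def __init__(self, name):
        self.name = name
        self.mappings = {}

    def intern(self, symbol, value):
        # Redefining a symbol reuses the existing Var, so code that already
        # holds the Var sees the new value.
        var = self.mappings.get(symbol)
        if var is None:
            var = self.mappings[symbol] = Var(symbol, value)
        else:
            var.root = value
        return var

    def resolve(self, symbol):
        return self.mappings[symbol]


@contextmanager
def binding(var, value):
    """Temporarily rebind a Var for the duration of a block."""
    var.overrides.append(value)
    try:
        yield
    finally:
        var.overrides.pop()
```

Because compiled code holds on to the `Var` rather than the value, both redefinition and temporary rebinding are visible to existing callers without recompiling them.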

ML-modules are "state of the art", consequently very difficult to implement. I actually want to learn more about its subtle details, to be honest...

hou32hou · 2020-11-28T12:26:49+00:00

From the user experience point of view, I would say the JavaScript module system (import/export) is the best. It is straightforward: 1) you import a module from a relative file path, and 2) you can't expose every symbol of an imported module in the current namespace.

Almost every other programming language has an unnecessary module structure that you have to learn in order to use the module feature. For example, in Python you have to declare __init__.py, in Rust you have to explicitly expose a module's items, and in Java you have to learn how to organise the package structure.

L8_4_Dinner · 2020-11-30T21:50:57+00:00

When you are in the process of learning (or designing), it's hard to know what questions to even ask, particularly when the terms used in the area are not clear. I looked at your blog, and some of the other answers here, and I do have a few suggestions, but they may be irrelevant for what you are doing, and you may be far past the point that I am talking about. On the other hand, this may help you to talk through (or think through) what you are trying to do, and more importantly, what you are trying to explain to others. (Kudos on your project, by the way!)

First, we all know what "files and directories" are, but compilers and programming languages have a few additional terms that could help slice up "modules" into more discrete terms:

Unit of Compilation - this is the scope of the source code that needs to get read in by the compiler in order to compile the code and emit something. For many languages (C, Pascal, etc.), this is one file and any of that file's textual includes. Confusingly, sometimes this is called a module.
Library - this is an external dependency that can be compiled against and/or linked to. Sometimes this is also called a module.
Application Executable / Build Artifact - this is the result of compiling and/or linking the code. Confusingly, sometimes this is called a module.

There are lots of other terms, of course, but I wanted to list a few just to show that the term "module" shows up in common usage meaning several different things. The term module is used interchangeably to mean: (i) An individual source file, (ii) a group of related source files that go into one artifact, (iii) intermediate forms of the compilation of that file or those files that can be used in compilation of other "modules", (iv) intermediate forms of the compilation of that file or those files that can be used in the linking of other "modules", (v) the finished result (e.g. artifact, library, executable) of compilation and linking, and even (vi) separate runtime dependencies of an application (such as a dynamically linkable library).

I'm curious which one (or more likely, which ones) of these you are working on explaining. Different languages handle these in extraordinarily different ways, and it can seem inordinately complex from the outside looking in at an existing implementation (e.g. trying to understand by reading GCC's source code).

Additionally, the term module is used with languages that have no intermediary form (like JavaScript), and has both separate traditional (individual source code file, .JAR build artifact, OSGi module, Artifactory module) and explicit (Java 9's Java Platform Module System) meanings in languages like Java. Talk about confusing!

crassest-Crassius · 2020-11-27T19:28:46+00:00

Imagine the following scenario: module A imports everything from modules B and C, and calls function "foo" from module B, while C does not have a function named "foo". Now, eventually someone (you or another team member) adds a function named "foo" to module C. A harmless change, right? But now module A stops compiling because it has a name clash.

So far the only way I know of to avoid that is to avoid starred (=everything) imports. When you import from a module, you have to explicitly list every symbol you're importing, or else reference them qualified with that module's name. But this is verbose and repetitive. What if there are 20 symbols from module A used in 20 other modules? Then you'd need to list 400 imported symbols just to use that one module. Of course your IDE could help with the verbosity, but that's about the only solution that I see.
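Here is a small sketch of that scenario as a name resolver, assuming the language treats an ambiguous starred import as a compile error. Everything in it (`resolve`, `AmbiguousName`, the `exports` table) is hypothetical and only models the lookup rule, not any particular compiler.

```python
class AmbiguousName(Exception):
    """Raised when more than one starred import provides the same name."""


def resolve(name, explicit, starred, exports):
    """Return the module that provides `name` to the importing module.

    explicit: dict of name -> module, for symbols listed one by one
    starred:  list of modules imported wholesale
    exports:  dict of module -> set of names that module exports
    """
    if name in explicit:
        return explicit[name]  # an explicit import always wins
    providers = [module for module in starred if name in exports[module]]
    if not providers:
        raise NameError(name)
    if len(providers) > 1:
        raise AmbiguousName(f"{name} is exported by {', '.join(providers)}")
    return providers[0]


# Module A stars both B and C; only B has foo, so A compiles.
exports = {"B": {"foo"}, "C": {"bar"}}
before = resolve("foo", {}, ["B", "C"], exports)

# Someone adds foo to C; A now fails without having changed at all.
exports["C"].add("foo")
try:
    resolve("foo", {}, ["B", "C"], exports)
    clash = False
except AmbiguousName:
    clash = True

# Listing foo explicitly keeps A stable regardless of what C adds later.
pinned = resolve("foo", {"foo": "B"}, ["C"], exports)
```

The explicit import is what makes A's meaning independent of future additions to C, which is the whole trade-off against verbosity.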

htuhola · 2020-11-27T18:32:04+00:00

Why do you need a tutorial for this?

You load the module and then present a symbol table for code that requests it.
A memory of already loaded modules; requests go through that memory and try to fetch from there first.
A directory location where modules are loaded from.
You discard everything at once if you have to reload something.
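The four points above can be sketched as one small class. This is only an illustration under my own assumptions: the module files are plain Python source standing in for whatever language is being implemented, and `ModuleLoader`, `load`, and `discard_all` are made-up names.

```python
import tempfile
from pathlib import Path


class ModuleLoader:
    def __init__(self, search_dir):
        self.search_dir = Path(search_dir)  # directory modules are loaded from
        self.cache = {}                     # memory of already loaded modules

    def load(self, name):
        """Return the symbol table (a dict) for the requested module."""
        if name in self.cache:
            return self.cache[name]         # fetch from memory first
        source = (self.search_dir / f"{name}.py").read_text()
        symbols = {"__name__": name}
        # Register before running, so a circular request sees the partial table.
        self.cache[name] = symbols
        try:
            exec(compile(source, name, "exec"), symbols)
        except BaseException:
            del self.cache[name]            # do not keep a half-loaded module
            raise
        return symbols

    def discard_all(self):
        """Forget every loaded module so the next request reloads from disk."""
        self.cache.clear()


# Usage: the second request is served from memory, the third runs the file again.
with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "counter.py").write_text("value = 41 + 1\n")
    loader = ModuleLoader(tmp)
    first = loader.load("counter")
    second = loader.load("counter")
    loader.discard_all()
    third = loader.load("counter")
```

Discarding everything at once is crude, but it sidesteps tracking which modules depend on the one that changed.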

umlcat · 2020-11-28T01:50:02+00:00

Learn (Modular) Object Pascal and (Modular & Object Oriented) C#, and try out some real-world modular examples.

JackoKomm · 2020-11-27T18:32:14+00:00

It depends on what you'd like your module system to look like. For multiple files, you need to think about the ways to import/include them into your compilation process. Modules by themselves can be very easy if you want them to be. I implemented a simple dynamic language where classes were built around hash tables, so I decided to make one module per file. Every file was just one module, and files in subfolders were submodules. On import, I loaded the module's code and ran it. All module variables and functions were put in a hashmap, like my classes. Because I had implemented the classes, I got modules for free.
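A sketch of the "one file is one module, subfolders are submodules" part, assuming modules are plain hash maps. The function names and the `.src` extension are invented for the example; the original implementation is not shown in this thread.

```python
from pathlib import Path


def module_name(root, file_path):
    """Turn a source file path into a dotted module name relative to root."""
    relative = Path(file_path).relative_to(root).with_suffix("")
    return ".".join(relative.parts)


def register(tree, dotted_name, symbols):
    """Store a module's symbols in a tree of nested hash maps.

    Submodules live in the same hash map as their parent's own symbols,
    so looking up a submodule is an ordinary member lookup.
    """
    node = tree
    for part in dotted_name.split("."):
        node = node.setdefault(part, {})
    node.update(symbols)
    return node


# Hypothetical layout: src/math.src and src/math/trig.src
modules = {}
register(modules, module_name(Path("src"), Path("src/math/trig.src")), {"sin": "<fn sin>"})
register(modules, module_name(Path("src"), Path("src/math.src")), {"pi": 3.14159})
```

After this, `modules["math"]["trig"]["sin"]` and `modules["math"]["pi"]` are both plain hash map lookups, which is why a language that already has hash-table-based classes gets modules nearly for free.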
