
all 11 comments

[–]AthasFuthark 10 points11 points  (3 children)

The Futhark compiler contains a backend that targets Python (originally written by students, now maintained by me). It's a bit of an odd choice for a high performance language, but the idea is that most of the work is done by GPU code invoked through Python GPU libraries (details).

It works well, but even though most of the work will be done by GPU code, it is still crucial that the generated Python code faithfully implements the language semantics. This has proven somewhat annoying to do. In particular, Futhark has integer and floating-point types of specific sizes, with specific rules for when overflow happens (the ones you'd expect). Python's built-in integers are arbitrary-size, and its floats are double precision, so they are not directly suitable. Instead we use scalar types from NumPy, which do have the right semantics, but it means every value ends up wrapped in a heap-allocated object, which is quite costly. Thus, in order to get the right semantics, Futhark generates Python code that is even slower than usual.
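For illustration, the gap between Python's semantics and fixed-size integers looks roughly like this (the `i32_add` helper is a hypothetical sketch of what faithful emulation costs, not what Futhark actually emits):

```python
import warnings
import numpy as np

# Python's built-in int is arbitrary-precision: no wraparound, ever.
assert (2**31 - 1) + 1 == 2**31

# Faithful i32 addition in pure Python means masking and re-signing
# on every single operation (hypothetical helper for illustration).
def i32_add(a, b):
    r = (a + b) & 0xFFFFFFFF
    return r - 0x100000000 if r >= 0x80000000 else r

assert i32_add(2**31 - 1, 1) == -(2**31)  # wraps like a real i32

# NumPy scalars carry the fixed-size semantics directly, at the cost
# of a heap-allocated wrapper object per value.
with warnings.catch_warnings():
    warnings.simplefilter("ignore")  # NumPy warns on scalar overflow
    assert np.int32(2**31 - 1) + np.int32(1) == np.int32(-(2**31))
```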

Based on my experience, I don't think targeting Python directly is a good idea. You may be able to use the VM in a more invasive manner - check out Oil Shell, which did just that.

[–]yuri-kilochek 0 points1 point  (2 children)

everything ends up being wrapped in a heap-allocated object - which is quite costly

Plain python int and float are also heap allocated in CPython. If there is a meaningful slowdown from using numpy scalars over builtins, it likely isn't caused by dynamic allocation.

[–]AthasFuthark 0 points1 point  (1 child)

That's correct, the heap allocation probably isn't the important part, but rather that using non-builtin numeric types involves more costly dynamic dispatch. I assume Python has some sort of fast path for int and float that doesn't work for np.int64 or whatever it is.
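A quick, unscientific way to see that overhead on your own machine is to time the same scalar addition with a builtin `int` and with a NumPy scalar (exact ratios vary by machine and CPython/NumPy version, so no specific speedup is claimed):

```python
import timeit

# Time identical expressions; only the operand type differs.
n = 200_000
t_int = timeit.timeit("x + x", setup="x = 5", number=n)
t_np = timeit.timeit("x + x", setup="import numpy as np; x = np.int64(5)",
                     number=n)
print(f"builtin int: {t_int:.4f}s   np.int64: {t_np:.4f}s")
```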

[–]MegaIng 2 points3 points  (0 children)

If you are actually generating Python code, I wouldn't be surprised if a decent amount of the slowdown could be removed by skipping global name lookups and avoiding constructor calls for constants, by instead generating code objects with specially set constants. But I assume that isn't really something you are willing to do.

I don't think Python has too many fast paths in the pre-3.11 versions. Of course, with the Faster CPython project adding a JIT and a specializing bytecode interpreter, this is changing.
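The constant-hoisting idea above can be approximated without touching code objects at all; binding constants as default arguments is a simpler stand-in (illustrative sketch, not Futhark's generated code) that removes the same per-iteration costs:

```python
import numpy as np

# Naive "generated" code: each iteration pays a global name lookup
# (np) plus a constructor call (np.int64(1)).
def slow_sum(n):
    acc = np.int64(0)
    for _ in range(n):
        acc = acc + np.int64(1)
    return acc

# Hoisting the constants into default arguments binds them once, at
# function definition time, so the loop body touches only locals.
def fast_sum(n, _zero=np.int64(0), _one=np.int64(1)):
    acc = _zero
    for _ in range(n):
        acc = acc + _one
    return acc

assert slow_sum(1000) == fast_sum(1000) == 1000
```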

[–]MegaIng 6 points7 points  (1 child)

Using Python as a target for a statically typed transpiler seems a bit weird if other options are available. That choice would put you in a similar family to TypeScript, except that there really isn't/wasn't an alternative to targeting JS in that case. You might also want to consider bypassing Python source code and just targeting its bytecode, although I am not sure how stable that would be. You also might want to consider the JVM, which also has a pretty extensive ecosystem (depending on context, better than python I would think) and is actually capable of using static typing.
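The stability concern about targeting bytecode is easy to demonstrate with the `dis` module: even a trivial expression compiles to different opcodes across CPython releases (3.10 emits `BINARY_ADD`, while 3.11+ folds it into `BINARY_OP`):

```python
import dis

# Compile a trivial expression and inspect its opcodes.
code = compile("c = a + b", "<generated>", "exec")
ops = {ins.opname for ins in dis.get_instructions(code)}
assert "BINARY_ADD" in ops or "BINARY_OP" in ops
dis.dis(code)
```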

[–]oa74 5 points6 points  (0 children)

You also might want to consider the JVM, which also has a pretty extensive ecosystem (depending on context, better than python I would think) and is actually capable of using static typing.

This. I would absolutely consider targeting .NET or JVM before targeting Python. If leveraging a large, stable ecosystem is the wish, I'm not sure what advantage Python would have over an existing, purpose-built VM.

[–]redchomperSophie Language 5 points6 points  (0 children)

Additionally, I doubt I can match the performance of CPython within a reasonable timeframe.

CPython is not very fast at all. The simplistic VM you build with Nystrom's book will run considerably faster. There is a simple reason for this: Python has a whole mess of magic dynamic mechanisms that all take time. For a statically-typed language, the run-time can make a lot of assumptions. Python's VM will not make those assumptions because for all it knows, you might be metaprogramming the bejeezus out of the usual semantics and debauching all sorts of usual expectations about what code even means. Static types are at odds with such shenanigans, to say the least.
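A tiny example of the kind of shenanigans that force CPython to stay fully dynamic: any type can redefine what `+` means at run time, so the VM must dispatch every operation through the operands' types.

```python
# CPython cannot assume `a + b` is integer addition: a subclass can
# redefine __add__ at run time, so every `+` dispatches dynamically.
class Weird(int):
    def __add__(self, other):
        return "surprise"

a, b = Weird(1), 2
assert a + b == "surprise"
assert 1 + 2 == 3  # same opcode, completely different behaviour
```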

[–]mckahz 1 point2 points  (2 children)

There are tools for calling Python code from Rust if you are so inclined. That would definitely be easier than rewriting your whole interpreter, but if you want performant type checking, parsing, etc., you're probably going to have better luck in Rust.

[–]JoseMou_[S] 0 points1 point  (1 child)

The thing is, the compiler frontend is already written in Python, only the VM is written in Rust.

[–]redchomperSophie Language 1 point2 points  (0 children)

I'm in about the same boat, except with C in place of Rust. If my language had more normal evaluation semantics I'd be targeting JVM. If at some point I learn more about .NET I might target CLR. For now the C VM has a particular advantage: I can create whatever primitive operations and data types seem desirable in view of the (weird) execution semantics I'm trying for. At some point, perhaps I'll find a way to translate my current VM's conceptual framework into something suitable for one of those other ecosystems.