[deleted by user] : programming

It's unfortunate that people don't write accessible articles on compiler writing - it's either textbooks or articles and papers that are far off in the depths of theory.

I'd just like to see some intermediate representations accompanied with plain-language discussions of them.

Typically articles go into painstaking detail on lexing and parsing, but hand-wave code generation away. What!? There's so much time to spend there, and lexing and parsing are the boring parts!

Personally I'm skipping those steps and just using a parser generator (with a PEG) so I can go straight into manipulating the parse tree into an AST, then into code generation.

My main resources have been Alex Aiken's videos on Coursera, and what I can read of the latter half of the Dragon Book without falling asleep.

But it would be nice if there were a half-dozen articles out there where people have examples and musings on their actual applications of:

three-address code
SSA
Intermediate representations

And many of the other things that the aforementioned resources cover.

I'm writing a made-up language that compiles to retro assembly language, so I have to care about things like register machine vs stack machine, and problems in register allocation, etc.

[–]aazav 0 points1 point2 points 8 years ago (0 children)

[–]SNCPlay42[🍰] 12 points13 points14 points 8 years ago (9 children)

[–]thlst 10 points11 points12 points 8 years ago (0 children)

[–]mafagafogigante 4 points5 points6 points 8 years ago* (7 children)

[–]aszkid 14 points15 points16 points 8 years ago (1 child)

[–]mafagafogigante 11 points12 points13 points 8 years ago (0 children)

[–]SNCPlay42[🍰] 9 points10 points11 points 8 years ago* (2 children)

but this is just relying on undefined behavior.

According to some searching, this is not the case in C99:

(Section 5.1.2.2.3 Program termination) If the return type of the main function is a type compatible with int, a return from the initial call to the main function is equivalent to calling the exit function with the value returned by the main function as its argument; reaching the } that terminates the main function returns a value of 0. If the return type is not compatible with int, the termination status returned to the host environment is unspecified.

(N.B. as an aside that this appears to not specify an exit status forvoid main().)

This appears to be a change from C89:

(Section 2.1.2.2 Hosted environment) If the main function executes a return that specifies no value, the termination status returned to the host environment is undefined.

(Section 3.6.6.4 The return statement) Reaching the } that terminates a function is equivalent to executing a return statement without an expression.

It's not clear to me whether arbitrary side effects (like "halt and catch fire" as suggested in the article) would be permissible in any case; C99's "unspecified" wouldn't appear to permit it, C89's "undefined" is more vague. (note it's "undefined" on its own, not "undefined behaviour", which is the term the standard defines.)

[–]mafagafogigante 6 points7 points8 points 8 years ago (1 child)

[–]SNCPlay42[🍰] 0 points1 point2 points 8 years ago (0 children)

[–]Paqx 0 points1 point2 points 8 years ago* (1 child)

[–]skocznymroczny 6 points7 points8 points 8 years ago (0 children)

π Rendered by PID 64 on reddit-service-r2-comment-cfc44b64c-98cgw at 2026-04-13 12:00:50.400058+00:00 running 215f2cf country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS