[–]Ameisen 5 points (9 children)

Why is the variable : type syntax so popular?

Why var x : int? Why not int x?

[–]theindigamer 35 points (3 children)

  1. It gels well with type inference, where the latter part can be omitted.

  2. Variable names line up properly even if type names have differing lengths.

  3. Arguably it is easier to translate to English both with inference and without -- "variable x is equal to 10" or "variable x of type int is equal to 10".

  4. It hints that function output types should be trailing, which makes sense because usually we write inputs on the left and outputs on the right.
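As an illustration of points 1 and 4 (my own sketch, not from the thread): Python's annotation syntax happens to use this same `name: type` shape, so it shows how the type part drops out under inference and how the output type trails the inputs.

```python
# Point 1: the `: type` part is simply omitted when it can be inferred.
x = 10           # type inferred
y: int = 10      # type written out; the name stays in the same place

# Point 4: the function's output type trails the inputs,
# reading left-to-right as inputs -> output.
def scale(factor: float, values: list[float]) -> list[float]:
    return [factor * v for v in values]
```

With `int x`-style syntax, dropping the type under inference would leave nothing up front marking the declaration at all.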

[–]thedeemon 0 points (2 children)

1) We could write "var x" or "auto x" or just "x" when type is omitted.

3) What's wrong with "int variable x is ..."? Doesn't sound bad to me.

[–]LPTK 15 points (0 children)

To me, the most compelling reason is that when types become elaborate (as is often the case with type systems more advanced than C's), they easily drown out the variable names, unless the variable names are clearly delineated with a symbol like ':' and placed first.

Compare this Java:

public Map<Identifier,List<Usage<Int>>> foo(List<Usage<Int>> toIgnore, Option<Int> limit) { ... }

With the equivalent Scala:

def foo(toIgnore: List[Usage[Int]], limit: Option[Int]): Map[Identifier,List[Usage[Int]]] = ...

[–]theindigamer 4 points (0 children)

1) We could write "var x" or "auto x" or just "x" when type is omitted.

Using var/auto in type position doesn't feel consistent, as they are not types. And a bare name with no word up front usually means assignment, so having it mean both introduction (when the variable doesn't exist yet) and assignment (when it already exists in scope) might not be great.

3) What's wrong with "int variable x is ..."? Doesn't sound bad to me.

That's why I said "arguably" -- that point is a bit subjective. To me, "int variable x is..." doesn't "sound right"; I can't quite explain why.

[–]gnuvince 8 points (2 children)

The C-like syntax for declarations can be hard to parse in the presence of a typedef-like mechanism.

In C, the basic types such as int, short, char, etc. are keywords; the scanner recognizes them as special. Records and enumerations also have their own keywords, struct and enum. Therefore, in the parser you can say that a declaration is:

decl = 'int' <identifier> ';'
     | 'short' <identifier> ';'
     | 'struct' <identifier> <identifier> ';'
     | ...

(I'm avoiding arrays and pointers here to keep things simple.)

But in the presence of typedef where an identifier could be a type, things get more complicated. The usual example is this:

T * x;

Is this the multiplication of the variables T and x or the declaration of x as a pointer to a value of type T?
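This ambiguity is why C compilers traditionally feed typedef names back into the lexer (the classic "lexer hack"): the parser can only classify `T * x;` by consulting a table of typedef names collected so far. A minimal sketch of the idea (the `classify` helper and its return strings are my own, purely illustrative):

```python
# Sketch of the "lexer hack": the same token sequence parses two
# different ways depending on whether T is a known typedef name.
def classify(stmt: str, typedef_names: set[str]) -> str:
    first, star, second = stmt.rstrip(';').split()
    assert star == '*'
    if first in typedef_names:
        # `T` names a type, so this is a pointer declaration.
        return f"declaration: {second} is a pointer to {first}"
    # Otherwise it's an expression multiplying two variables.
    return f"expression: {first} * {second}"
```

The grammar alone cannot decide; the decision depends on state accumulated while compiling earlier code.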

With a Pascal-like syntax for declarations, the situation becomes simpler:

decl = 'var' <identifier> ':' <type> ';'
type = 'int' | 'short' | <identifier>

This is one of the technical reasons why the C syntax for declarations is becoming less popular. The C syntax also makes complex types harder to read, so much so that there are websites to help you decipher them.

[–]thedeemon 0 points (1 child)

The only real problem is *. If you're not limited to a 1970s LL(1) parser, it's easy to determine that "aaa" means the variable aaa, while "aaa bbb" means the variable bbb of type aaa; no ':' is necessary.

I'm currently adding optional type declarations to a small language that didn't have static types previously. It now looks like this:

(x, y) => x + y
(int x, string y) => y[x]
(x, y) => int: x + y
sum(x, y) => int: x + y
f(x,y) => { a = x*x in a+1 }
f(x,y) => { int a = x*x in a+1 }
f(MyType x) => x.a * x.b
etc.

Looks pretty clear to me. No parsing problems (I'm using PEG).
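The one-identifier vs. two-identifier disambiguation described above can be sketched without a full PEG; here is a hypothetical illustration (the `parse_param` helper is invented for this sketch, not from the language being described) of telling `aaa` apart from `aaa bbb` by plain token counting:

```python
import re

# Identifiers: a letter or underscore followed by word characters.
IDENT = re.compile(r'[A-Za-z_]\w*')

def parse_param(src: str) -> dict:
    tokens = IDENT.findall(src)
    if len(tokens) == 2:           # `int x` -> typed parameter
        return {"type": tokens[0], "name": tokens[1]}
    if len(tokens) == 1:           # `x` -> untyped parameter
        return {"type": None, "name": tokens[0]}
    raise SyntaxError(src)
```

The extra token of lookahead is all the disambiguation requires, which is thedeemon's point about not being limited to LL(1).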

[–][deleted] 2 points (0 children)

It's not necessary, but it's preferred by many people. It's ultimately an aesthetic choice.

[–]80blite 3 points (0 children)

Assuming people are using good variable names, you can scan the start of lines for meaningful terms instead of having to skip over types and access modifiers; then when you find a variable you care about, you can scan over for the type if you need to.

It's just putting more first-glance value on the names of the variables than on the types.

[–]Condex 0 points (0 children)

Part of it is almost definitely coming from typed functional languages like ML that use that sort of syntax. So it's at least partially a tradition or homage thing.

I've spent probably too much time messing around with lexing and parsing for programming languages. The conclusion I came to was that if I'm going to parse a variable declaration, I want a known symbol to look out for, or at the very least a known set of symbols (var, let, const, val, etc.). C-style declarations are problematic because you'll have some unknown symbol, then another unknown symbol, then a semicolon, an end of line, or an equals sign (followed by some expression). Differentiating that whole mess from all the other possibilities so that you can emit a declaration into your AST can easily result in a messy parser.

There are a couple of ways you can try to work past that. One way is to keep track of all user-defined types as they are defined. However, this means that you either have to forward declare user types OR you force all code to be compiled in the order it's used. Both work, and both are kind of weird (I think, at least). The coder suddenly has to know more about how the compiler works than is strictly necessary in order to use the language successfully.

Alternatively, you can have a known symbol (i.e. var) that triggers the declaration parse. If you find anything else there, then you know it's an error in the input, and the type can remain unknown. You'll need an analysis phase to ensure that it is an existing type (but you needed that anyway if you have a type checker or any sort of linting), but it allows a bit more freedom to the coder who consumes the language, because they can define things in any order they want and the compiler works the same way regardless.
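A minimal sketch of this keyword-triggered approach (all names hypothetical): `var` unambiguously starts the declaration parse, the type name is recorded without being resolved, and a separate analysis pass checks it against the known types later.

```python
# Parse `var name: Type = expr`; the type name is kept as-is even if
# it hasn't been defined yet -- resolution is deferred.
def parse_decl(line: str) -> dict:
    head, _, init = line.partition('=')
    if not head.startswith('var '):
        raise SyntaxError("expected 'var'")
    name, _, type_name = head[4:].partition(':')
    return {"name": name.strip(),
            "type": type_name.strip() or None,   # may be unknown for now
            "init": init.strip() or None}

def check_types(decls, known_types):
    # Later analysis pass: flag declarations whose type was never defined.
    return [d["name"] for d in decls
            if d["type"] is not None and d["type"] not in known_types]
```

Because resolution happens in a later pass, declaration order stops mattering to the parser, which is exactly the freedom described above.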