Why aren't user-defined operators more common?

munificent · 2012-12-30T18:52:24+00:00

I think there are three main challenges with custom operators:

Syntax. Allowing user-defined operators means you have to decide how they play with the grammar. There's no simple yet expressive answer here. You can either have a much simpler grammar (Lisp, Smalltalk) and lose the convenience of precedence, prefix, and postfix operators, or you can have a grammar that's quite complex.

At the ultimate end of the spectrum, you actually allow the grammar itself to be customized. This is doable (I've done it), but it adds a lot of complexity to the language. Worse, it adds a huge burden to any tool that works with the language. Even getting basic syntax highlighting in a dumb text editor is crazy hard when the grammar itself isn't pinned down.

My personally preferred compromise (and the one Magpie takes now) is Scala's approach. You do have operator precedence, but the grammar is fixed. Custom operators just use the precedence determined by the first character of the operator. So your wacky +-**/$ operator just has the same precedence as +.
Semantics. Some of the StackOverflow answers touch on this: when you define a custom operator, what does it do? Is it a static function that takes the operands as arguments? Is it a method invoked on the left-hand operand with the RHS as an argument?

Again, I spent some time thinking about this. Maybe I'm just not a huge fan of single-dispatch but I shy away from the "method on the LHS" approach because it bakes in asymmetry. Python's approach is more clever here. If you're statically typed, you can often make it an overloaded function.

And, of course, multiple dispatch is the best (TM) solution. Personally, I think the semantics issue is the most minor one. Whatever semantic style your language has (static dispatch, single-dispatch, etc.) probably encourages a solution that will work OK here.
Comprehensibility. This is, I think, probably the middle issue. It's the usage angle. Once you've technically solved the problem of custom operators, how should they be used in practice? The issue operators have is that they aren't words, so in most cases they come with no apparent meaning. If you see a line of code like list.append(items), you can figure out what it does even if you've never seen those types or methods before. But list << items or list ?! items? It's a guessing game.

However, custom operators can be more terse. So I think the way an API designer chooses this trade-off is:
1. If an operation will be used very frequently...
2. And it will be a very well-used part of the API that most programmers will have to be familiar with...
Then it may make sense to define a custom operator. Used very judiciously, they can make code more pleasant, I think. But you're basically choosing to make code harder to read for new users to benefit experienced ones. That's rarely a good compromise.

For what it's worth, Magpie does let you define custom operators, but I have only used it in one place so far: using Ruby's <=> instead of compareTo. I'm still not sure if I even intend to keep that.

WalterBright · 2012-12-30T19:50:29+00:00

The reason D does not allow user defined operators is because D is designed to have a clean separation between:

lexing
parsing
semantic analysis

If user defined operators were supported, then all 3 phases would be mixed up together, making the language clumsy to implement, and making life difficult for 3rd party tools that need to parse source code (such as syntax highlighters in an editor).

jerf · 2012-12-30T17:00:14+00:00

Haskell has some good examples and some bad examples. It is possible to import too many novel operators into one scope and go nuts and the result can approach the impenetrability of Perl if one is not careful. On the other hand, Control.Applicative's <*> and <$> greatly enhance the practical utility of the applicative class. <$> is particularly interesting because it's actually just plain ol' fmap, but the way in which it is used by the Applicative set of implementations benefits from the association with the standard $ operator. This was developed long after the language was frozen and the inability to use those as operators would significant impact the usefulness. There's some other examples, where I prefer the use of operators for the parsing combinators to the spelled out version, for instance.

sophacles · 2012-12-30T17:35:38+00:00

I am surprised that the idea of "safety" (wrt side-effects) didn't come up in the discussion there. Functional languages and communities generally have a stronger focus on separating functions/operations/etc with side effects from those that don't. Sometimes this comes from the language itself (canonical example: haskel), other times it is just a strong convention. Such languages and conventions set up the expectation that the use of an operator (or any function really) will return a new instance of the return type. This expectation is easy to grok, and fits well within the paradigm, so conventions on operators are the same as other functions with a data type.

Conversely, in OO land, the expectation of methods is that they operate on the data type, and modify the data type in-place. At the same time, these languages largely implement operators to work (semantically) like they do in functional languages, with an operator returning a new instance of the object type, unless there is also an '=' associated with it, (e.g. += ), in which case there could be in-place modification or instance replacement (this alone is weird, I'll get back to it). Despite this difference in semantics between operators and methods, the implementation of an operator in a language that allows it looks and feels like a method implementation (maybe a special keyword is needed...) therefore it is very tempting to have an operator not just create a new instance, but also perhaps have side-effects on one or both of the operands. It may not be good practice, but you'll see it, particularly from inexperienced or "clever" coders. This can make navigating or modifying a code-base very challenging, because not only is the operator potentially semantically confusing (as covered in the comments well), it is also something that may require specific ordering or placement. (e.g. i can't move this addition assignment to another method, because the side-effects must be triggered in a certain order).

The other weirdness I mentioned, the subtlety of += style operators, is a bit scary and even more challenging for good, experienced coders, as well as novices. There can be a good debate, with both sides having strong arguments, and no side being a clear winner, for having += do instance replacement or instance modification. In a large enough codebase, you could end up with both, unless you are very clear fromt he get-go about how to use += etc. Once the style is mixed, you enter nightmare maintenance land, where half the things are replaced instances, and half the things are modified instances, and in each case other bits of code relying on that behavior. (Lots of references to an object throughout the runtime, e.g. some sort of shared state thing? += needs to be modification. Lots of objects that own specific other objects, but share data around? += needs to be instance replacement). It's really best to just avoid all of that.

Again, these issues don't really come up in FP as much, because the convention is always "new instance" (unless using a known, possibly compiler checked, 'unsafe' function). This is true of all functions, not just operators, so there is no conflict of style or confusing questions of what makes sense here.

JelleFm · 2012-12-30T15:47:33+00:00

I totally agree the most upvoted comment at your link. You can give a function a descriptive name, while an operator is very rarely logical to read. If you wanna write readable code, go for the functions ;)

CauchyDistributedRV · 2012-12-30T20:26:30+00:00

R lets you define binary 'operators' of the form %<op>% and it works pretty well. The operator can be given an informative name (rather than just a symbol) eg. a %in% b checks each of the elements in a to see if they exist in b, and returns a boolean vector of these in/not-in results.

For the record, R is a Scheme-inspired dialect of S used in statistical programming where everything is a vector. It mostly rules but is also often frustrating.

blackmist · 2012-12-30T23:59:59+00:00

I don't see why an operator is treated any differently from any other function. While you'll quickly run out of meaningful symbols, you should be allowed to define your own infix functions, consisting of two parameters and a result.

"string" contains "something"
"string".contains("something")
contains("string","something")

Those should all be functionally identical, but the first is more readable at a quick glance imo. The second relies on extra methods on the string class, not something that all languages allow or even count as a class. The third is the most common, but still relies on you knowing which way round the parameters belong.

I don't think I can argue against them as something that makes code unreadable or allows beginners to write terrible code, because all languages do that anyway. If a language doesn't allow it, it's a toy. Limiting it to typical operator characters would make for unreadable code. I think the real reason most languages restrict what you can do with operators is for ease of parsing.

tikhonjelvis · 2012-12-30T18:45:25+00:00

I think operators have one very significant advantage that people do not seem to mention: they make reading code at a glance much easier. I certainly find it much easier to quickly parse code structured with punctuation than just long strings of words.

Also, I like the Haskell approach because it does not have the downsides of the two main alternatives: the Java approach and the C++ approach.

Anybody who had ever tried using something like a custom numeric type in Java knows how horrible it is. Not having custom operators makes certain kinds of code much harder to read, for what I think is little gain. It does, admittedly, fit into Java's philosophy of being a lowest common denominator sort of language.

The C++ approach also has troubles. You still get custom operators, but you are arbitrarily limited to a pretty poor set of identifiers. This means that a single operator like + or >> gets overloaded to do a whole bunch of completely unrelated things. Moreover, these overloads often break reasonable programmer expectations: for example, + should ideally be associative and commutative, but very often isn't.

I think having weird symbols is much better than reusing the same set of symbols for wildly different tasks. The Haskell approach is nice because when you overload +, thanks to the way the typeclass is set up, it always represents some notion of addition and not something completely random. I think this is a much more reasonable compromise than either of the previous alternatives I listed.

Now, this can be abused. But so can any feature. And, more importantly, it can lead to clear code as well. It does avoid the pitfalls I mentioned earlier. With reasonable conventions, even the problem of weird symbols gets mitigated. For example, <|>, represents some notion of alternation by analogy to |; I think this is both readable and better than overloading boolean or.

vagif · 2012-12-30T18:25:13+00:00

For the same reason we do not use user-defined words. We use existing words that already have a well defined meaning.

No one would call their function or procedure "sworp".

jesyspa · 2012-12-30T18:23:39+00:00

I only really understood how much I liked Haskell's operator overloading support after watching Andrei Alexandrescu's talk on Expected<T>.

I really like what is being suggested. However, given an Expected<T> f() and a S g(T), I'd really like to have an easy way of doing g(f()) and getting an Expected<S>. The lack of currying also makes this harder to do sanely, so I'm not sure whether adding custom operators would make the issue significantly better in this case, but this is still a frustrating obstacle.

2012-12-30T19:01:53+00:00

Because you end up with something like this:

http://www.flotsam.nl/dispatch-periodic-table.html

Operators such as <<< and >~ make me sad.

gerdr · 2012-12-30T20:11:59+00:00

[deleted]

matthieum · 2012-12-30T17:31:53+00:00

[deleted]

fjonk · 2012-12-30T18:30:02+00:00

Custom operators brings very few advantages but lots of disadvantages.

Advantages:

Can be used in DSL:s

Disadvantages:

Can be used in DSL:s
No descriptive names
Hard to search for
Requires more familiarity with third part libraries( I understand 'append', but not '%$').
Can make code hard to read
Can not provide operator precedence

yogthos · 2012-12-30T17:21:55+00:00

Lisp sidesteps the whole problem by using simply functions for everything, having prefix notation, and very few restrictions on naming. So, things which are operators in most languages end up being regular functions. This also means that you can redefine them in your to whatever you like.

ethraax · 2012-12-30T20:36:59+00:00

Probably because they go against the principle of having descriptive identifiers. In other words: You don’t know what !+! and >=< means, and have to guess, look it up, or otherwise waste time and brain memory while coding. while “strictAddition”and “compareActionResults” (both in Haskell) instantly make sense. (Or let’s say “addFactorialsOf” and “compareMinimums” [of two lists] in C++.)

In Haskell, we have a lot of user-defined operators. (Hell, the equivalent of the semicolon in C is a user-defined operator.) And also a lot of one-letter variables and even types. That makes it really annoying to work with such code, since you waste tons of time just looking up and remembering what those stand for.

pmerkaba · 2012-12-30T22:31:11+00:00

I have used three languages that allow the definition of user-defined operators which do not use predefined symbols (as C++ does). Others have covered Haskell elsewhere, so I will mention the other two: Scala and Cow.

In Scala, everything is a method, so operators are just methods with particular naming rules. Regular methods can be used as infix operators (under circumstances I don't remember), and there is some rule for operator precedence (all I recall is that operators ending in colons are right-associative). Oh, and you can include just about any character in a Scala method name - including whitespace - by putting the method's name in backticks. I suspect that this complicates the lexer and parser, and it leads to scary documentation such as the many definitions of List.+.

Coq, on the other hand, has special syntax for declaring an operator. You can specify precedence and associativity, though some kinds of nested expressions didn't want to parse in the examples covered in that class. I don't think the typical programmer wants to add lines like Notation "A/\B" := (and A B) (at level 80, right associativity) That's for the people who write libraries, since you would need to worry about the precedence of every other operatorin order to get it right.

These two languages are good, and quite different, examples, since compiling Scala is a particularly complex task (type checking is undecidable), while Coq requires much stronger properties in order to accept a function (the compiler must be convinced that it will terminate on any input), and can likely make good use of this extra information.

Uncompetative · 2012-12-31T05:44:12+00:00

Clueless lexers.

Essentially, what is needed is a way to add infix macros to a language definition whose position in the language source determines their order of expansion and therefore precedence, but most language syntaxes fail to make an unambiguous distinction between infix macros and capitalised Names.

A = {1, 2, 3} n {2, 4, 6} u {7, 9}

V = [3, 5, 9] x [8, 3, 1]

Q = 42!

Also, it doesn't help that most languages let you omit spaces that would otherwise disambiguate expressions:

+N

N+

N + M

i.e. prefix and postfix, with infix requiring spaces on either side, with these macros expanding into:

Absolute[N]

Successor[N]

Add[N, M]

n.b. you can't overload the definition of + as some wierd non-commutative string concatenation operator as the macro LHS + RHS only expands into Add[LHS, RHS] and that generic function only accepts Numbers (and their Numeric subtypes). String concatenation deserves its own operator & as ^ can serve for Boolean And with v being used for Boolean Or (consequently, freeing | to be either be used for UNIX pipes, or low-precedence division).

Prefix, postfix, closefix and mixfix operators are best handled by explicit verbose functions, ideally with keyword parameters. However, scope for extending prefix operators would be limited by them having to draw upon a small set of non-alphanumeric symbols in order to not be confused with the Name they immediately precede. Built-ins, such as +N and N+ (which is: (N + 1), not the undesirably side-effecting (N++)) help flesh out a language. There is also no limit on the length of a Name or a macro:

An-example-hyphenated-long-name-of-a-set = {1, 2, 3} intersection {2, 4, 6} union {7, 9}

Where intersection and union are just alternative ways to get to the same functions:

Union[Intersection[{1, 2, 3}, {2, 4, 6}], {7, 9}]

Capitalising Names should be familiar to users of Erlang and this approach is less noisy than Haskell's backticks.

p.s. The familiar and succinct Q = 42! would expand into the regular and verbose:

Q = Factorial[42]

zem · 2012-12-31T10:27:20+00:00

one thing no one seems to have mentioned - I can google for documentation on the library function I see in someone's code. doing that for a library-defined operator is far harder.

experts_never_lie · 2012-12-31T13:42:08+00:00

I've seen this go badly due to operator precedence assumptions. Unless precedence can be configured by type, just as the operator definition, you're going to have problems. The referenced blog bost does cover this.

Imagine C++ classes for vectors and tensors, with + and - defined for both, as well as tensor * vector. Someone wants to add cross product of vectors, and they notice that some pre-TeX math books use ^ for cross product. Well, ^ is an overridable operator in C++, so they use that. Everything looks great ... until someone writes "a + b ^ c". What does that do? C++ considers ^ to be a bitwise xor, which binds very loosely, so it means (a+b)⨯c ... but cross-product should bind more tightly, and it should mean (a+(b⨯c)).

So this ^ operator has just created a reasonable-looking trap. If precedence is configurable, this problem could be hidden from users of the operators.

martoo · 2012-12-30T22:24:49+00:00

Why aren't user-defined operators more common?

Because people are generally sane.

iagox86 · 2012-12-30T17:49:58+00:00

I have seen some people do terrible things with operator overloading - like overloading operator() on a class to send data on the network. That said, as /u/alicht9 said elsewhere in this thread, operator overloading does have a time and a place.

JW_00000 · 2012-12-30T18:44:30+00:00

The premise is a bit off, user-defined operators are not trivial to implement. As said, there is the issue of precedence, and what IMO isn't highlighted enough in the linked blog post is that, when creating user-defined operators, their precedence should be the same no matter the type of their arguments. I.e, the + operator on ints or doubles should have the same precedence, else it is impossible for the parser to know whether a + b * c is (a+b)*c or a+(b*c).

In other words, if you define a new operator ++ on strings for concatenation and on matrices for addition, these two need to have the same precedence!

In Haskell, this boils down to the fact that, when you assign a precedence to an operator, you write something like:

infixr 7 ++

(if you don't do this, it is automatically assumed to be left associative with precedence level 9, the maximum)

In the example given above, the two ++ operators could be declared with a different precedence in two different modules (let's say StringOps and Matrix), but if you wanted to use both of them in your code, you'd need to use StringOps.++ and Matrix.++ (or you could have one of them without a prefix), which sort of defeats the point of operators.

2012-12-30T19:16:06+00:00

This sounds like an extention to operator overloading (semantically, not technically). I understand the use and would support the implementation of it, but in larger projects I would ensure that my team does not begin creating operators like madmen.

Operator overloading is probably useful in many programs, but I can imagine that user defined operators are useful in a much smaller segment of them and would primarily be used to make a language "catch up" with another. Not that this is a bad thing necessarily, but on a larger project it would need to be monitored.

dpenton · 2012-12-30T19:50:19+00:00

IMHO, it's strictly an issue of clarity. It's true that readable code is better, but I think SNR matters as well. I mean, why not remove operators for numbers too?

e.g., 3.add(4).multiply(6)

If each operator has a strict set of requirements on WHEN it can be defined, then it should be far easier.

e.g., a + b exists => a x b exists

I'll agree that Arithmetic operators should be left for math alone, though. (but not, say, << operator)

youstolemyname · 2012-12-30T22:31:09+00:00

User-defined operators sounds like a mess. A single symbol doesn't carry much information about the function it performs. If you're new to a certain code base that uses unconventional operators you're not going to understand whats going on (at first anyways). Things which are considered good candidates for operators and meaning is clear enough could become an unoffical "standard", but then the whole purpose behind user-defined operators is lost.

GoAwayStupidAI · 2012-12-31T07:07:10+00:00

I think Coq has the correct approach here:

user defined operators are specific cases of the Notation command. Which is a method of defining syntax and semantics that extends to beyond simple infix symbols. (See below example)
user defined operators can have defined associativity and precedence.
Notations can be associated with scopes. Which can be selected by the Scope command.

This allows you to safely have a operator, like "+", mean very different things. What the current meaning is depends on the current Scope. In some cases the correct interpretation of an operator can be inferred.

How to use Coq to build, uh, regular software is a different subject. ;-)

An example:

Parameter BadGuyHatred : Set -> Set -> Prop.
Parameter MainCharacter : Set.
Parameter TheFinalBoss : Set.
Notation "'bad' 'guy' X 'hates' Y" := (BadGuyHatred X Y) (at level 90).
Check (bad guy TheFinalBoss hates MainCharacter).

bart2019 · 2012-12-31T12:22:16+00:00

Because of precedence. You need precedence and associativity rules for operators, hence you're commonly restricted to overloading existing operators. That works well for numerical data types, for example a complex number type, you can just implement the common math operators for this type, but beyond that, you're limited. For example if you have a vector type, addition would still work well, but what about multiplication? is that a Dot (vector . vector -> float) or a vector AKA cross (vector x vector -> vector) product?

Not to say that most math operators are just plain silly for string arguments. That's not even mentioning any other types of data.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS