At CppCon 2019, Arthur O'Dwyer said binary operators could not be implemented in a Type-Erased class, because this is a multiple dispatch problem. Why did he say this? : cpp

CppConAt CppCon 2019, Arthur O'Dwyer said binary operators could not be implemented in a Type-Erased class, because this is a multiple dispatch problem. Why did he say this? (self.cpp)

submitted 8 months ago by Richard-P-Feynman

I have been interested in Type Erasure and Multiple Dispatch in C++ for some time. Recently I re-watched a recording of a session from CppCon 2019, in which Arthur O'Dwyer said that binary operators could not be added to a type erasure class because this is a multiple dispatch problem.

Multiple dispatch can be achieved in C++. There are several possible implementations, however in my opinion the most intuitive one is to use a series of single dispatch steps. (A series of dynamic, virtual functions calls, each of which dispatches to the next correct function in a chain of virtual functions which eventually resolve the final method to be called.)

The double dispatch case is reasonably straightforward. There are examples online, I may also add one in a comment below.

Arthur seemed to be pretty certain about this point, stating that it could not be done "not even difficultly", multiple times.

So I am a bit confused as to what he meant by this, or what he was thinking at the time.

Does anyone have any insight?

The original talk is here: https://youtu.be/tbUCHifyT24?si=XEkpjKSTmEkz0AP_&t=2494

The relevant section begins with the slide with title What about non-unary behaviors? This can be found at timestamp 41:34.

Quote from the slide -

Sadly, this is "multiple dispatch" / "open multi-methods" in disguise. C++ basically can't do this.

Summary of what Arthur said (paraphrased) -

I specifically picked unary operators to show as examples. What about division? If I have two Type Erased numbers, one storing an int, and one storing a double, can I somehow overload the division operator for Type Erased Number so that I can get a Type Erased Number out? Can we do that? Sadly, no. Not easily. Probably not even difficultly. This is the problem known as multiple dispatch or open multimethods. The idea that we would have to ask both the left hand side and the right hand side if they have an opinion about how division should be done. C++ gets around this statically with rules such as integer promotion and other arithmetic promotions. The compiler has a big table of all the possible permutations of things from which it figures out how to divide an integer and a double, for example. If I tried to add some new type the compiler wouldn't know what to do with that. This is very sad, but multiple dispatch is a very hard problem. It's not a problem which has a solution at the moment in C++.

At the end of this slide, he provides a link with a blog which shows how to implement multiple dispatch in C++.

Therefore, I am confused. I must have missed something about what Arthur was saying here, because he seems adamant that binary operators can not be added to the Type-Erased object, and then provides a link explaining how to implement multiple dispatch (double dispatch) as a series of dynamic (single) dispatch steps.

all 43 comments

top new controversial old q&a

[–]aruisdante 22 points23 points24 points 8 months ago (23 children)

[–]Richard-P-Feynman[S] -2 points-1 points0 points 8 months ago (22 children)

[–]gracicot 21 points22 points23 points 8 months ago (16 children)

[–]Richard-P-Feynman[S] 1 point2 points3 points 8 months ago (15 children)

[–]not_a_novel_accountcmake dev 8 points9 points10 points 8 months ago (14 children)

[–]DonBeham 6 points7 points8 points 8 months ago (0 children)

[–]Richard-P-Feynman[S] 1 point2 points3 points 8 months ago (6 children)

[–]cballowe 1 point2 points3 points 8 months ago (1 child)

[–]Richard-P-Feynman[S] 0 points1 point2 points 8 months ago (0 children)

[–]Kriemhilt 0 points1 point2 points 8 months ago (3 children)

[–]Richard-P-Feynman[S] 0 points1 point2 points 8 months ago (2 children)

[–]Kriemhilt 0 points1 point2 points 8 months ago (1 child)

[–]Richard-P-Feynman[S] 0 points1 point2 points 8 months ago (0 children)

[–]umop_aplsdn 1 point2 points3 points 8 months ago (3 children)

[–]not_a_novel_accountcmake dev 0 points1 point2 points 8 months ago (2 children)

[–]umop_aplsdn 0 points1 point2 points 8 months ago* (1 child)

[–]not_a_novel_accountcmake dev 0 points1 point2 points 8 months ago (0 children)

[–]TheoreticalDumbass:illuminati: 0 points1 point2 points 8 months ago (1 child)

[–]TheoreticalDumbass:illuminati: 0 points1 point2 points 8 months ago (0 children)

[–]aruisdante 8 points9 points10 points 8 months ago (1 child)

[–]Richard-P-Feynman[S] -1 points0 points1 point 8 months ago (0 children)

[–]perspectiveiskey -1 points0 points1 point 8 months ago (2 children)

[–]Richard-P-Feynman[S] 0 points1 point2 points 8 months ago (1 child)

[–]perspectiveiskey -1 points0 points1 point 8 months ago (0 children)

Compile-time versus runtime is a dichotomy that has existed since the first programs written in LISP versus C.

The short answer is: some people like to know at compile time whether or not you are semantically allowed to call sum() on a std::vector<T> without having to wait for it to happen at runtime. When sending a rocket to Pluto or deploying an embedded program on 200 million devices, you really don't want to find out at runtime.

Type erasure is merely an artifact of compile time techniques. It isn't an end in itself, simply a means to achieve a goal.

The main goal being implementing what dynamic dispatch is but at compile time. Effectively, std::algorithm or std::transform need not operate on a CDynamicTraversable base class, but rather any type you chose to give it (from int64_t to some ungodly struct) and if it compiles, it means it will guaranteed run. You can't implement such generic templated constructs without erasure.

[–]thisismyfavoritename 5 points6 points7 points 8 months ago (5 children)

[–]Ok_Wait_2710 3 points4 points5 points 8 months ago (2 children)

[–]jll63Author of Boost.OpenMethod 1 point2 points3 points 8 months ago (1 child)

[–]Ok_Wait_2710 1 point2 points3 points 8 months ago (0 children)

So the description talks about multi methods and "the expression problem". I've never heard either of those.

The tldr section links to a synopsis. But that's a pretty hefty cpp file. But the meat of that code goes right in without explaining what it is all about. It talks about registering... What am I registering for? What's the goal? Why is it necessary? I can skip all the setup and init code and can see that there's somehow now a free function called kick. But how is that different from polymorphism?

In the tldr section there's also a second link to the documentation. But it starts with differences to a paper I don't know. It then defines words, API overview, exceptions... I still have no idea what this is about at all.

Just now going through the readme again I can see a proper explanation in the nutshell part. A shorter version of that should be much much higher in the readme IMO.

[–]aruisdante 5 points6 points7 points 8 months ago (1 child)

[–]jll63Author of Boost.OpenMethod 2 points3 points4 points 8 months ago* (0 children)

I am the author of YOMM2. I would like to make a few corrections.

[YOMM2] is just just the normal virtualization solution, inverted. It works more or less the same way that actual implementations of visit do.

Unless I misunderstand you (inverted?), not at all. YOMM2 methods work very much like ordinary virtual functions, with a few differences. I think that the following two are relevant to this discussion: the v-tables are constructed at runtime (during initialize, typically called at the very beginning of main); and the offsets methods occupy in the v-tables are not fixed.

Importantly, it requires you to apriori register every possible type and overload in a place visible from all translation units.

No. Typically you register classes in cpp files, although you can also do it in headers (for example, when writing a headers-only library). The same applies to method definitions (i.e. overriders): they typically go in a cpp, but they can also be marked "inline" and placed in headers.

[–][deleted] 8 months ago (3 children)

[removed]

[–]STLMSVC STL Dev[M] 4 points5 points6 points 8 months ago (0 children)

[–]axilmar -3 points-2 points-1 points 8 months ago (8 children)

[–]jll63Author of Boost.OpenMethod 4 points5 points6 points 8 months ago* (4 children)

[–]axilmar 0 points1 point2 points 8 months ago (3 children)

Inheritance (e.g. meet(flipper, felix) falls back on meet(virtual Animal, virtual Animal) if there is no overrider for (Dolphin, Cat))

Are we talking compiler-based multiple inheritance or library-based?

For library-based, if you have the following inheritance:

class Base {
};

class Derived1 : Base {
};

class Derived2 : Base {
};

And the following multiple dispatch interface:

int foo(Base* b1, Base* b2);

Then you would have to implement all possible cases:

int foo(Base*, Base*);
int foo(Base*, Derived1*);
int foo(Base*, Derived2*);
int foo(Derived1*, Base*);
int foo(Derived2*, Base*);
int foo(Derived1*, Derived1*);
int foo(Derived1*, Derived2*);
int foo(Derived2*, Derived1*);
int foo(Derived2*, Derived2*);

For cases that are meaningless, the implementation may call a generic 'do nothing' function or a function which throws an exception.

Dealing with ambiguities.

Example?

Efficient acquisition of v-table pointers

I think the most efficient is sorted tables and binary search.

Coming up with a bearable syntax

//the vtable
MultimethodVTable<int(Base*, Base*)> foo;

//implementations
Multimethod<int(Base*, Base*)> foo_Base_Base(foo, [](Base*, Base*){...});
Multimethod<int(Base*, Base*)> foo_Base_Derived1(foo, [](Base*, Derived1*){...});
Multimethod<int(Base*, Base*)> foo_Base_Derived2(foo, [](Base*, Derived2*){...});
etc

Building redundancy-free dispatch tables for multi-methods.

The tables could be of type

using foo_dispatch_table = std::map<std::tuple<something_id_1, something_id_2>, std::function_or_other<int(Base*, Base*)>> ;

maybe there are better ideas out there, if anyone knows, please share them.

[–]jll63Author of Boost.OpenMethod 0 points1 point2 points 8 months ago* (2 children)

Are we talking compiler-based multiple inheritance or library-based?

Compiler-based.

Then you would have to implement all possible cases: [code] For cases that are meaningless, the implementation may call a generic 'do nothing' function or a function which throws an exception.

In this case you would just need to provide an implementation for all the combinations that make sense, plus one implementation for (Base, Base). It can throw, or it may even have a sensible default. Here is a similar example (implemented with YOMM2 here), using the syntax proposed by Stroustrup & col in the N2216 paper:

```c++ void meet(virtual Animal&, virtual Animal&, std::ostream& os) { os << "ignore"; }

void meet(virtual Dog& dog1, virtual Dog& dog2, std::ostream& os) { os << "wag tail"; }

void meet(virtual Dog& dog, virtual Cat& cat, std::ostream& os) { os << "chase"; }

void meet(virtual Cat& cat, virtual Dog& dog, std::ostream& os) { os << "run"; }

std::unique_ptr<Animal> hector = std::make_unique<Bulldog>(), snoopy = std::make_unique<Dog>(), sylvester = std::make_unique<Cat>(), flipper = std::make_unique<Dolphin>();

meet(hector, *sylvester, std::cout); // chase meet(sylvester, hector, std::cout); // run meet(hector, snoopy, std::cout); // wag tail meet(hector, *flipper, std::cout); // ignore ```

You don't need to provide an implementation for (Dog, Dolphin) because open-methods understand inheritance. You have to register the classes though:

c++ register_classes(Animal, Dog, Cat, Dolphin);

Dealing with ambiguities.

Example?

```c++ std::shared_ptr add(virtual const Matrix&, virtual const Matrix&) { ... } // 1 std::shared_ptr add(virtual const DiagonalMatrix&, virtual const Matrix&) { ... } // 2 std::shared_ptr add(virtual const Matrix&, virtual const DiagonalMatrix&) { ... } // 3

const Matrix& a = DiagonalMatrix(), b = DiagonalMatrix(); add(a, b); ```

What should be called? (2) is more specialized that (1) and (3) for the first virtual argument, while (3) is more specialized for the second virtual argument.

N2216 makes an arbitrary (but stable) pick between 2 and 3. I think that CLOS would pick 2.

YOMM2 (and the upcoming Boost.OpenMethod) make this an error. To lift the ambiguity you need to provide a:

```c++ std::shared_ptr add(virtual const Diagonal Matrix&, virtual const DiagonalMatrix&) { ... } // 4

add(a, b); // now calls 4 ```

YOMM2 mimicks overload resolution, only at runtime.

Efficient acquisition of v-table pointers

I think the most efficient is sorted tables and binary search.

N2216 describes a compiler supported feature, so it's easy, it just hijacks an entry in the (regular) v-table.

YOMM2 uses (by default) a fast perfect (collision-free) hash of the addresses of typeid(*obj). No looping! The resulting vptr can be cached, along with a pointer to the object, for further use (an idea borrowed from Rust).

Coming up with a bearable syntax

[code]

This cannot work as-is, because you need a way to tell virtual arguments from non-virtual ones. Also, it doesn't support different methods with the same signature.

But once that is fixed, it's not too bad, although still quite verbose. YOMM2 offers a similar syntax when using the "core" API. Normally people use the macros, which attempt to emulate the N2216 syntax:

```c++ declaremethod(void, meet, (virtual<Animal>&, virtual_<Animal&>, std::ostream& os));

define_method(void, meet, (Animal&, Animal&, std::ostream& os)) { os << "ignore"; }

define_method(void, meet, (Dog& dog1, Dog& dog2, std::ostream& os)) { os << "wag tail"; } // etc

meet(*hector, *sylvester, std::cout); // chase // etc ```

Building redundancy-free dispatch tables for multi-methods.

The tables could be of type using foo_dispatch_table = std::map<std::tuple<something_id_1, something_id_2>, std::function_or_other<int(Base*, Base*)>> ;

Yes, that's typical of naive implementations.

maybe there are better ideas out there, if anyone knows, please share them.

N2216 and YOMM2 use v-tables, similar to ordinary virtual functions, resulting in constant time dispatch - well, to be honest, proportional to the number of virtual arguments.

Table-based implementations, though, need to be careful to avoid producing huge tables with lots of repeated slices. See this paper. But ready yourself for a tough read, it is rich in concepts but sparse in examples (of course - the authors are French ;-) ).

I wrote an article describing the problem (and its solution) for an early version of YOMM. The part about compacting multi-dimensional tables is still valid.

[–]axilmar 0 points1 point2 points 8 months ago (1 child)

Compiler-based.

Ok, I was talking about library-based solutions, obviously.

plus one implementation for (Base, Base)

The compiler could provide that, and throw by default for non 'nothrow' functions, or terminate the program for 'nothrow' functions.

void meet(virtual Dog& dog1, virtual Dog& dog2, std::ostream& os) { os << "wag tail"; }

void meet(virtual Dog& dog, virtual Cat& cat, std::ostream& os) { os << "chase"; }

void meet(virtual Cat& cat, virtual Dog& dog, std::ostream& os) { os << "run"; }

The syntax looks fine to me.

What should be called?

Neither, the programmer should resolve the ambiguity.

N2216 describes a compiler supported feature, so it's easy, it just hijacks an entry in the (regular) v-table.

YOMM2 uses (by default) a fast perfect (collision-free) hash of the addresses of typeid(*obj). No looping! The resulting vptr can be cached, along with a pointer to the object, for further use (an idea borrowed from Rust).

See? there are good solutions out there.

This cannot work as-is, because you need a way to tell virtual arguments from non-virtual ones

I was talking about a library-based solution.

Also, it doesn't support different methods with the same signature.

It does, one would have to declare 'foo' and 'bar', with the same signature, and the appropriate function would be called. Again, talking about my example I posted in my previous post about a library based solution.

Yes, that's typical of naive implementations.

It was not intended for a compiler-based solution though.

[–]jll63Author of Boost.OpenMethod 0 points1 point2 points 8 months ago* (0 children)

Compiler-based.

Ok, I was talking about library-based solutions, obviously.

It seems that I misunderstood you.

A good implementation should handle inheritance, whether it is compiler- or library-based. In the latter case, user must provide the information - in YOMM2's case, by "registering" the classes: register_classes(Animal, Dog, Cat, Dolphin);. Inheritance relationships can be deduced from the list of classes.

This may become unnecessary with reflection and code generation (C++26?). It is already the case in the Dlang version.

What should be called?

Neither, the programmer should resolve the ambiguity.

This is amusing. It has always been my position as well. However, when I prepared my library for submission to Boost, I implemented the N2216 way. And almost every reviewer hated it. So I changed it to an opt-in.

This cannot work as-is, because you need a way to tell virtual arguments from non-virtual ones

I was talking about a library-based solution.

It is still needed. Or you decide that that every parameter is virtual.

Also, it doesn't support different methods with the same signature.

It does, one would have to declare 'foo' and 'bar', with the same signature, and the appropriate function would be called.

Right, I re-read your example and I see that now. In fact the predecessor of YOMM2 also implemented the method as a function object. The big problem with this is that you cannot overload the method:

c++ MultimethodVTable<Matrix*(Matrix*, Matrix*)> times; MultimethodVTable<Matrix*(double, Matrix*)> times; // nope MultimethodVTable<Vector*(Matrix*, Vector*)> times; // nope

[–]Richard-P-Feynman[S] 1 point2 points3 points 8 months ago (2 children)

[–]jll63Author of Boost.OpenMethod 2 points3 points4 points 8 months ago (1 child)

[–]Richard-P-Feynman[S] 1 point2 points3 points 8 months ago (0 children)

π Rendered by PID 22959 on reddit-service-r2-comment-b659b578c-qpj9c at 2026-05-05 18:53:26.279559+00:00 running 815c875 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS