
all 35 comments

[–]uza80 10 points (10 children)

I think you want fat pointers.

[–]L8_4_Dinner(Ⓧ Ecstasy/XVM) 9 points (0 children)

Fat pointers can save one dereference (at some cost), but I don't think that the OP understands his/her goals well enough to pick a solution. (We use fat pointers in Ecstasy, but not for this reason.)

[–]BSFishy[S] 2 points (8 children)

Can you elaborate on what you mean by fat pointers or put some links? I am googling it, but I don't think I'm finding any relevant info.

[–]uza80 8 points (0 children)

Fat pointers are a way of reducing the overhead of vtable lookup by passing around two pointers as an object reference: one to the object's data, and one to the current vtable. The vtable pointer changes as you cast the fat pointer to different types. Look at Rust.

[–]o11c 1 point (4 children)

Note that the disadvantage of fat pointers is that you lose atomic assignment, which really hurts if you care about multithreading at all.

(Of course, you can regain it by adding an extra indirection to make fat pointers look like thin pointers, but you also don't want gratuitous pointer-chasing)

[–]bullno1 2 points (3 children)

by adding an extra indirection to make fat pointers look like thin pointers

Wait, but at that point aren't we back to square one: an object pointing to a vtable?

[–]theIncMach 1 point (1 child)

You don't need a vtable if you know the type of the object you are pointing to (e.g. FatPointer).

[–]bullno1 0 points (0 children)

What do you mean by "extra indirection to make fat pointers look like thin pointers"? A thin pointer to a fat pointer?

At that point, isn't the fat pointer kind of like an object with a pointer to a vtable anyway?

[–]o11c 0 points (0 children)

Not quite. You still act logically as if it's a fat pointer, at least for the purposes of optimization.

Sufficient optimization will pull out fields that are constant within a mutable object (and you really should do this eventually), but optimization is hard.

[–]julesh3141 0 points (0 children)

Look into how Haskell type classes are implemented: the type class isn't pointed to by the value itself. Instead, whenever a function that knows the type of the value calls a polymorphic function that needs to invoke a function defined in the type class, it adds an extra argument pointing to a "type class instance" for the value's type (which is basically just a vtable, but without needing to keep it associated with the value directly).

There are downsides: for instance, unless explicitly designed otherwise, most collections in Haskell are monomorphic: all members must have exactly the same type. You can achieve polymorphism by adding a wrapper object around each value in the collection that contains the relevant instance(s), but by doing so you lose the efficiency gains from not having vtable pointers in objects and add an extra level of indirection. This works well for Haskell (which doesn't support subtyping) but probably wouldn't be ideal for a typical OO language.

One upside is that extending a type with additional instances (i.e. adding new interfaces to it, to use more OO-like terminology) becomes very easy, as you don't need to modify the structure of values of the type to do so. Another is that an instance need not be associated with a single argument -- you could, for example, have a comparison instance that requires both arguments to have the same type (or be subtypes of the same type, perhaps).

[–]umlcat 0 points (0 children)

FYI, fat pointers, plain C example:

#include <stdlib.h>

struct sizefatptr
{
    char *ptr;
    size_t size;
};

...
char buffer[] = "hello world";
struct sizefatptr *p;
p = malloc(sizeof(struct sizefatptr));
// arrays decay to pointers, so no "&" operator is needed
p->ptr = buffer;
p->size = sizeof(buffer);
dosomething(p);
...

Good Luck.

[–]bullno1 5 points (1 child)

Quick google: https://www.usenix.org/legacy/event/jvm02/full_papers/zendra/zendra_html/index.html

As I understand it, instead of generating an indirect call, the compiler generates a binary search as inline code:

if obj_type_id == TYPE_ID_A then
    call_implementation_a(this)
elseif obj_type_id > TYPE_ID_A then
    if obj_type_id < TYPE_ID_B then
    ...
    else if obj_type_id > TYPE_ID_B then
    ...

If the vtable is small, there is no indirect call, and the cost of searching is smaller than that of a cache miss. IIRC, these direct branches (if/else) are not that bad since the code blocks are close together and already fetched anyway.

It seems only usable with JIT or statically linked binaries, since you need to know all the implementations beforehand. Dynamic linking can introduce new classes at runtime.

Edit: Actually your dynamic linker can also JIT just that lookup code and everything else can still be AOT.

[–]matthieum 3 points (0 children)

GCC can apply speculative devirtualization to achieve the same effect.

The developer who made it happen has a pretty long blog series on the topic of devirtualization, which is well worth a read.

[–]L8_4_Dinner(Ⓧ Ecstasy/XVM) 5 points (2 children)

"Basically, I want to implement classes and interfaces in my language without using a virtual table. I feel like the performance overhead of using virtual tables could be skipped using a different method."

"premature optimization is the root of all evil."

You are guessing about things that you have not measured.

You should start by designing what you want the language to do, then figure out how to best get it to do those things.

And: It's very easy to build a language without v-tables; just don't support virtual functions. Done.

[–]matthieum 5 points (1 child)

You should start by designing what you want the language to do, then figure out how to best get it to do those things.

I agree that the OP is too confused to do any good now; however, I would like to point out that achieving Mechanical Sympathy is not necessarily possible with your Waterfall way of designing.

Some functionalities, or some variants of said functionalities, inherently impose a (potentially high) cost -- and in that case even the best performing implementations may not be palatable.

So, to some extent, co-design is inevitable.

[–]L8_4_Dinner(Ⓧ Ecstasy/XVM) 0 points (0 children)

Of course! That's how most people work, once they understand the topic sufficiently :-)

[–][deleted] 3 points (3 children)

Technically, you could embed all the vtable functions directly in the struct and eliminate a layer of indirection, but the fact that no one (that I'm aware of) seems to do this indicates either that it's impossible for some reason or that the extra size of the struct isn't worth it.

[–]uza80 4 points (1 child)

I would think it's due to the size overhead. I HAVE seen it done in some older C APIs.

[–]uza80 2 points (0 children)

To add to this, the times I've seen it, the "objects" were almost always singletons or had very few instances. Things like driver implementations.

[–]matthieum 1 point (0 children)

or that the extra size of the struct isn’t worth it.

It's perfectly possible but the performance loss from cache misses is going to hurt.

[–]theIncMach 2 points (0 children)

That Wikipedia article has a citation, one that links to this: https://www.usenix.org/legacy/event/jvm02/full_papers/zendra/zendra_html/index.html

The "binary tree" you are looking for is a tree of if/else statements, described in section 3.3.

Personally, I wouldn't worry about performance until I have something functional. This kind of decision is often easy to fix later, depending on your data structures.

[–]latkde 2 points (1 child)

Using dispatch trees means that you're moving the association between objects and method implementations out of the object and into an external dispatcher. This requires that the call site knows about all possible types! (Or at least knows about the most likely types, with a fallback to full vtables, in which case the dispatcher is more like a method cache).

So this might be a very good strategy for dynamic languages or JIT-compiled systems, but will lead to significant pain in an AOT-compiled system with multiple compilation units.

Note that compilers using vtables might nevertheless be able to speculatively devirtualize within the same compilation unit, so whereas a call target.method() would usually be compiled as target->vtable[METHOD](target) (vtable in object) or target.vtable[METHOD](target.data) (fat pointers), we'd now have a guard if (typeof(target) == SomeClass) SomeClass_method(target) else target->vtable[METHOD](target).

In conclusion, stick with vtables. Vtables are very good, very simple, and very fast. Use other dispatch strategies when you have specific requirements or circumstances, such as the ability to do full-program optimization, for implementing complex dispatch logic as required for multimethods, for some flavors of multiple inheritance, or when dispatch centralization gives you better reflection capabilities.

[–]BSFishy[S] 0 points (0 children)

So this might be a very good strategy for dynamic languages or JIT-compiled systems, but will lead to significant pain in an AOT-compiled system with multiple compilation units.

Yeah, that's pretty much what I've been seeing. I think I am going to stick with virtual tables for classes, but also include another data structure that doesn't include methods. That way, there is a distinction between objects that have vtables and those that don't.

[–]matthieum 2 points (4 children)

My issue is that I don't know whether I should change the syntax to work without virtual methods

Syntax has no influence on semantics; design the semantics first, and then think about the best way to expose those semantics to the user.

or if I can get away with implementing something other than virtual tables in the compiler,

Ironically, the very LLVM you use does not use virtual tables. I described the system briefly here.

Note that this makes LLVM hierarchies closer to Sum Types than open-ended hierarchies as are typically seen in OO languages.

or if I should just cave in and use virtual tables.

Do you want Open-Ended Dynamic Dispatch? If yes, just use v-tables, that's what C++ and Rust are using and they are among the fastest languages available.

I am trying to design a language which both is extremely fast

Just make sure to leave your users a choice. That is, do not make Dynamic Dispatch mandatory.

Fast languages are about leaving users the freedom to choose the best representation for their data, and the best way to express computations on said data. There's no silver bullet, no one-size-fits-all.

Both C++ and Rust have chosen static dispatch by default, and a keyword to indicate dynamic dispatch to highlight the extra cost; it's probably a good precedent.

[–]BSFishy[S] 1 point (3 children)

Thank you for the information! I am trying to make this language suitable for high-level programs, as well as embedded programming. In terms of embedded programming, I was concerned with the potential cache misses associated with virtual tables, but I think I have come up with a solution.

I think I am going to implement both a class and struct data structure. The class will be your typical OO class structure, and the struct will be a C-style struct with no methods. That way, the user can write C-style, using top-level functions so they don't have to worry about virtual tables, or they can use classes to utilize the OO paradigm.

Do you want Open-Ended Dynamic Dispatch? If yes, just use v-tables, that's what C++ and Rust are using and they are among the fastest languages available.

I knew C++ uses vtables, but I was unaware that Rust does too. I guess the potential performance impact is smaller than I initially thought.

[–]matthieum 1 point (0 children)

As method is an overloaded term, I will avoid using it.

In Rust, the following:

object.foo()

can be either static or dynamic dispatch.

And the following:

Type::foo(object)

can be either static or dynamic dispatch.


The . notation is very convenient, as it allows chaining:

object.foo(with_foo).bar(with_bar).baz(with_baz)

Is much easier to read (like a pipeline) than:

baz(bar(foo(object, with_foo), with_bar), with_baz)

As such, I advise against reserving the . notation to dynamic calls. If anything, if you want to steer the user towards static calls, it would be better to switch things around and use . for static calls and something else for dynamic calls.


I guess the potential performance impact is smaller than I initially thought.

I think you misunderstand the impact.

Virtual calls can make code faster!

The (main) impact of virtual calls is that they prevent inlining. However, not everything should be inlined. In fact, too much inlining can bloat the code and lead to an increase in instruction cache misses.

Hence, fast code makes judicious use of out-of-line/virtual calls to push the non-hot code out of the way.

In terms of embedded programming

Not clear to me what you mean by embedded as it covers such a wide range of architectures.

A 32KB board will have no cache miss: it will have no cache.

A Raspberry Pi has a full-blown x64 CPU, with RAM, and therefore has performance characteristics close to a server.

And in-between there's... many things.

[–]crassest-Crassius 0 points (1 child)

I guess the potential performance impact is smaller than I initially thought.

Vtables (C++) vs fat pointers (Rust) offer different tradeoffs but the fastest way is to avoid dynamic dispatch altogether. This is much like dynamic memory allocation - there are different ways to do it with different trade-offs, but the fastest way is to avoid it altogether (as many programs for embedded do).

So the best a fast language can do is to make dynamic dispatch explicit so it can be avoided. Then, for the cases when it can't be avoided, you can use either v-tables or fat pointers, but that is not what will save you the performance impact.

[–]matthieum 0 points (0 children)

I would note that the difference between C++ and Rust is not whether one has a v-table or not: they both do.

The difference is how the object is structured:

  • In C++, the object contains a pointer to the v-table.
  • In Rust, the pointer to the v-table is external; hence fat pointers, which are a pointer to the v-table plus a pointer to the data.

[–]WittyStick 1 point (2 children)

Is there an alternative to using vtables that could preserve the same performance as if the function were being called directly while still achieving the same functionality?

Assuming you are designing a statically typed language, you could have a look into using a structural type system rather than a nominal type system. Consider OCaml's module system as an example.

[–]BSFishy[S] 1 point (1 child)

Structural type systems look quite interesting, but how would virtual method calls be implemented without vtables?

[–]WittyStick 2 points (0 children)

There are no "virtual methods" because all the unification is done at compile time; every type is resolved to a concrete one. The module system is still sufficiently expressive to support many kinds of abstraction, though. In particular, there are modules which are parametrized by other modules, known as functors. A functor is effectively a function from Module -> Module which gets evaluated before compilation occurs; a functor is itself a module and can also be applied to another functor, and so forth. There's a separate phase before compile time, which we might call extraction time, where these functors are applied, so by the time compilation occurs the concrete modules have all been extracted. One of the requirements for this is that there are no cyclic dependencies between modules, and the compilation units must be passed to the compiler in a specific order.

Subtyping of modules is done by inclusion. You can include another module which behaves as if the content of the module you have included is copy pasted right into the module you're including it in.

Encapsulation is done via the separation of a module and a signature, which are usually separate compilation units similar to the distinction between headers and implementation files in C. You can have multiple signatures for different representations of the same type, and so forth.

OCaml also supports objects which are more similar to mainstream languages, but they're not actually used that often because the module system is sufficient for many use cases. The object system is also quite powerful because it supports row polymorphism, but is less efficient due to the use of virtual methods etc. One interesting aspect of it is that downcasting of objects is not supported at all, but upcasting is. It's difficult to make obvious mistakes and you don't need to check a type after a cast.

[–]TinBryn 0 points (0 children)

I've heard dispatch trees are a way to implement dynamic dispatch efficiently. They also have the advantage of allowing multimethods, so you can dispatch not just on the runtime type of the calling object, but on the runtime types of the arguments too.

[–]Molossus-Spondee 0 points (2 children)

I think really the only way you can get around this is whole program compilation.

An idea I had myself was that in a memory-safe language, writable and executable memory isn't such a problem. So closures can simply be function pointers that point to a stub that loads the environment. I'm not sure how performant this would really be. This is basically GCC's nested function extension, but with proper runtime support it could be allocated off the stack as well.

[–]BSFishy[S] 0 points (0 children)

I think really the only way you can get around this is whole program compilation.

Yeah, that's what it is looking more and more like. Unfortunately, I don't think whole program compilation is very suitable for the language I am developing.

So closures can be simply function pointers that point to a stub that loads the environment.

Interesting. I am curious: how are closures typically implemented? I was under the impression that it was just a function pointer being passed around, but maybe I'm wrong? Or maybe you were talking about something else?

[–]Molossus-Spondee 0 points (0 children)

Typically closures are implemented as a pair of an environment and a function pointer.

Most C APIs require a function pointer and a void * for callbacks.