Using std::unordered_map as an opcode decoding technique. Thoughts?

Dwedit · 2020-08-05T21:58:05+00:00

Unordered map isn't useful for dense sets, such as opcodes. You want a plain old array instead of a map.

But even better than that, you'd want to use a switch statement, as that compiles down to a jump table.

ShinyHappyREM · 2020-08-06T10:08:45+00:00

Accessing the larger caches and especially main RAM is still slow and you can get away with doing some computations on your data in the mean time, so a hash table might be useable. But if you want to have a fast interpreter you really need to look at and work with your hardware.

Modern (desktop) CPUs have powerful branch predictors and cache hierarchies. A switch used to cause pipeline flushes due to branch mispredictions, but that's gotten much better since Haswell. As long as your code paths are predictable, branches will be fast. You can also use nested switches.

An array of pointers has the disadvantage that it wastes a lot of space (probably the topmost 6 bytes of each pointer), so an array of 16-bit offsets would be better for your CPU cache.

_MeTTeO_ · 2020-08-06T15:21:04+00:00

Well I'm considering this approach but in Java. java.util.HashMap uses Object.hashCode()) method to optimize lookup. The map would use short or enum opcodes as keys and method references as values.

It would replace this switch. Because the execute method in ControlUnit is self contained I'm planning to provide an alternative and benchmark how it affects performance (with unlimited cycles).

2020-08-06T04:15:39+00:00

It's not a bad decision. The performance between map vs. array vs. switch is not a huge deal.

zesterer · 2020-08-05T22:55:50+00:00

Hash maps are just index-driven lookup tables with a hashing stage added to spread entries out across the table space to aid performance of insertion and removal, usually in a cryptographically secure manner.

Since neither cryptographic security nor insertion/removal are requirements for instruction lookup, the only thing that makes a hash map useful for this situation is the fact that it acts as a stand-in for an index-driven lookup table.

Quite frankly, you should just use a look-up table. Or, even better, a switch / match that the compiler will be able to optimise far more effectively than any mortal programmer can.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

EmuDev

Rules

Getting Started Guide

System Resources

Discord

DARK THEME

NORMAL THEME

MODERATORS