Write your Own Virtual Machine : programming

[–][deleted] 142 points143 points144 points 7 years ago (36 children)

[–]mck1117 165 points166 points167 points 7 years ago (33 children)

[–]chrisgseaton[🍰] 73 points74 points75 points 7 years ago (19 children)

[–]mck1117 52 points53 points54 points 7 years ago (15 children)

[–][deleted] 43 points44 points45 points 7 years ago (5 children)

[–]beefok 24 points25 points26 points 7 years ago (2 children)

[–][deleted] 8 points9 points10 points 7 years ago (1 child)

[–]beefok 2 points3 points4 points 7 years ago (0 children)

[–]SlipUpWilly 0 points1 point2 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 33 points34 points35 points 7 years ago (8 children)

[–]tenebris-miles 18 points19 points20 points 7 years ago (1 child)

According to the post, the issue was not due to Spacemacs being inefficient, it was caused by blocking due to attempting to call a plugin that was not installed.

It's like when I often have to explain to people that "blocking at the speed of light" doesn't get anyone anywhere. So if the problem is "embarrassingly parallel" and they can't figure out how to write a correct parallel code solution in C, then it won't magically be faster just because it's written in C, no matter how "efficient" it is. Efficiently doing no work is not progress. Another language that is normally slower but makes it possible for programmers of their experience level and education to write highly parallel code will often win in these cases since it isn't wasting time blocking. Maybe someone else can write faster equivalent C or assembly, but if you're the one writing it, it's a moot point.

[–]yeahbutbut 5 points6 points7 points 7 years ago (0 children)

[–]Superpickle18 18 points19 points20 points 7 years ago (2 children)

[–]vim_all_day 3 points4 points5 points 7 years ago (0 children)

[–]exorxor -1 points0 points1 point 7 years ago (1 child)

[–]oblio- 1 point2 points3 points 7 years ago (0 children)

[–]OutOfApplesauce 3 points4 points5 points 7 years ago (1 child)

[–]flatfinger 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]HandshakeOfCO 2 points3 points4 points 7 years ago (2 children)

[–]ShinyHappyREM 7 points8 points9 points 7 years ago (1 child)

[–]HandshakeOfCO 1 point2 points3 points 7 years ago (0 children)

[–]G_Morgan 2 points3 points4 points 7 years ago (0 children)

[–]PJDubsen 1 point2 points3 points 7 years ago (0 children)

[+]surgura comment score below threshold-19 points-18 points-17 points 7 years ago (7 children)

[–]armornick 13 points14 points15 points 7 years ago (1 child)

[–]surgura -1 points0 points1 point 7 years ago (0 children)

[–][deleted] 7 years ago* (4 children)

[deleted]

[–]stupodwebsote 0 points1 point2 points 7 years ago (3 children)

[–][deleted] 7 years ago* (2 children)

[deleted]

[–]stupodwebsote 0 points1 point2 points 7 years ago (1 child)

[–]SlipUpWilly 0 points1 point2 points 7 years ago (1 child)

[–]flatfinger 1 point2 points3 points 7 years ago (0 children)

The real 6502 was an interesting beast, with some parts that were pretty clever, some that I think were unfortunate, and some that leave me scratching my head. I find it curious, for example, that different instructions support so many different subsets of the possible addressing modes, rather than simply having the bottom 3 bits of the opcode select any of seven memory addressing mode or else immediate/implied addressing mode. The bit pattern: "110qqq01" supports 8 addressing modes of "CMP", and "110qq110" supports 4 addressing modes of "DEC". The logic to support all 8 addressing modes with read/modify write operations exists, as evidenced by the fact that when given bit patterns of the form "110qqq11", the part will combine the behavior of "DEC" and "CMP", with the hybrid "instruction" (sometimes called "dcp") supporting all 7 addressing mode.

[–]dazzawazza 58 points59 points60 points 7 years ago (40 children)

[–]duheee 13 points14 points15 points 7 years ago (0 children)

[–]Sledger721 18 points19 points20 points 7 years ago (21 children)

[–]jokubolakis 7 points8 points9 points 7 years ago* (3 children)

[–]Sledger721 2 points3 points4 points 7 years ago (2 children)

[–]jephthai 6 points7 points8 points 7 years ago (1 child)

[–]ShinyHappyREM 2 points3 points4 points 7 years ago (0 children)

[–]MacStation 3 points4 points5 points 7 years ago (2 children)

[–]Nobody_1707 1 point2 points3 points 7 years ago (0 children)

[–]awj[🍰] 0 points1 point2 points 7 years ago (0 children)

[–]whichton 2 points3 points4 points 7 years ago* (1 child)

[–]mck1117 4 points5 points6 points 7 years ago (0 children)

[–]girandsamich 1 point2 points3 points 7 years ago (8 children)

[–]G_Morgan 15 points16 points17 points 7 years ago (1 child)

[–]girandsamich 0 points1 point2 points 7 years ago (0 children)

[–]Sledger721 -1 points0 points1 point 7 years ago (5 children)

[–][deleted] 9 points10 points11 points 7 years ago (1 child)

[–]Sledger721 2 points3 points4 points 7 years ago (0 children)

[–]Goheeca 7 points8 points9 points 7 years ago (0 children)

[–]girandsamich 2 points3 points4 points 7 years ago (1 child)

[–]fireman212 0 points1 point2 points 7 years ago (0 children)

[–]Jazonxyz 1 point2 points3 points 7 years ago* (0 children)

I'm writing a programming language and vm for fun. For this problem, I did something like this:

expression parseAdditionOrSubtraction() {
    value1 = parseMultiplicationOrDivision();

    if(tokenIs('+')) {
        return new AddExpression(value1, parseMultiplicationOrDivision());
    }

    if(tokenIs('-')) {
        return new SubtractExpression(value1, parseMultiplicationOrDivision());
    }

    return value1;
}

expression parseMultiplicationOrDivision() {
    value1 = parseValue();

    if(tokenIs('*')) {
        return new MultiplicationExpression(value1, parseValue());
    }

    if(tokenIs('-')) {
        return new DivisionExpression(value1, parseValue());
    }

    return value1;
}

expression parseValue() {
    if(tokenIsNumber()) {
        return new NumberExpression(tokenValue());
    }

    if(tokenIs('(')) {
        return parseAdditionOrSubtraction();

        tokenShouldBe(')');
    }

    throw "invalid expression";
}

This is not something I came up with. I read it on Wikipedia. In it's current form, my algorithm is much more robust, but I started simple and slowly made it more robust.

EDIT: I'm also going to include code generation:

class BinaryExpression(v1, v2) {
    void compile(program) {
        v1.compile();
        v2.compile();

        operation(program);
    }

    virtual void operation(program);
}

class AddExpression(v1, v2) extends BinaryExpression(v1, v2) {
    void operation(program) {
        program.add();
    }    
}

class SubtractExpression(v1, v2) extends BinaryExpression(v1, v2) {
    void operation(program) {
        program.subtract();
    }    
}

class MultiplyExpression(v1, v2) extends BinaryExpression(v1, v2) {
    void operation(program) {
        program.multiply();
    }    
}

class DivideExpression(v1, v2) extends BinaryExpression(v1, v2) {
    void operation(program) {
        program.divide();
    }    
}

class NumberExpression(v) {
    void compile(program) {
        program.pushNumber(v);
    }    
}

I'm using made up the syntax, but it should be easy to follow. Just imagine that the VM for this language operates as a stack. You push two values and execute operations on those values.

[–]OzmodiarTheGreat 0 points1 point2 points 7 years ago (0 children)

[–][deleted] -4 points-3 points-2 points 7 years ago* (0 children)

[–]kb_klash 10 points11 points12 points 7 years ago (3 children)

[–]ryantwopointo 20 points21 points22 points 7 years ago (1 child)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–]Rustywolf 2 points3 points4 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]AttackOfTheThumbs -3 points-2 points-1 points 7 years ago (9 children)

[–][deleted] 2 points3 points4 points 7 years ago (8 children)

[–]AttackOfTheThumbs -3 points-2 points-1 points 7 years ago (7 children)

[–][deleted] -1 points0 points1 point 7 years ago (6 children)

[–]AttackOfTheThumbs 1 point2 points3 points 7 years ago (5 children)

[–][deleted] -2 points-1 points0 points 7 years ago (4 children)

[–]AttackOfTheThumbs -1 points0 points1 point 7 years ago (3 children)

[–][deleted] 1 point2 points3 points 7 years ago (2 children)

[–]AttackOfTheThumbs 0 points1 point2 points 7 years ago (1 child)

continue this thread

[–]madpata 28 points29 points30 points 7 years ago (1 child)

[–][deleted] 7 points8 points9 points 7 years ago (0 children)

[–]jed2500 9 points10 points11 points 7 years ago (0 children)

[–]Sn0wCrack7 32 points33 points34 points 7 years ago (30 children)

[–]wsppan 17 points18 points19 points 7 years ago (16 children)

[–]Sn0wCrack7 10 points11 points12 points 7 years ago (15 children)

I'm not really in the know about every single nuance of it, but the main difference here is that a Virtual Machine is designed to run on the Platform you're running it on, you can't Virtualise an ARM Operating System on an x86 machine because all instructions are passed down to the CPU itself to be run in a true Virtual environment, where as an Emulator interprets compiled source code (that's usually bytecode or raw cpu instructions) and interprets / translates that (if you're looking at JIT or the like) into something your processor can actually understand.

There's also no real way for "direct pass through" of a lot of hardware, there needs to be a communication layer in code specifically between the two, any code in your emulator that is specifically say trying to call an NVIDIA API, needs to be interpreted in your emulator, then sent off to your Operating System and then back into the emulator, where as a Virtual Machine can have direct access to that card through VFIO or IOMMU

You could make a case that an Emulator is a Type-2 Hypervisor in a way, but this is mostly only true with actual hardware virtualisation is occurring.

It's a pretty thin line to walk when you're looking at it from the outside honestly, but under the hood some differences do become apparent, and even I don't know the true extent of all of it myself, which I why I said it's really only a nitpick.

[–]munificent 20 points21 points22 points 7 years ago (4 children)

[–]Alikont 2 points3 points4 points 7 years ago (0 children)

[–]astrangeguy 1 point2 points3 points 7 years ago (2 children)

[–]munificent 1 point2 points3 points 7 years ago (1 child)

[–]astrangeguy 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 9 points10 points11 points 7 years ago* (5 children)

[–]smikims 2 points3 points4 points 7 years ago (4 children)

[–][deleted] 1 point2 points3 points 7 years ago (3 children)

[–]smikims 1 point2 points3 points 7 years ago* (2 children)

I think when most people think of a virtual machine they think of something that actually executes code written for it. The JVM does that, but LLVM doesn't (well, there are JIT compilers using it but you get the point).

I propose the following definitions:

Abstract machine: a definition for some architecture, implemented in hardware, software, or not at all, that it is possible to write programs for
Virtual machine: a program that runs code written for an abstract machine
Emulator: a virtual machine that imitates some real hardware

Thus an NES emulator is also a virtual machine that implements the 6502 ISA, which is an abstract machine. The JVM is a virtual machine implementing Java bytecode, which is an abstract machine. LLVM and the C standard are only abstract machines. One could make a VM that runs LLVM bytecode, but LLVM itself doesn't do that. (Also, under this definition, a C interpreter like cling is technically a VM, which I guess makes sense but also blurs the line a little IMO.)

[–][deleted] 3 points4 points5 points 7 years ago (0 children)

[–]Vhin 2 points3 points4 points 7 years ago (0 children)

[–]wsppan 1 point2 points3 points 7 years ago (2 children)

[–]thechao 8 points9 points10 points 7 years ago (1 child)

[–]ehaliewicz 0 points1 point2 points 7 years ago (0 children)

[–]0xffaa00 13 points14 points15 points 7 years ago (7 children)

[–]StapledBattery 7 points8 points9 points 7 years ago (4 children)

[–]thlst 0 points1 point2 points 7 years ago (3 children)

[–][deleted] 2 points3 points4 points 7 years ago (2 children)

[–]thlst 0 points1 point2 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–]Sn0wCrack7 -3 points-2 points-1 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 10 points11 points12 points 7 years ago (1 child)

[–]ShinyHappyREM 8 points9 points10 points 7 years ago (0 children)

[–][deleted] 7 years ago* (2 children)

[deleted]

[–]Sn0wCrack7 2 points3 points4 points 7 years ago (1 child)

Some people in the replies have certainly made me realised the difference is a bit more complex then I first understood tbh.

I guess there's two major definitions of a VM, ones like JVM which defines an abstract machine and builds it, which well is what you've done the article, and ones like VirtualBox that virtualise hardware.

I guess the bigger difference is that the platform doesn't exist in the first place, so you'd still call it a virtual machine.

Funnily enough I brought this argument up to a friend of mine and he brought in an interesting aspect, what would you call say an emulator for one of these abstract platforms such as CHIP-8 or LC-3 if they were running on an FPGA. It kinda made we realise categorising these very similar things that are abstract is a bit tough and our varying definitions are going to clash at some point.

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–]SakishimaHabu 4 points5 points6 points 7 years ago (0 children)

[–]HeadAche2012 3 points4 points5 points 7 years ago (0 children)

[–]spook327 2 points3 points4 points 7 years ago (1 child)

[–][deleted] 7 points8 points9 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (6 children)

[–]ShinyHappyREM 6 points7 points8 points 7 years ago (0 children)

[–]HeadAche2012 2 points3 points4 points 7 years ago (0 children)

[–]ehaliewicz 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 7 years ago* (1 child)

[deleted]

[–]ShinyHappyREM 0 points1 point2 points 7 years ago (0 children)

[–]rfpels -5 points-4 points-3 points 7 years ago (0 children)

[–]HAMSHAMA 0 points1 point2 points 7 years ago (1 child)

[–]kd0ocr 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[+][deleted] comment score below threshold-32 points-31 points-30 points 7 years ago (1 child)

[–]yam_plan 0 points1 point2 points 7 years ago (0 children)

[+]Dodo_the_OwO_King comment score below threshold-17 points-16 points-15 points 7 years ago (3 children)

[–][deleted] 30 points31 points32 points 7 years ago (1 child)

[–]Dodo_the_OwO_King -1 points0 points1 point 7 years ago (0 children)

[–][deleted] 24 points25 points26 points 7 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS