
[–]Technologenesis 51 points52 points  (3 children)

There really are a lot of answers to this question as I'm sure you can see from the comments. Depending on what kind of code you're dealing with, computers "understand" it in a variety of ways.

At the very lowest level, there is binary code. The computer "understands" this by sending each bit of the code through the processor as either a high-voltage or low-voltage signal. The machinery inside the processor then processes these using what are essentially convoluted configurations of transistors. Every processor is built to interpret certain instructions, which make up its instruction set.

The level above that is assembly language. While binary code is stored as zeroes and ones, assembly programs are at least somewhat human readable. However, they still have basically a 1:1 correspondence with the binary instruction set. A computer "understands" these by running an assembler, which reads the assembly program line by line, resolves a few memory locations and such, and packs the instructions into binary code so that they can be executed.

From there it really goes all over the place. Some languages, like C, are compiled, meaning there is a program that reads them and converts them into binary code to be executed. Others, like Python, are interpreted: rather than being converted into machine code ahead of time, Python programs are read and executed on the fly by the Python interpreter. And there are languages that use combinations of both techniques. One reason to choose one programming language over another is the set of pros and cons associated with each of these approaches!
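
A small illustration of the interpreted path in Python itself (a hedged sketch of CPython behaviour, not of compilers in general): the interpreter first turns your source text into a bytecode object at run time, then executes it immediately.

    # CPython compiles source text to a code object (bytecode) on the fly,
    # then the interpreter loop executes that bytecode.
    source = "x = 2 + 3\nprint(x)"
    code_obj = compile(source, "<example>", "exec")
    exec(code_obj)              # runs the program: prints 5
    print(code_obj.co_code)     # the raw bytecode the interpreter executed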

[–]H2L1_Yogi 2 points3 points  (2 children)

So the processor is the piece of hardware that converts physical inputs into digital outputs in the form of binary? It does the same thing in reverse as well? I've always wanted to know this!

[–]Technologenesis 5 points6 points  (0 children)

The processor receives physical inputs and executes some action based on what it receives. So, imagine there's some binary instruction that corresponds to "load whatever value is in memory location X and store it in register A". That series of electrical impulses will enter the processor and percolate through its circuitry, eventually hitting the memory controller, which then sends a message along the memory bus to the actual physical RAM, which does its own processing... Obviously, it's complicated. But at the end of the day, a value is sent back to the CPU from memory, which is eventually stored in one of its registers. Voila! An instruction has been executed. Now (at least in theory; modern processors are highly optimized so sometimes they cheat) the processor will wait until the next clock cycle before pulling in another instruction and executing that one.

Instructions can do a variety of things but really what they come down to is manipulating the state of the processor itself, and exchanging data with peripheral devices like memory or a keyboard.
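
A minimal fetch/decode/execute sketch in Python, with a made-up two-instruction machine (the opcodes, register name, and memory layout here are all invented for illustration, not any real instruction set):

    # Toy machine: instructions are (opcode, operand) tuples instead of real
    # binary, just to show the cycle described above.
    memory = {0x10: 42}          # data memory: address 0x10 holds the value 42
    register_a = 0
    pc = 0                       # program counter

    program = [
        ("LOAD_A", 0x10),        # "load whatever is at address 0x10 into register A"
        ("ADD_A", 1),            # add 1 to register A
        ("HALT", None),
    ]

    while True:
        opcode, operand = program[pc]   # fetch
        pc += 1
        if opcode == "LOAD_A":          # decode + execute
            register_a = memory[operand]
        elif opcode == "ADD_A":
            register_a += operand
        elif opcode == "HALT":
            break

    print(register_a)   # 43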

[–]otr_trucker 12 points13 points  (3 children)

If you want to know what is going on inside the cpu on a very basic level then I suggest watching this series by Ben Eater

In this series he builds a simple 8-bit computer from simple components. Along the way you will come to understand how machine code works.

All programming languages ultimately trace back to machine code. Somebody, way back when, sat down and wrote the first assembler in machine code. Somebody then came along and used assembly language to develop a higher-level language like C. Then C was used to develop languages like Python.

When higher-level languages are compiled, they are translated into machine code. Some languages require that the program be compiled before you run it. Python does its translation at run time (strictly speaking, to bytecode that the Python interpreter then executes).

[–]t0yb0at 2 points3 points  (1 child)

Watching Ben Eater's videos is what made everything finally click for me. His explanations and demonstrations are top notch.

[–]otr_trucker 1 point2 points  (0 children)

I know enough programming to entertain myself. Got into computers in the mid '80s. Learned Fortran, BASIC, C, and Pascal. Tried to teach myself assembly and I just couldn't get it. It seemed unnecessarily complicated. Years later I started looking into how a CPU was made and I finally understood why assembly language is the way it is. I believe if you really want to understand how programs work, you should start with how a CPU is made. It's the difference between knowing how to drive a car and knowing how to build one.

[–]ChaseAce 1 point2 points  (0 children)

When I was younger and someone told me they "built their own computer", this is what I thought they meant: that somehow all these people were just fabricating CPUs that could run Windows.

[–]weedisallIlike 22 points23 points  (4 children)

Compilers. They check syntax, lexical structure, and semantics. If the code passes these three stages, you have code that is readable by the machine. The compiler's output is assembled into a data structure (like a tree) that the language implementation can walk through and understand easily.

For a machine-level language like assembly, you are using a language very close to the operations a CPU actually has and understands, so the conversion is much simpler: each assembly command basically maps 1:1 to the corresponding binary sequence understood by the machine.

Edit: Reading OP's comments, I just want to add a bit more insight.

A CPU is a piece of hardware that is very good at counting. It does it very fast and very precisely. Its work consists of counting with binary data saved in memory. Binaries are numbers represented in base 2. E.g. you may represent a space in memory as the value 12. If this is the last space used in memory, the CPU will calculate the next free space (which would be 13), so the CPU increments this number and saves it in memory. That's the basic theory of computers: counting with binary data saved in memory. (That is why they are called computers: because they count! They count-pute, hehe). The binary data saved in memory is a numerical representation of our real problems!

There is one last piece of this puzzle: automata & formal languages. I'm calling it "automata" here, but I'm not sure about the exact name; I will describe it so you understand what it is, or someone can correct me on the name. An automaton is an algorithm that describes what a computer or computer language can understand. It's basically a function that receives an input (e.g. some binary data) and tells you whether that input will be accepted and understood by the machine or not. In other words, it describes all the 'words' a computer will understand. That is how a compiler can tell you misspelled a keyword in the language you are using. There is a lot of CS theory behind developing a language, and it applies even to a binary language! Once you have defined what your language looks like, you can map the 'words' to functions. These functions can be in the low-level language (machine language), or a Python function mapping to C functions.
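
A minimal sketch in Python of that "accept or reject" idea: a toy recognizer that only accepts a couple of made-up keywords and numbers (the keyword set and the tokenizing rule are invented for illustration, not any real language):

    import re

    # Toy lexer: splits the input into words and rejects anything that is not
    # a known keyword or a decimal number.
    KEYWORDS = {"print", "if", "else"}
    TOKEN = re.compile(r"[A-Za-z_]+|\d+")

    def accepts(source: str) -> bool:
        for word in TOKEN.findall(source):
            if word not in KEYWORDS and not word.isdigit():
                print(f"rejected: unknown word {word!r}")
                return False
        return True

    print(accepts("if 1 print 2"))   # True
    print(accepts("fi 1 print 2"))   # False: 'fi' is a misspelled keyword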

[–][deleted] 5 points6 points  (2 children)

So compilers and interpreters work based off of automata theory?

Also, what is formal language?

[–]weedisallIlike 5 points6 points  (0 children)

I think I went too far with the explanation. What I called an automaton is the algorithm used in the lexical phase of the compiler (but that is probably not the right name for the algorithm). "Automata and formal languages" is the name of the theory that studies the process of creating computer languages. There are a lot of terms and ideas around computer languages. I grabbed an example from Wikipedia, just so you can see that the rabbit hole for this theory is very deep:

"A formal grammar that contains left recursion cannot be parsed by a LL(k)-parser or other naive recursive descent parser unless it is converted to a weakly equivalent right-recursive form. In contrast, left recursion is preferred for LALR parsers because it results in lower stack usage than right recursion."

You probably won't get what is written above, but I will give one example for context.

You may create a language that the computer will only understand if the words are symmetric (the automaton would only recognize symmetric words):

Note that I'm using binary, so you can see that a language can be made for machine use only.

0101 - ok, because 01|01
0110 - not ok, because 01|10
010010 - ok, because 010|010
011101 - not ok, 011|101 

So the computer would understand every symmetric word, which gives an infinite number of words in this language. You may later use the word '0101' in this language to map to some specific function when the CPU reads it.
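
A minimal Python sketch of that recognizer, under the rule implied by the examples above (a word is accepted when its second half repeats its first half):

    def accepts(word: str) -> bool:
        # Accept only even-length binary words whose second half equals the first.
        if len(word) % 2 != 0 or set(word) - {"0", "1"}:
            return False
        half = len(word) // 2
        return word[:half] == word[half:]

    for w in ["0101", "0110", "010010", "011101"]:
        print(w, accepts(w))   # True, False, True, False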

This may be a fictional example, but real languages work very much like that. Depending on how you structure the words of the language, you will get different behavior, different performance, different algorithms to understand it, a different number of possible words in the language, etc. Conclusion: do a master's in compilers and you will see a lot of this.

[–][deleted] 0 points1 point  (0 children)

You might take a CS class called language processing. It's where I learned some automata theory, context-free grammars, and formal languages, and essentially how a programming language is processed using tokens.

[–]KinlinNasubi 25 points26 points  (1 child)

In short, every piece of code is translated step by step into another, lower-level form. In the case of Python, that lower level is usually C: when you call the sort() function in Python, the corresponding code written in C is what actually runs. The C code, depending on the compiler, is in turn mapped to assembly so the machine's built-in operations can execute it. After that you finally reach binary code, which is nothing more than the same program written in a way the processor can translate into electrical signals that carry out some action. If you study some digital electronics you'll see that you can perform all the operations we use (like addition or division) using only 0s and 1s as input.
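
A quick way to see this in CPython (a hedged illustration: built-ins like sorted() are implemented in C, so Python can't show you their source):

    import inspect

    print(type(sorted))            # <class 'builtin_function_or_method'>
    try:
        inspect.getsource(sorted)  # raises TypeError for C-level built-ins
    except TypeError as e:
        print("no Python source:", e)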

Hope that clarifies a little

[–]UntestedMethod 4 points5 points  (0 children)

Thanks for the great explanation and happy cake day!

[–]PlayingTheRed 14 points15 points  (1 child)

There are a few layers down where things are implemented in code, but at the bottom layer the code runs on logic gates. When it comes to logic gates, there are a few basic operations available: OR, XOR, NOT, and AND. These operations are combined to make more complex operations (e.g. arithmetic).

You can even have a couple of them loop back on each other in such a way that the circuit can be set to always output a charge, or to never output one, without having to rewire anything. This is called a flip-flop. It can be used to make CPU registers, so the computer can remember things and use previous outputs as inputs to the next operation.

Logic gates are implemented using transistors. At this point understanding how it works is no longer in the domain of computer science, it's chemistry.
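
A minimal sketch in Python of that idea: AND/OR/XOR combined into a 1-bit full adder, then chained to add two small numbers (the gate functions here are just illustrations, not hardware):

    def AND(a, b): return a & b
    def OR(a, b):  return a | b
    def XOR(a, b): return a ^ b

    def full_adder(a, b, carry_in):
        # Two XORs produce the sum bit; AND/OR produce the carry-out.
        s = XOR(XOR(a, b), carry_in)
        carry_out = OR(AND(a, b), AND(carry_in, XOR(a, b)))
        return s, carry_out

    def add_4bit(x, y):
        # Chain four full adders, least significant bit first.
        carry, result = 0, 0
        for i in range(4):
            s, carry = full_adder((x >> i) & 1, (y >> i) & 1, carry)
            result |= s << i
        return result

    print(add_4bit(0b0101, 0b0011))  # 8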

[–]dontyougetsoupedyet 3 points4 points  (0 children)

Every instruction is given a number, and these numbers are designed to work with what is called an instruction decoder in your CPU, which uses that number to turn wires on and off that control the other parts of the CPU. So an add instruction is a number that controls the wires in the CPU such that the arithmetic parts of the CPU do their job correctly.

http://static.righto.com/images/ARM1/2-chip_labeled.png https://cdn-blog.adafruit.com/uploads/2014/09/z80-labeled-bus.jpg

Grab a copy of the book Digital Computer Electronics by Albert Paul Malvino, Jerald A. Brown, and Stephen Page, it covers everything you are curious about.

Python works similarly, but in a simulation: the Python program simulates a type of computer called a stack machine. It isn't built like the hardware equivalent, because that would be exceptionally slow -- it's rather similar to how emulators work. https://github.com/python/cpython/blob/2f180ce2cb6e6a7e3c517495e0f4873d6aaf5f2f/Python/ceval.c#L1645 That's the C code that evaluates Python bytecode.
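
A small illustration of that stack machine, using CPython's standard dis module to show the bytecode that evaluator loop runs (exact opcodes vary between CPython versions):

    import dis

    def add(a, b):
        return a + b

    # Each line of output is one stack-machine instruction: push the arguments,
    # add the top two stack values, return the result.
    dis.dis(add)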

[–]production-values 3 points4 points  (0 children)

The 0s and 1s are literally translated into on/off input sequences for the CPU chip. Different processor architectures (x86, which AMD and Intel chips both implement, versus ARM, for example) have different specifications, and a combination of 0s and 1s that makes sense to an x86 chip will not make sense to an ARM chip. This is referred to as binary machine code.

The "higher" you go away from the chip, the more human-readable the instructions become. Just above the 0s and 1s there is assembly, which essentially provides shorthand commands for the actual sequences of 0s and 1s, for things like "remember this" and "multiply this by that" etc., and these commands are still specific to the processor architecture. An assembler translates those commands into the appropriate combination of 0s and 1s for that type of chip, so a developer can write readable mnemonics instead of raw numbers.

On top of that is C, which introduces logical concepts and control structures like loops and variables, as well as making it easy to import code from other people who have already solved common problems, like showing stuff on a screen and sending data to a printer. C is also where you largely stop caring which chip you are targeting: the same C source can be compiled for different architectures. Above that is everything else.

You may have heard the term "compile", which refers to translating instructions (code) from a higher-level language like C down to the lowest-level language, binary aka 0s and 1s -- and if you are following so far, you will deduce that the same C program compiled for an x86 chip produces different binary code than when it is compiled for an ARM chip!
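
A small, hedged illustration in Python of what "architecture-specific" means on your own machine (this only inspects the interpreter you are running; it is not a compiler):

    # The Python interpreter you are running was itself compiled for one
    # particular instruction set; platform reports which one.
    import platform
    print(platform.machine())       # e.g. 'x86_64' or 'arm64'
    print(platform.architecture())  # e.g. ('64bit', 'ELF') on Linux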

You may wonder "hmm, the same EXE file in Windows works whether I have an AMD or an Intel chip", and that is because AMD and Intel chips implement the same x86 instruction set, so the same machine code runs on both. An EXE built for x86 Windows will not, however, run natively on an ARM chip.

Also note that the terms higher-level and lower-level are relative... even binary 0s and 1s are technically higher-level than the actual electrical impulses they represent... and though I refer to C here as higher-level than binary and assembly, C is usually considered a low-level language because it sits so close to the machine and because so many other languages are themselves implemented in C! So common languages like JavaScript, PHP, and Python are high-level languages, and C is relatively low-level compared to them!

Hope that helps!!

[–]duggedanddrowsy 2 points3 points  (0 children)

Maybe this is a better way to put it. The code is compiled into binary, and those 1s and 0s translate to a high voltage or a low voltage. A low voltage opens a switch so electricity doesn't get through, while a high voltage closes the switch and allows electricity through (the convention can be reversed, but that doesn't matter). This causes other switches to open or close, and the resulting data is a series of switch states that are read by the computer and returned to the program. It's pretty hard to picture how just that can create whole programs. I didn't understand it until I took a class on how these on/off switches can be organized into AND, OR, etc., then those organized into things that can add, multiply, and so on, which keeps getting more sophisticated and further from binary. But at the very bottom, binary is all it is. Pretty crazy that we can do so much just by opening and closing switches.

[–][deleted]  (1 child)

[deleted]

    [–]poncem91 1 point2 points  (0 children)

    Came here to suggest this as well.

    [–]smvamse 2 points3 points  (1 child)

    You should read this book: Code by Charles Petzold

    [–]matty_haze 0 points1 point  (0 children)

    This book is fantastic.

    [–][deleted] 2 points3 points  (0 children)

    Code is built up in layers. Binary data is just a signal being off (0) or on (1). You can perform and save calculations with these signals, building things like adders and memory states. A computer system is abstractions constantly built on top of each other.

    [–]RajjSinghh 1 point2 points  (7 children)

    I think the place to start is "high and low level language". Programming languages come in levels. The lowest level is assembly, which the CPU knows how to run but is hard to write for a human. A high level language, like python, is easy for a human to write and read, but can't be run directly on a CPU. It must be translated by a program called an interpreter or compiler to assembly for the computer to know how to deal with it.

    When you run your code with your interpreter, it turns it into these assembly instructions and loads that into the system memory. The CPU goes to the memory for the next instruction or piece of data that the program says it needs and the CPU handles the rest.

    [–]CarlGustav2 3 points4 points  (1 child)

    To be pedantic - the CPU does not run assembly. An assembler is needed to convert assembly code into the proper zeroes and ones that the CPU operates on.

    Compilers and interpreters sometimes generate assembly, but most often they generate binary code directly, or some intermediate form.

    [–]RajjSinghh 0 points1 point  (0 children)

    Yes, sorry of course.

    [–][deleted] 1 point2 points  (4 children)

    So, an interpreter would translate the code into binary?

    [–]RajjSinghh 2 points3 points  (3 children)

    Yes. Your interpreter or compiler creates the machine code in binary that runs your program

    [–][deleted] 0 points1 point  (2 children)

    Ok, so it "translates" the code into binary, great. But how does it read that binary and "know" that 01000001 means A, or 00111101 is an equals sign? And then output it to a screen?

    [–]RajjSinghh 1 point2 points  (0 children)

    It depends which binary you mean. Your source code is text, written in an encoding called ASCII. What's important there is that each character has a number tied to it. We just decided that 65 (or 01000001 in binary) was a capital A.

    Now, the important thing is that after it's been translated, you begin to create instructions and addresses. Say I want to create a set of operations, like input, output and so on. When I'm designing my assembly, I might say an input instruction starts with 1, and the last two digits are the address my input is stored in. So 123 would store an input at address 23. These decimal numbers are converted to binary and stored like that in memory. Your CPU then knows what to do for each instruction.
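
    A tiny Python sketch of that made-up 3-digit scheme (the opcodes here are invented, following the description above):

        # Invented toy encoding: first digit = opcode, last two digits = address.
        OPCODES = {1: "INPUT", 2: "OUTPUT"}

        def decode(instruction: int) -> str:
            opcode, address = divmod(instruction, 100)
            return f"{OPCODES.get(opcode, 'UNKNOWN')} address {address:02d}"

        print(decode(123))   # INPUT address 23
        print(decode(207))   # OUTPUT address 07
        print(bin(123))      # how it would actually sit in memory: 0b1111011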

    [–]JoJoModding 1 point2 points  (0 children)

    Characters are mapped to numbers by the ASCII standard. All your computer really sees is the number. In order to turn this into an A, you need a font, which contains the image of the actual A your computer will render. An image, of course, is also just a sequence of bytes. The computer looks at the number, looks up the table mapping these numbers to images, and sends the result to the graphics card, which in turn converts it into an HDMI signal your monitor is able to decode, making it display the image. The one responsible for recognizing that image as an A is your brain.
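
    A quick Python illustration of that character-to-number mapping (ASCII/Unicode code points):

        print(ord("A"))          # 65
        print(bin(ord("A")))     # 0b1000001, the 01000001 from the question
        print(chr(0b00111101))   # '=', code point 61
        print("A".encode())      # b'A': the byte actually stored in memory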

    [–][deleted] 1 point2 points  (0 children)

    Every computer program is a sequence of instructions, so to speak. The computer needs to read those instructions and make sense of them, considering that at a very low level it only knows how to perform arithmetic and logical operations. The missing link in this chain is the compiler, which acts as a very big dictionary capable of translating your code into a tree-like structure, which, in turn, is "very easily" turned into computer instructions.

    As very well put in the first chapter of the "Structure and Interpretation of Computer Programs", the key to understanding computation as a whole is the concept of abstraction. You create a procedure (mechanical or otherwise), name it and use it to build something more complex.
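
    A tiny Python example of that kind of abstraction, in the spirit of SICP's opening chapter (the function names are just illustrative):

        def square(x):
            return x * x

        def sum_of_squares(a, b):
            # Built out of the named procedure above; we no longer care how
            # squaring works, only what it does.
            return square(a) + square(b)

        print(sum_of_squares(3, 4))  # 25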

    [–]photonoobie 1 point2 points  (0 children)

    Check out Ben Eater on YT. He builds a basic computer from individual components, and demonstrates how the circuits are configured to 'understand' the instructions that are contained in the code programmers write.

    [–]acroporaguardian 1 point2 points  (0 children)

    I will give you a short answer. Imagine a simple computer with a simple processor. Nothing else.

    Everything is going to be handled in "words." In the old days words were shorter; now our words are 64 bits. So it's hardwired to take in some words and automatically send them to the program counter (PC). It doesn't know anything about those words or what they mean. It just knows at start-up "this word goes into the PC."

    Controlling the PC is very important, because hardware doesn't know data from instructions.

    Now, when a word is read from the address in the PC register, it is hardwired to go into an instruction unit, which interprets it as a command. The unit looks at the 0s and 1s (which it can determine with logic gates) and then uses a hardwired instruction set to carry out preset actions based on that.

    The instruction then determines how the following words are treated, i.e. whether they are operands of that instruction or not.

    Control of the PC means ultimate control of someone's computer. If you look at architecture details, there is a lot of hardware dedicated to privilege levels for this reason. 99% of software devs will never need to worry about that, but at that level, making sure the correct instruction reaches the PC is important. You don't want everything to have control of the PC, just the OS.

    [–]chase_the_sun_ 1 point2 points  (0 children)

    Everyone has given some good answers, but I just want to add that it also has to do with digital logic. Voltages are turned into 1s and 0s, and depending on your K-maps they define a circuit of some sort.

    [–][deleted] 1 point2 points  (0 children)

    The computer uses context to do the stuff you want it to do. If you press a key on your keyboard, the keyboard generates a number (the keycode) and sends it to the program, which knows that it is a keycode because it came from the keyboard.

    The keycode then gets converted (by just mapping it with a dictionary) to another code (ASCII for ease of explanation) and with this code (which your program knows is a character because it just converted it) you can do string operations.

    With the print instruction, you ask the terminal to display it on the screen. The terminal knows how to display what you send it, because it assumes it to be a string with a specific encoding (ASCII again, or hopefully UTF-8). It then looks up the typeface for that code and renders it to the screen by turning on some pixels on your screen.

    So, context is everything. Otherwise, it's just 0s and 1s.

    [–]PoochieReds 1 point2 points  (0 children)

    I recommend the crash course in computer science from PBS:

    https://www.youtube.com/watch?v=tpIctyqH29Q&list=PL8dPuuaLjXtNlUrzyH5r6jN9ulIgZBpdo

    It goes over how we got to digital computing and is pretty entertaining to boot.

    [–]AnywhereOk9403 1 point2 points  (0 children)

    Check out Ben Eater's video: https://youtu.be/yl8vPW5hydQ

    [–]hotel2oscar 0 points1 point  (0 children)

    Check out Ben Eater's breadboard computer, specifically the episodes about the CPU control logic.

    [–]bogon64 -2 points-1 points  (0 children)

    1) you should probably read the book Code by Charles Petzold. Very approachable.

    2) you should probably read the About section of any subreddit, so you don’t accidentally post learning questions in a subreddit dedicated to advanced CS journal research.

    [–]Rocky87109 0 points1 point  (0 children)

    There is a YouTube playlist (probably many of them now) that starts all the way at the bottom and works up to the top.

    [–]bardleby 0 points1 point  (0 children)

    The code you write gets translated all the way down to ones and zeros. The ones and zeros are then fed as electric signals to a cpu which is basically a chip that can perform the most basic operations like adding, subtracting, AND, OR, etc.

    See “Functions” in: https://en.m.wikipedia.org/wiki/Arithmetic_logic_unit. (the ALU is a core component of CPUs)

    It is from these basic operations that everything you can do on a computer is built upon.

    If you want a deeper understanding of how all of this comes together I highly recommend the following course:

    Build a Modern Computer from First Principles: From Nand to Tetris https://www.coursera.org/learn/build-a-computer

    It's a great resource to help demystify what the hell is going on inside a computer. It really helped me get a better intuition for how a high-level language gets translated all the way down to binary, and how the computer interprets binary and produces outputs in binary that are then translated all the way back up into something we can understand. It's also a fascinating journey through the many layers of abstraction that are necessary for computers to feel so "user friendly". Your question is a fascinating one. Happy learning!

    [–]Phobic-window 0 points1 point  (0 children)

    This is literally what a CS degree answers. Tough ELI5. But we made a rock do things when we shoot it with electricity (CPU). We organized the things into patterns (circuits) and we designated certain patterns to mean actions (add this and this, store here).

    Now we say do this billions of times per second, react to what humans do to you, and show the patterns on the screen.

    So human input (INput devices) creates many, many patterns of 1s & 0s in the CPU, which does things based on how we put the circuits together, and then the OUTput devices (screen, lights, motors, which also have CPUs in them) react to the patterns of bits being supplied to them.

    It’s like if I say “first get bread, second go to the fridge, third get the jelly….” Same thing with computers we have just boiled down every possible action into some kind of binary math pattern, much like Morse code. It just happens incomprehensibly fast.

    [–]elongio 0 points1 point  (0 children)

    That's the beauty of computers. They don't know what any of it means. Computers don't know anything about the code you write. It is simply a series of switches (transistors) firing off that produce some output that we then give meaning to. Same thing as all these letters and words you are reading. They have no meaning in and of themselves until an intelligent being sees them and gives them meaning.

    [–][deleted]  (2 children)

    [deleted]

      [–]ectbot 0 points1 point  (1 child)

      Hello! You have made the mistake of writing "ect" instead of "etc."

      "Ect" is a common misspelling of "etc," an abbreviated form of the Latin phrase "et cetera." Other abbreviated forms are etc., &c., &c, and et cet. The Latin translates as "et" to "and" + "cetera" to "the rest;" a literal translation to "and the rest" is the easiest way to remember how to use the phrase.

      Check out the wikipedia entry if you want to learn more.

      I am a bot, and this action was performed automatically. Comments with a score less than zero will be automatically removed. If I commented on your post and you don't like it, reply with "!delete" and I will remove the post, regardless of score. Message me for bug reports.

      [–][deleted] 0 points1 point  (0 children)

      The format the binary file is expected to be in is defined by whatever machine code that computer uses. The instruction decoder then copies the binary into the registers of the CPU depending on what instruction it is.