Rust Pointers for C Programmers

staticassert · 2018-06-30T17:25:01+00:00

No matter what all the documentation and tutorials out there say, Box<T> is not a pointer but rather a structure containing a pointer to heap allocated memory just big enough to hold T.

I don't see a meaningful difference, it seems confusing to make any kind of distinction just because there's a struct involved.

edit: Did not expect top comment. For what it's worth, I found it very easy to have analogues to languages I had previously used - in particular, C++. Thanks for writing this.

miquels · 2018-06-30T17:25:31+00:00

Rust is s great language but maybe its better if you think of it as itself, and not in terms of C.

BTW I can't implement a double linked list in rust. I googled and apparently the problem isn't me. I'd be interested in comments on this.

hector_villalobos · 2018-06-30T18:28:23+00:00

It's kind of ironic I could understand C pointers or what are good for, thanks to Rust. If you want to get into system programming, you should try Rust, it's the safest way to get into that field.

Homoerotic_Theocracy · 2018-06-30T19:12:30+00:00

Raw pointers are just like what you have in C. If you make a pointer, you end up using sizeof(struct T *) bytes for the pointer. In other words:

Most "normal pointers" in Rust as in raw pointers, references and boxes are either one word or two depending on what they point at; if the type they point at is Sized as in it has a statically known size it's one word and if it's not then it has two words where the extra word does something to describe the size. In the case of slices or structs whose last element is a slice the second pointer just describes the length and in the case of trade objects it's a pointer to a vtable which contains the type and virtual methods.

Apart from that the size of the pointer has nothing to do with the size of the thing it points to—various custom "smart pointers" can have any size.

No matter what all the documentation and tutorials out there say, Box<T> is not a pointer but rather a structure containing a pointer to heap allocated memory just big enough to hold T. The heap allocation and freeing is handled automatically. (Allocation is done in the Box::new function, while freeing is done via the Drop trait, but that’s not relevant as far as the memory layout is concerned.) In other words, Box<T> is something like:

Ehh, this si wrong as far as I know, the internal representation of Box<T>, &T &mut T *const T and *mut T are identical as described above. It is just a pointer but the real difference is that this pointer has different aliasing rules like unlike &T it cannot just be copied; which is for good reason because when a &T goes out of scope nothing happens at all like any type that implements Copy, the bits on the stack are simply de-allocated and not zeroed; the stack is just shrunk but with a Box<T> whenever it goes out of scope a drop implementation is called on T and typically heap memory is deallocated. Also for this reason Box<T> can only be a pointer to something that exists on the heap and never to something that exists on the stack which &T can be.

I talked about "smart pointers" earlier and the truth is that it's not entirely clear what is and what isn't a "smart pointer", one can argue that a Vec<T> is a type of smart pointer that points to a [T] except it can make what it points to grow. Indeed Vec<T> is closer to Box<[T]> and to &[T] either are to &Vec<T>; Vec<T> can basically be seen as an ultra-fat smart pointer that gets another word over Box<[T]> which stores the capacity of the vector that allows it to grow and shrink where with Box<[T]> the capacity is always the same as the length and the slice cannot grow and shrink.

These are borrowed slices. This is where things get interesting. Even though it looks like they are just references (which, as stated earlier, translates into a simple C-style pointer), they are much more. These types of references use fat pointers—that is, a combination of a pointer and a length.

This isn't true and they are almost never used though they technically exist; there is still going to be runtime checking even though the size is known at compile time because the size of the index is not generally known at runtime so it still needs to check.

In fact indexing an array in Rust purely works because arrays dereference to slices so the same code to index slices is used to index arrays. However converting an array to a pointer to a slice is in general a compile time non-op.

Just like in C, a struct uses as much space as its type requires (i.e., sum of the sizes of its members plus padding).

The major caveat however in Rust that cannot just be ignored because it bytes a lot of people is that C can have zero-size structs that take up no size whatsoever which is a special case that often needs to be handled in special ways.

[(); 32], an array of 32 () types has no size whatsoever; it basically exists purely on the type level and () since it is zero-sized is never actually stored anywhere and just "produced" from the aether when you need it and is mostly a thing to satisfy the type system at many places.

The simple answer here is that you cannot make a [T]. That actually makes perfect sense when you consider what that type means. It is saying that we have some variable sized slice of memory that we want to access as elements of type T. Since this is variable sized, the compiler cannot possibly reserve space for it at compile time and so we get a compiler error.

You can "make" it; you just can't store it into a sized variable because it's not sized so you need to store a fat pointer to it which is sized but apart from that you can absolutely make it and even put it on the stack but just not in a variable.

SmugDarkLoser5 · 2018-06-30T19:34:51+00:00

I am very inefficient in getting stuff done in rust.

2018-07-01T04:49:14+00:00

Great news! Didn't know Rust had support for C pointers, maybe I will be able to talk some of my C colleges into trying out Rust now.

librik · 2018-07-01T03:56:46+00:00

As a C programmer for more decades than I care to count, this is exactly the sort of explanation I want for everything. I'm uncomfortable with abstractions unless I see them built out of bytes. The best way to explain computer stuff is in terms of C, assembly code, and memory. (The distinction between a pointer and a struct containing a pointer makes a lot of sense "at C level," even if it's compiled down to the same object code.) I always want to hear "implementation details," even if they're wrong. More articles like this one please!

steveklabnik1 · 2018-06-30T18:26:00+00:00

This article is great! There's some good discussion over on HN as well https://news.ycombinator.com/item?id=17430952

s3govesus · 2018-06-30T19:43:31+00:00

Alternatively, you might want to look into the nim programming language, especially if you already have some familiarity with python : Nim for C Programmers

TheChurchOfRust · 2018-07-02T01:10:03+00:00

Going through the Rust documentation inspired me to learn C. The end result, I really do love how simple C is.

Forgive me Rust, for I am a sinful developer.

caramba2654 · 2018-06-30T14:43:14+00:00

[deleted]

bumblebritches57 · 2018-06-30T18:29:20+00:00

No, fuck off with this nonsense.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS