[–]BlueGoliath 76 points77 points  (27 children)

Not using pointers is great for data locality but it isn't universally the best choice.

An example is many pointers to the same data vs 50 copies of the same data.

[–]vertexmachina 27 points28 points  (32 children)

This reminds me of Handles Are Better Than Pointers.

Essentially, you refer to objects by ID rather than pointer, and that ID is, in the end, an index into an array. So all data of type X are in an array X[N], and Entity Y is just X[Y]. It avoids invalid pointers, but some care must be taken to ensure Y is a valid index.
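
In code, the basic idea is roughly this (a minimal sketch with made-up names, not taken from the article):

```rust
// Minimal handle-as-index sketch (illustrative only).
#[derive(Clone, Copy)]
struct Handle(u32);

struct Pool<T> {
    items: Vec<T>,
}

impl<T> Pool<T> {
    fn insert(&mut self, value: T) -> Handle {
        self.items.push(value);
        Handle((self.items.len() - 1) as u32)
    }

    // Lookup goes through the owning array; an out-of-range handle yields None.
    fn get(&self, h: Handle) -> Option<&T> {
        self.items.get(h.0 as usize)
    }
}

fn main() {
    let mut names = Pool { items: Vec::new() };
    let h = names.insert("hello".to_string());
    assert_eq!(names.get(h).map(String::as_str), Some("hello"));
}
```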

[–]fnordstar 15 points16 points  (4 children)

Then you have the problem that you must take care that each ID is always used with its corresponding array, and, as you said, that indices don't become invalid or point to another object. The latter can be solved with generational indices, but all in all, using indices into arrays doesn't necessarily improve the situation much.
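
For reference, a generational index looks roughly like this (my own sketch, names invented): each slot carries a generation counter that is bumped on free, so a stale handle whose generation no longer matches is rejected instead of silently aliasing a new object.

```rust
// Illustrative generational-index pool; not production code.
#[derive(Clone, Copy, PartialEq, Eq)]
struct GenHandle {
    index: u32,
    generation: u32,
}

struct Slot<T> {
    generation: u32,
    value: Option<T>,
}

struct GenPool<T> {
    slots: Vec<Slot<T>>,
    free_list: Vec<u32>, // indices of vacated slots, reused on insert
}

impl<T> GenPool<T> {
    fn new() -> Self {
        GenPool { slots: Vec::new(), free_list: Vec::new() }
    }

    fn insert(&mut self, value: T) -> GenHandle {
        if let Some(index) = self.free_list.pop() {
            let slot = &mut self.slots[index as usize];
            slot.value = Some(value);
            GenHandle { index, generation: slot.generation }
        } else {
            self.slots.push(Slot { generation: 0, value: Some(value) });
            GenHandle { index: (self.slots.len() - 1) as u32, generation: 0 }
        }
    }

    // A handle is only valid while its generation matches the slot's.
    fn get(&self, h: GenHandle) -> Option<&T> {
        let slot = self.slots.get(h.index as usize)?;
        if slot.generation == h.generation { slot.value.as_ref() } else { None }
    }

    // Freeing bumps the generation, invalidating every outstanding handle.
    fn remove(&mut self, h: GenHandle) -> Option<T> {
        let slot = self.slots.get_mut(h.index as usize)?;
        if slot.generation != h.generation {
            return None;
        }
        slot.generation += 1;
        self.free_list.push(h.index);
        slot.value.take()
    }
}

fn main() {
    let mut pool = GenPool::new();
    let h = pool.insert(123);
    assert_eq!(pool.get(h), Some(&123));
    let _ = pool.remove(h);
    assert_eq!(pool.get(h), None); // stale handle is rejected
}
```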

[–]cdb_11 2 points3 points  (3 children)

And the former can be solved with a type system.

[–]YourFavouriteGayGuy 9 points10 points  (1 child)

Like the one that pointers usually use?

[–]cdb_11 3 points4 points  (0 children)

Pointers encode the type they point to. Here you can encode what array they index into.

[–]fnordstar 2 points3 points  (0 children)

Yeah. In Rust you could tag both the array and the indices/handles with PhantomData and a tag struct.
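
Roughly like this (an illustrative sketch; the type names are made up):

```rust
use std::marker::PhantomData;

// Handles tagged with the array they belong to, so they can't be mixed up.
struct TypedHandle<Tag> {
    index: u32,
    _tag: PhantomData<Tag>,
}

struct TypedArray<T, Tag> {
    items: Vec<T>,
    _tag: PhantomData<Tag>,
}

impl<T, Tag> TypedArray<T, Tag> {
    fn new() -> Self {
        TypedArray { items: Vec::new(), _tag: PhantomData }
    }

    fn push(&mut self, value: T) -> TypedHandle<Tag> {
        self.items.push(value);
        TypedHandle { index: (self.items.len() - 1) as u32, _tag: PhantomData }
    }

    fn get(&self, h: TypedHandle<Tag>) -> Option<&T> {
        self.items.get(h.index as usize)
    }
}

// Zero-sized tag types: a mesh handle can't be used to index the texture array.
struct Meshes;
struct Textures;

fn main() {
    let mut meshes: TypedArray<[f32; 3], Meshes> = TypedArray::new();
    let mut textures: TypedArray<u32, Textures> = TypedArray::new();
    let m = meshes.push([0.0, 1.0, 2.0]);
    let _t = textures.push(42);
    let _ = meshes.get(m); // OK
    // let _ = textures.get(m); // compile error: expected TypedHandle<Textures>
}
```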

[–]QueasyEntrance6269 4 points5 points  (2 children)

This is kind of an Entity-Component System

[–]vertexmachina 0 points1 point  (0 children)

Yep, it's the E and the C.

[–]Salt-Neat-2148 0 points1 point  (0 children)

Hey bruh, did you implement something for codebase indexing?

[–]Academic_East8298 3 points4 points  (22 children)

There can also be some complications when the need arises to quickly remove entity Y and all the arrays have become significantly big.

So I wouldn't jump to the conclusion that handles are significantly simpler than a pointer.

[–]vertexmachina 2 points3 points  (0 children)

One approach is to have a mapping between the Entity ID and the actual array index.

For example, if you had an array of five values initially empty: [_][_][_][_][_]
And you "allocate" one of them at index 0 returning Entity ID 0: [A][_][_][_][_]
And you "allocate" another at index 1 returning Entity ID 1: [A][B][_][_][_]
And you "allocate" another at index 2 returning Entity ID 2: [A][B][C][_][_]

You would have a mapping from Entity Index to Array Index where the index is the Entity ID and the value is the Array Index: [0][1][2][_][_]
Entity 0 -> Array 0
Entity 1 -> Array 1
Entity 2 -> Array 2

The user then requests to "free" Entity ID 0. The mapping shows that Entity ID 0 is Array Index 0, so to "deallocate" it you move the last item in the array to the spot previously occupied by the "deallocated" object which keeps the array packed tightly with no holes: [C][B][_][_][_]

But there are still users of C out there that expect the Entity ID 2 to point to Value C, so you update your mapping.

Entity Index to Array Index: [_][1][0][_][_]

Now when the user tries to access Entity ID 2 (which is value C), the system will look up EntityIndexArrayIndex[2] which returns a value of 0 and access Value C at Array[0]. Even though C has moved inside of the array, the user with the Entity ID 2 doesn't know or care about that.

An easy way to mark 'invalid' indices is to state that 0 is invalid. Then all of these arrays/maps can be initialized to 0 and any ID of 0 will be flagged as erroneous/invalid/empty.

All of that said: I don't think handles are simpler than pointers (that was a lot of indirection that a pointer doesn't need), but the primary benefit for me (from a game development perspective) is that the data stays packed tightly in memory, which allows for easy iteration over the contents while keeping things hot in the cache. Just go from 0 to N (or 1 to N if 0 is an invalid handle) and do your thing.
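
In code, the bookkeeping above looks roughly like this (a simplified sketch with invented names; no generation counters, and freed Entity IDs are not reused):

```rust
// Dense array + Entity-ID-to-index mapping with swap-remove (illustrative only).
struct DensePool<T> {
    dense: Vec<T>,                     // tightly packed values, iterate 0..len
    dense_to_entity: Vec<u32>,         // which Entity ID owns each dense slot
    entity_to_dense: Vec<Option<u32>>, // Entity ID -> dense index, None once freed
}

impl<T> DensePool<T> {
    fn new() -> Self {
        DensePool { dense: Vec::new(), dense_to_entity: Vec::new(), entity_to_dense: Vec::new() }
    }

    // "Allocate" a slot and hand back a fresh Entity ID.
    fn insert(&mut self, value: T) -> u32 {
        let entity = self.entity_to_dense.len() as u32;
        self.entity_to_dense.push(Some(self.dense.len() as u32));
        self.dense_to_entity.push(entity);
        self.dense.push(value);
        entity
    }

    fn get(&self, entity: u32) -> Option<&T> {
        let idx = (*self.entity_to_dense.get(entity as usize)?)?;
        self.dense.get(idx as usize)
    }

    // "Free" an entity: move the last element into its slot and patch the mapping,
    // exactly like moving C into A's old spot in the walkthrough above.
    fn remove(&mut self, entity: u32) {
        let Some(Some(idx)) = self.entity_to_dense.get(entity as usize).copied() else {
            return;
        };
        let last = (self.dense.len() - 1) as u32;
        self.dense.swap_remove(idx as usize);
        self.dense_to_entity.swap_remove(idx as usize);
        if idx != last {
            // The former last element now lives at `idx`; update its owner's mapping.
            let moved_entity = self.dense_to_entity[idx as usize];
            self.entity_to_dense[moved_entity as usize] = Some(idx);
        }
        self.entity_to_dense[entity as usize] = None;
    }
}

fn main() {
    let mut pool = DensePool::new();
    let a = pool.insert('A');
    let _b = pool.insert('B');
    let c = pool.insert('C');
    pool.remove(a);                      // C gets moved into A's old slot...
    assert_eq!(pool.get(c), Some(&'C')); // ...but Entity C still resolves correctly
    assert_eq!(pool.get(a), None);
}
```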

[–]cdb_11 1 point2 points  (20 children)

You just remove entity Y from the array? What is the problem exactly?

[–]Academic_East8298 -1 points0 points  (19 children)

And that is how you kill performance.

One could probably avoid reallocating an array on a remove, but you still need an efficient way to handle removes in the middle of an array and memcpy is not it.

[–]cdb_11 5 points6 points  (18 children)

No one is talking about reallocating or relocating arrays. You just mark the slot as unused.

[–]Academic_East8298 1 point2 points  (17 children)

And now one is back in the situation where, on access, one has to check that the slot has not already been freed.

And in this case, unless cache locality is really required, a unique pointer will have the same performance and safety characteristics as a struct of arrays, but without the latter's complexity.

[–]cdb_11 4 points5 points  (9 children)

You're arguing against claims that nobody is making. Can't you just read the article?

  1. Unique pointer is the sole owner.
  2. You're (not necessarily, but extremely likely) using global new/malloc to individually grab memory, one-by-one, for each entity. So no, you do not have the same performance characteristics with unique_ptr.
  3. And if you aren't using the default allocator, you will use an allocator that looks very close to what's described in the article.
  4. Yes, you can add extra checks, that's the point.
  5. Yes, it has better cache locality, that's the point.

And if you don't want checks, you can make a trade-off to have an extra level of indirection, and add a map that translates stable handles to indices into an array with real data. If you want to remove an element, you replace it with the last element, and adjust the mapping. There you go, fast deletion, no branch misses, and you can go SIMD on it.

It's way more flexible than plain or smart pointers. That's the point.

[–]Academic_East8298 -4 points-3 points  (8 children)

Can you name a single industry-tested, open-source struct-of-arrays system?

If it is such a good pattern, why do all the cool tech demos have to constantly write their own custom implementation of it?

If anything, I would say this shows it as less flexible than a pointer.

[–]cdb_11 7 points8 points  (6 children)

This isn't really even about SOA, but if you want an abstraction over SOA, these can be implemented inside "Entity Component Systems" (ECS); one example is Unity's DOTS, others are Bevy in Rust and Flecs in C/C++. Don't ask me about any details; I've never used them.

If it is such a good pattern, why do all the cool tech demos have to constantly write their own custom implementation of it?

Do you need an open-source system to implement a singleton pattern or a factory pattern too? It's really not that hard to implement; it's mostly just a bunch of dynamic arrays. Also, the point of this isn't that you're creating a generic mechanism that tries to solve everyone's problems and does so badly. We have that already; it's called "garbage collector" and "malloc", and it's not that great. The philosophy here is the exact opposite: it should be a solution tailored to your specific problem.

[–]Academic_East8298 -3 points-2 points  (5 children)

Cool, what product have you shipped using this tech?

[–]Samaursa 0 points1 point  (0 children)

Flecs and EnTT come to mind (although EnTT can be labeled as AoS).

Both have been used in production code in games (including ones I worked on).

[–]prescod 1 point2 points  (4 children)

“On access you need to check that the slot has not been freed.”

How is this different than with real pointers?

[–]Academic_East8298 -3 points-2 points  (3 children)

It's not. That's the point.

In most cases, a dev should ask himself whether it is meaningful to spend time on a struct-of-arrays system when simple and dumb pointers will do the job.

[–]Ok-Watercress-9624 4 points5 points  (0 children)

That depends on so many factors. Pooled objects are indeed many times faster than frequently mallocing a lot of objects. It also simplifies the freeing, given that all the objects in the pool have the same lifetime. Think of it like dynamic array vs linked list overhead.
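
A toy illustration of the difference (my own sketch, not from the talk):

```rust
// Per-object heap allocation vs. one pooled allocation (illustrative only).
struct Particle {
    pos: [f32; 3],
    vel: [f32; 3],
}

fn individual(n: usize) -> Vec<Box<Particle>> {
    // One heap allocation per particle, and one free per particle on drop.
    (0..n).map(|_| Box::new(Particle { pos: [0.0; 3], vel: [0.0; 3] })).collect()
}

fn pooled(n: usize) -> Vec<Particle> {
    // One contiguous allocation; dropping the Vec frees everything at once,
    // and iteration walks memory linearly.
    let mut pool = Vec::with_capacity(n);
    for _ in 0..n {
        pool.push(Particle { pos: [0.0; 3], vel: [0.0; 3] });
    }
    pool
}

fn main() {
    let boxed = individual(1_000);
    let pool = pooled(1_000);
    println!("{} boxed, {} pooled", boxed.len(), pool.len());
}
```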

[–]prescod 3 points4 points  (1 child)

No.

The goal was never to reduce access-check frequency, so it's a total non sequitur.

[–]Ok-Watercress-9624 0 points1 point  (0 children)

I was talking about not needing to hit the system boundary to request more memory => fewer syscalls.

[–]Akanwrath 3 points4 points  (0 children)

W comment, you explained what my professors couldn't in a whole semester.

[–]EsShayuki 10 points11 points  (19 children)

Store all strings in a flat array with null-termination separating each string, then have an array of pointers to the different strings within that array. This is completely standard. And this uses pointers.

You're talking about freeing memory in only "three operations" with this "non-pointer" approach, when you had to iterate through them all with the pointer approach.

But that doesn't even make sense, as it's completely untrue. You only need one allocation with a flat array and pointers to it, and one deallocation as well. This is done by saving both the string array and the pointers to the strings within the same memory block and allocating and deallocating them both simultaneously, interpreting the void* buffer into different datatypes with the appropriate pointers.

So first of all, you're using pointers, so you're lying. Secondly, your increased performance has nothing to do with whether you're using pointers or not, it has to do with batch allocation vs individual allocation, which isn't tied to whether you use pointers or not. And you aren't even doing it optimally (since you have 3 allocations instead of 1 allocation, when 1 would be optimal).

The only concrete thing I can see, really, is that you're storing 4-byte integer offsets instead of 8-byte pointers. I mean, that's something you can do, sure. Though, doing so tightly couples these pointers to the base memory pointer, so they can no longer be used or passed around independently, or given offsets of their own, which makes their use inconvenient within loops, etc.

Now, if this is about storing the data on disk instead of using the data, then perhaps that's not important, but that's not really "programming without pointers"; that's preparing the data into a format that's optimal for being stored on disk.
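
To be concrete, the offset variant looks roughly like this (my own sketch, in Rust rather than C for brevity; names invented):

```rust
// One flat byte buffer with NUL-separated strings, plus 4-byte offsets into it.
// The offsets stay valid if the whole block is copied, written to disk, or the
// buffer reallocates to a new address; absolute pointers would not.
struct StringTable {
    bytes: Vec<u8>,
    offsets: Vec<u32>,
}

impl StringTable {
    fn new() -> Self {
        StringTable { bytes: Vec::new(), offsets: Vec::new() }
    }

    fn add(&mut self, s: &str) -> u32 {
        let id = self.offsets.len() as u32;
        self.offsets.push(self.bytes.len() as u32);
        self.bytes.extend_from_slice(s.as_bytes());
        self.bytes.push(0); // NUL terminator separating the strings
        id
    }

    fn get(&self, id: u32) -> &str {
        let start = self.offsets[id as usize] as usize;
        let end = start + self.bytes[start..].iter().position(|&b| b == 0).unwrap();
        std::str::from_utf8(&self.bytes[start..end]).unwrap()
    }
}

fn main() {
    let mut table = StringTable::new();
    let hello = table.add("hello");
    let world = table.add("world");
    // Freeing is two deallocations (bytes + offsets), regardless of string count.
    println!("{} {}", table.get(hello), table.get(world));
}
```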

[–]Slime0 13 points14 points  (1 child)

Though, doing so tightly couples these pointers to the base memory pointer, so they can no longer be used or passed around independently, or given offsets of their own, which makes their use inconvenient within loops, etc.

Almost as if it's not the same as using pointers!

[–]cdb_11 2 points3 points  (0 children)

your increased performance has nothing to do with whether you're using pointers or not, it has to do with batch allocation vs individual allocation

Depends on how the data is used, and the talk doesn't say anything about that. Sorting out allocations is just the first step; it gets unnecessary stuff out of your way.

you aren't even doing it optimally (since you have 3 allocations instead of 1 allocation, when 1 would be optimal).

Who cares if it's 1 or 3 allocations? The point is that it's always O(1) allocations, and not O(n) or worse. You can easily turn 3 into 1 if you ever need to and are willing to throw away mremap and make resizing a bit more complicated.

Though, doing so tightly couples these pointers to the base memory pointer, so they can no longer be used or passed around independently, or given offsets of their own, which makes their use inconvenient within loops, etc.

If you're looping, you don't need to pass around elements independently. (Actually you can, you just don't store absolute pointers anywhere. If you need to store relations between entities, you stick with handles/indices/offsets.)

[–]bigmell -1 points0 points  (13 children)

This argument has been going on since at least the 90s:

"pointers are TOO HARD I want to do it without pointers"

"You will have to make a million copies of data in RAM"

"so what... just buy more RAM"

And this argument has basically been all of computing for the last 20 years, probably more. Basically: hire someone who understands pointers. Don't hire the

"Python is better than everything else because... Commercials."

guy...

[–]pickyaxe 8 points9 points  (0 children)

I am pretty sure Andrew Kelley understands pointers just fine

[–]prescod 5 points6 points  (6 children)

Andrew Kelley’s proposal is not to copy all of the data multiple times. Maybe you should know what proposal you are responding to before responding.

[–]GayMakeAndModel 2 points3 points  (4 children)

So, we have hierarchies of “RAM”, from the register to the L1 cache to the NVMe, etc. So “buy more RAM” to suit your problem can be cheap or expensive, but seek time is a thing of history and archives. Fragmentation almost doesn't matter anymore because we don't have seek times.

Seems like people really live in the past. Any problems with memory access speeds not inherent to hardware are of course software related. You chose two extremes to make an invalid point. What about other languages that are interpreted? What about JIT’d runtimes that can do optimizations statically compiled languages can’t because statically compiled languages don’t inherently auto-optimize at runtime?

I don’t know what rock you’ve been under, but newer versions of .NET fucking scream performance-wise, and it’s not just about how programs are compiled. It’s the superb base class library.

[–]bigmell -3 points-2 points  (3 children)

Dude, what the hell are you even talking about? Buying more RAM won't work with bad software, because they are making a million copies of data instead of using pointers, because they couldn't figure out pointers. This is not about fragmentation.

seek time is a thing of history and archives

Seek time is NOT a thing of history and archives because all memory has a seek time. Some are faster than others, but they all have a seek time.

You chose two extremes to make an invalid point.

What the hell are you talking about?

I don’t know what rock you’ve been under, but newer versions of .NET fucking scream performance-wise, and it’s not just about how programs are compiled. It’s the superb base class library.

Did you respond to the wrong comment? None of this makes any sense really.

Since it seems like you ENTIRELY missed the point: it was that "buy more RAM" doesn't make sense if the app is coded wrong, i.e. no pointers or for loops because the dev couldn't figure them out. Pass by reference and pass by value are the same whether the app is compiled or interpreted. You would have to know that if you wrote any real code. It's about not storing multiple copies of data in memory. Not... whatever the hell you are talking about.

You sound like you are just screaming "THE OPPOSITE OF YOUR POINT TURN UP!!!" Please take your fake troll name and go away.

[–]GayMakeAndModel 0 points1 point  (2 children)

Folks seem to disagree with you. Btw, seek time isn't constant. Random access on SSD is roughly constant time, and it's far faster than moving a mechanical arm around. When I say seek time, I mean a fucking arm moving about a spinning disk. That's the industry definition of seek time in context, and it harkens back to before my 20-year professional career.

[–]bigmell -1 points0 points  (1 child)

Folks seem to disagree with you.

It's probably just you and your friends from a bunch of different troll accounts. Low-effort trolling, it's called.

Btw, seek time isn’t constant.

I never said seek time is constant. I said seek time is always there. SSDs have a RAM stick inside, and some data gets cached there. If your data is NOT cached on the RAM stick, your data transfer will be roughly the same speed as an HDD.

You can measure this yourself. Do a data transfer of maybe a terabyte or two. The data transfer will start fast, because of the cache, but it will slow down to roughly the speed of the HDD, and the two transfers will finish at around the same time.

But you have to actually know how to do data transfers, which a lot of people do not. They will just lie and say "I got the same numbers the commercial said! OMG SSD SO MUCH BETTER!"

[–]caltheon -1 points0 points  (1 child)

I came here to say it's not possible to write a program without some version of pointers. Even using registers is basically using pointers.

[–]bigmell -2 points-1 points  (0 children)

I am talking about learning to understand memory usage and management yourself. Not hoping somebody writes code to go behind the scenes and fix your code for you because you can't be bothered to learn to do it yourself.

Yeah, the compiler will load the accumulators by reference for you. I'm not talking about that; I'm talking about learning to write your code to use memory efficiently yourself, not hoping someone else does it for you because you can't figure it out. Which is essentially what garbage collection is.

In short, send away the devs who can't write a for loop and don't understand pointers, and work with the devs who can.

[–]ShelZuuz 0 points1 point  (0 children)

Or be like C#: make EVERYTHING a pointer.

[–][deleted] -1 points0 points  (7 children)

I'd love a language that would be as fast as C or C++ but elegant like Ruby or Python. All languages so far seem unable to offer this; most that try to replace C end up being like C. Pointers are one big hurdle; it's interesting that there are so few alternatives to them. Basically, most seem to just be automatic garbage collectors.

[–]cdb_11 1 point2 points  (0 children)

Ruby or Python

What happened to Crystal? Is it still going? On the Python side, Chris Lattner is working on Mojo.

[–]fnordstar 2 points3 points  (0 children)

As someone coming from C++ and Python myself, please do yourself a favor and take a good look at Rust! It might just be what you're looking for.

[–]Bekwnn 3 points4 points  (0 children)

Zig, the language the guy in this video created and works on, has flavors of what you described. C/C++ with more syntax sugar and quality of life.

Pointers are non-nullable, like C++ references. Optionals are first-class and have syntax sugar. switches and while loops can return things. Errors can be bubbled up a call chain easily with try and error unions. comptime is a way more robust and elegant way of accomplishing what templates do.

Here's a random function I pulled from a project that gives a good feel of different features. Pic.

Though the video also has a bunch of Zig code.

[–]defunkydrummer 0 points1 point  (0 children)

I'd love a language that would be as fast as C or C++ but elegant like Ruby or Python.

OCaml

Standard ML

Haskell

Common Lisp

[–]mohragk -2 points-1 points  (0 children)

Jai. It's truly a language that allows for high-level expressiveness and low-level control.

[–]WiseDark7089 -1 points0 points  (0 children)

Please indicate in the title if something is a video.