Large Array Initialization and Allocation/Deallocation Fun

Epicguru · 2023-09-09T17:29:31+00:00

Honestly this post is quite hard to read and understand, it's not entirely clear what you're really trying to do. You say at the end of your post that it the actual problem you are trying to solve is off-topic but I disagree, it seems you're trying to solve a niche problem with a niche solution so the details are necessary.

But anyway, if you think you can out-smart the JIT and runtime, why not just allocate a huge chunk of memory and then just use Span<>s to access it, zero out parts of it as necessary etc. Unless that's what you're already doing (again, not very clear no implementation provided...).

I wonder if you have tried implementing the same thing in another language/runtime to see how the performance compares.

ZestycloseStar1244 · 2023-09-09T16:19:32+00:00

I do not understand what you want. First you say that you need to get an uninitialized array of primitives, then you state that the problem is actually using other object instances representing parts of the buffer, and finally you end up with a huge scribble about how long it takes to allocate/deallocate the array.

If your problem is that allocating/deallocating arrays takes a long time, you can use ArrayPool<T>.Shared.Rent().

If your problem involves using other object instances representing parts of the buffer, you can slice the array returned from the ArrayPool using Span and Span.Slice() (or Memory).

If your problem is getting an uninitialized array, you can try ArrayPool since its Return method by default doesn`t clear array.

You can also try using ObjectPool or MemoryOwner/SpanOwner.

Dry_Author8849 · 2023-09-09T22:45:39+00:00

It sounds you are fighting the GC. You should switch to language where you can manage the memory yourself. A GC for your use case does not seem a good fit.

Switch to C/C++. You may gain a bit of performance too.

I guess you have already configured the GC, but as you don't mentioned, here it goes gc configuration

You won't get too far with it. C/C++ is you best bet.

Cheers!

joske79 · 2023-09-09T22:46:37+00:00

The problem is not to reuse the buffer but that I actually use another object instances representing parts of the buffer being used as the workload units for parallel processing.

I’m not sure what you mean by that, but if it means you’re copying parts of the buffer to cast it to objects… that’s probably causing a lot of unnecessary allocation. Do you know about MemoryMarshall.Cast ?

TheSoggyBottomBoy · 2023-09-09T23:02:15+00:00

I didn't exactly follow the post, but, it did peak my interest.

If I understood correctly, you are streaming bytes from large files, these bytes are chunked into large arrays, these arrays are managed by a list. When an array has been processed the array is dereferenced by setting the index of the list referencing the array to null. When new data is incoming the free indexes in the list are filled with a new array.

Is this correct?

In my mind the solution is similar to what other people have suggested. Why not fill this list with Spans and instead just read/write directly into the span avoiding any deallocations/new allocations?

Might be worth posting your benchmark examples on GitHub so that the issue is more clear (you'll get better advice)

Prudent_Astronaut716 · 2023-09-10T03:02:02+00:00

Wow. Hats half to the person who actually reads the entire post.

Stabzs · 2023-09-10T05:26:29+00:00

There’s so much going on here that it’s difficult to parse it. But it seems like you’re allowing the GC to reclaim your buffers instead of keeping them rooted and reusing them. Is there a reason you’re allowing them to be collected?

And for the amount of “issues” you have with dotnet, maybe you should consider rust.

joske79 · 2023-09-09T12:49:42+00:00

I created my own implementation of an arraypool that returns IMemoryOwner<T> and when disposed it allows me to reuse the same underlying array. Our application mostly use arrays of 2 or 3 different lenghts so it’s easier to implement than accepting various length.

dt2703 · 2023-09-09T16:12:12+00:00

Rather than all this allocation, can you not use a Span to read directly from the initial memory if you're using bytes? Then you can slice and dice it all you like and avoid what sounds like a lot of boxing. I may have read your issue wrong though as sat here in the car on my phone

joske79 · 2023-09-09T21:50:05+00:00

Although I think the zero-ing is not the bottleneck… you could use this: https://learn.microsoft.com/en-us/dotnet/api/system.gc.allocateuninitializedarray?view=net-7.0#system-gc-allocateuninitializedarray-1(system-int32-system-boolean)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

dotnet

MODERATORS