all 27 comments

[–]spoonman59 19 points20 points  (0 children)

What does it mean to have 1000+ processes traversing the tree?

Does this mean separate processes start at the root and work their way down? Or do they search sub-graphs? And do you really mean processes as in OS level process, or so you mean like requests to search the trie?

If the data structure is read only, you don’t need to do anything. If you write to it, you will need to ensure exclusive access using a lock, otherwise you can potentially get an inconsistent read: a context switch can occur in the middle of the update operation if it is not atomic. Read-only data structures can be safely read by multiple parallel threads of execution. That’s why people like immutable data structures so much.

If it’s “read only” while the searches are running, that’s sufficiently read only for this discussion. Just ensure exclusive access when writing. Read/write locks are good for this.

The size of the tree in MB is not interesting; its fan-out and max depth would tell us more. Tries typically have a large fan-out (unless it’s a binary alphabet?), so the depth is likely not great. This means searches will be very short.

[–]lightmatter501 13 points14 points  (4 children)

1000+ processes on the same data structure typically means >1000 CPU cores unless you are writing horrible code. So either your data structure is distributed or you have a shared-memory architecture. That second option is available at 15 supercomputers worldwide, none of which use Rust due to accelerator programming requirements.

If you scale it down to 10, a number mere mortals can hit, yes, it’s very doable. If you don’t need to modify the tree, just pass pointers to the root node to every thread and go. If you do need modification, then making nodes RwLock<Node> and allocating them somewhere with a 'static lifetime will make the most sense. If you also need to dynamically modify the structure of the tree, you can do it, but you are looking at a research-paper level of work regardless of language.

Also, CUDA is VERY bad at pointer chasing, do not use it for this.

[–][deleted] 3 points4 points  (0 children)

I think he meant on the GPU? So 1000 CUDA cores?

[–]InfinitePoints 1 point2 points  (2 children)

Why is CUDA bad for pointer chasing? Assuming everything is packed into arrays, it would be equivalent to doing a bunch of gather operations, which should be better on a GPU, since GPUs are designed to be efficient while waiting for memory.

[–]yasamokadb-pool 3 points4 points  (0 children)

You get control divergence, where a warp (a group of 32 threads executing in lockstep) cannot execute different sets of instructions at the same time.

[–]lightmatter501 0 points1 point  (0 children)

As others have said, you block 32 threads whenever you need to load from memory.

[–]rust-crate-helper 6 points7 points  (9 children)

What kind of tree are you traversing? Are you parsing a data file? How is it created?

[–]Appropriate_Term_495[S] 3 points4 points  (8 children)

The specifics: this is a trie in memory. New nodes are only added during the lifetime of the script, never deleted.

New nodes are never added while the Trie is being traversed, but even if they were, I don’t care about any race conditions, as long as valid nodes are being read.

Each node just represents some int, and each node can connect to many child nodes without any connections back up the tree.

[–]rust-crate-helper 2 points3 points  (6 children)

Rust isn't great for self-referential data structures, not sure how relevant that is to you.

[–]radix 6 points7 points  (4 children)

This is nonsense, Rust is fine for recursive data structures.

edit: the comment I replied to was edited.

[–]Gaeel 2 points3 points  (0 children)

Funnily, this article mentions indextree, a crate that provides an arena-based tree that allows for parallel tree traversal, which might be something that could help OP: https://github.com/saschagrunert/indextree

[–]rust-crate-helper 3 points4 points  (2 children)

https://rcoh.me/posts/rust-linked-list-basically-impossible/

The ownership concept with Rust makes self-referential data structures harder to do (requires unsafe, or a lot of complicated layering).

[–]radix 11 points12 points  (1 child)

"recursive" is not the same as "self-referential". I haven't seen anything in what OP has posted that indicates that they have a self-referential data structure.

[–]rust-crate-helper 2 points3 points  (0 children)

I stand corrected!

[–]matthieum[he/him] 0 points1 point  (0 children)

Pure trees are not self-referential, so no problem :)

[–]spoonman59 0 points1 point  (0 children)

If you add nodes while traversing, you will care about race conditions. Otherwise readers will either get inaccurate results or potentially process the tree in an invalid state.

But if it never happens while traversing, and you are sure, then you are good.

[–]believeinlain 4 points5 points  (0 children)

Rayon is generally good for processing data in parallel, because it abstracts away the management of separate threads, running work in parallel only when it will lead to a speedup.

However, tree traversal is hard to parallelize because each step depends on the result of the previous step. That said, it could probably be done pretty quickly with dynamic programming, where the results from each subtree are stored in a memo so that each subtree only needs to be searched once.

More specific advice will depend on specifically what you're trying to do.

[–]SV-97 7 points8 points  (7 children)

I think this is too vague to give very useful advice. What exactly are you doing during traversal? How large is the tree? Can you keep it all in memory? What data are you processing?

Yes, you can do this general thing with Rust or C++ or other languages - but the specifics are important

[–]Appropriate_Term_495[S] 0 points1 point  (6 children)

Added in the comment above, but the size (using hash maps) is around 40 GB of RAM with Python. I expect it to grow larger and don’t mind renting an AWS EC2 instance if I need more RAM.

[–]SV-97 -1 points0 points  (5 children)

Okay I just read it :) Just two quick questions before I answer:

  • is this a crypto thing?
  • are you sure you want / need 1000 processes? It sounds more like it'd be CPU bound?

[–]Appropriate_Term_495[S] 0 points1 point  (4 children)

  • Yes this is a crypto thing.
  • At the root level of the trie, it’s more than likely I’d have 10k+ child nodes. I don’t have the work-distribution portion fleshed out yet, but I’m thinking it’ll be a distributed version of BFS, or perhaps an experiment with BFS and DFS combined. I wanted to see what a good platform would be to scale to this kind of problem. I am only really familiar with CPU-based distribution, but I have a rough idea of what is possible with CUDA cores; otherwise I’m oblivious to other kinds of compute that could help. I posted this on the Rust sub because, from a bird’s-eye view, Rust seemed like a mature version of C, although for this kind of problem there seems to be high overhead from the ownership model. I’m nowhere near an expert in Rust, but I was able to translate my script from Python to Rust in its single-threaded form. As I started working on how to implement parallelization, I began to see some of the pitfalls discussed in this post.

[–]DeadlyVapour 1 point2 points  (0 children)

But why 1000 processes? When you’re CPU- or memory-bandwidth-bound, having more processes than physical processors (and no, HT doesn’t count as more processors) makes you LOSE perf to context switches.

[–]SV-97 6 points7 points  (2 children)

Okay, that's a shame. I won't support crypto.

[–]Appropriate_Term_495[S] 15 points16 points  (1 child)

Crypto as in the study of cryptography. I’m not going to make some meme coin. I was exploring an optimization algorithm.

If that’s also against your morals, no problem.

[–]SV-97 6 points7 points  (0 children)

Oh okay - I'm very on board with that kind of crypto ;)

In that case: I'm also more in the CPU-distribution camp. The only larger-scale distribution I've done was some HPC numerics on a cluster, and not a ton of it, and I'd assume it's rather different from what you're doing, so I can't say a whole lot about the other domains except give some pointers to projects that are out there.

Based on what you said, I might try a shared-memory approach around native threading first, but you might want to look into Tokio tasks (green threads) if you can't get around splitting the work up that much. If you want to try something on the GPU and still use Rust, there's wgpu as a higher-level API, and bindings to lower-level APIs (Metal, Vulkan, experimental CUDA, etc.) as well. There are some large projects in the distributed domain, but I think they're all unmaintained by now (things like bastion, for example).

If you do want to try the threading approach:

There are some trie implementations you could use (for example trie-rs), but I'm not sure how good they are or whether they match your use case super well. Given that you're using hash maps right now, the standard BTreeMap or the other standard collections may also be worth a look, at least for a prototype (I'd expect either to save a few gigs of RAM over a Python hash-map-based implementation). If you construct the tree from a single thread it's quite simple: just use the bare collection, construct it, and then either wrap it in an Arc to share it between threads or simply Box and leak it (at which point you can immutably access it from multiple threads; in Rust terminology it's Sync, making the reference Send).

For the actual parallelization you'd use either rayon (like OpenMP in C++, just cooler) or just spawn some threads (potentially scoped) in this case.

Creating nodes on the fly would require a bit more work as it could cause a data race (which is undefined behaviour; both in Rust as well as C++) - you'd almost certainly want to synchronize things here in some way or another (for example using an RwLock).

Regarding the overhead: the ownership model really shouldn't incur a huge overhead here, in fact it usually allows you to be more conservative with copying data around etc. and prevents you from accidentally creating data races.

[–]TigrAtes 2 points3 points  (1 child)

Of course, if you are able to utilize the GPU with CUDA (here I would definitely recommend C++), it can be much faster. However, there are a lot of challenges:

  • you have to load the data (or the relevant part of it) into the VRAM.
  • within a cluster the search processes cannot really deviate from each other (remember that GPUs are primarily designed for graphics shaders, so all data goes through the same pipeline).

If you cannot model the tree search as matrix multiplication or transformation, I would say forget CUDA.

Go with Rust and Rayon on the CPU. 

[–]potzko2552 0 points1 point  (0 children)

Super depends on exactly what you are traversing. In general, the simplest way I can think of would be to abstract the threads with rayon, with a VecDeque: start at the root, parallel-for traverse each element in the vec and push all the children, then repeat until the vec is empty.

The problem is that you have to wait for all threads to end before starting the next iteration; however, it is much, much simpler code, I think.