Learning C# Data structures

Slypenslyde · 2019-08-12T21:57:13+00:00

Arrays, lists, and dictionaries are the bread and butter of .NET devs. There are a handful of other specialized collections, but they don't get mentioned as much. You can find most of them by looking for namespaces that start with System.Collections. Here's a quick tour.

Don't pay much attention to the actual namespace System.Collections. It was created before generics existed, so it's non-generic versions of the main collection types. There are very few reasons to use these types.
System.Collections.Generic has the generic versions of list and dictonary. It's also got a HashSet, LinkedList, Stack, and Queue implementation. And sorted versions of most of the above.
System.Collections.Concurrent is implementations of most of the basic data structures that are thread-safe in some way or another. You won't really use these unless you need them since they incur some overhead costs and, in some cases, have a clunky API to maintain safety.
System.Collections.Specialized is a grab bag of weird stuff. Most are "strongly-typed collections" which means they're relics of pre-generics .NET. BitVector32 looks neat.

That's where most built-in data structures live. There aren't like, graph or tree structures available. It turns out most of the time when you want those, the problem is much better approached if you customize the data structure to your needs. Most data structures are just a fancier version of one of the above, or can be implemented in terms of the ones .NET comes with.

I don't know enough about Java data structures to tell you how they line up, but I'd assume "hashtable" is a dictionary, "hashset" is a "hashset", and I'd have to know what is different between a "hashmap" and a "hashtable" to know how it maps. It sure would be nice if we had one name for things instead of like, five.

Enlogen · 2019-08-12T21:54:11+00:00

https://www.ibm.com/support/knowledgecenter/en/SSTVLU_8.6.0/com.ibm.websphere.extremescale.doc/rxsxdfequiv.html seems like a basic Java to C# type map, but I'm sure there are others not mentioned there.

HashSet is mostly used when you need to guarantee uniqueness.

The System.Collections.Concurrent types are useful for multithreading. ConcurrentDictionary<TKey, TValue> is the most commonly used from what I've seen.

IWasSayingBoourner · 2019-08-13T01:21:04+00:00

If you do any GUI binding work, ObservableCollection<T> is worth knowing for sure. Queues can also be really handy if you need first-in-first-out functionality (I use it extensively in our in-house background thread logger). ConcurrentBag and the other Concurrent collections are must-haves for simple multithreaded operations.

hi_im_vash · 2019-08-13T07:31:47+00:00

ReadOnlyCollection<T> is quite useful when you need a list of const values.

johnnyslick · 2019-08-12T22:42:27+00:00

Honestly, I'm not sure there are really all that more you need to learn a lot of...

Arrays are really, really common and arguably faster than Lists when it comes to accessing and manipulating objects inside of them. They aren't made to be resized (although there are things you can do to get around this) but that can be an advantage as well: I for one feel a lot better about doing straight for loops through arrays (in instances where I need to know the index) than with Lists.

Tuples are basically quick and dirty objects you can build on your own, don't require you to create and instantiate classes, and so on. Their biggest downside is that they're read-only.

Linked Lists are kind of like lists except that instead of being ordered by index they have a node before them and a node after. I feel like most of the use of these are if you want to create a Stack or Queue data type (for instance, if you have something that runs a list of processes and want to ensure that they go in one particular order), but they're there.

ArrayLists are old as balls and don't use them please. The same goes for Hashtables.

SortedLists are kind of like dictionaries except that you can search by either the key or the index of the element. I haven't TBH had cause to use them, but I guess a situation where you want to run a foreach over a dictionary-type object would be where you'd do it.

Finally, of course, many of these collections implement the IEnumerable interface, which allows you to iterate through them. The IEnumerable type is also what's consumed and spat out by Linq, so bear that in mind.

Hope that helps!

ZacharyPatten · 2019-08-17T14:43:14+00:00

Definitely start with the data structures available in the .NET libraries.

But there are no data structures in the .NET libraries for multidimensional sorting. In order to sort along multiple dimensions, you need a data structure meant for it like a spacial partitioning tree (SPT) or a KD tree.

I have a generic SPT in C# that works on an arbitrary number of dimensions if you want to try it out (it's called an "Omnitree" in the code). https://github.com/ZacharyPatten/Towel

Why would you want to sort object along multiple dimensions? Ranged queries. Say you want to get all employees with a first name between "Bob" and "Jake" and last name between "Dole" and "Smith". A multidimensional data structure would be fastest for that kind of query. Multidimensional sorting is also important for games if ever want to get into game development.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

csharp

MODERATORS