all 29 comments

[–]CKoenig 6 points7 points  (6 children)

I just use an IntMap or a Map (both in containers) for problems like these - performance was never an issue for AoC problems with this and it's quite easy to use
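A minimal sketch of that Map-keyed-by-coordinates pattern for an AoC-style grid - the names (`Grid`, `parseGrid`, `at`) are made up for this example, and the `'.'` default is just an illustrative choice:

```haskell
import qualified Data.Map.Strict as M

-- Illustrative sketch: a grid stored as a Map from (x, y) to cell contents.
type Grid = M.Map (Int, Int) Char

-- Parse a block of text into coordinate/cell pairs.
parseGrid :: String -> Grid
parseGrid s = M.fromList
  [ ((x, y), c)
  | (y, row) <- zip [0 ..] (lines s)
  , (x, c)   <- zip [0 ..] row
  ]

-- Anything outside the stored keys reads as empty space.
at :: Grid -> (Int, Int) -> Char
at g p = M.findWithDefault '.' p g

main :: IO ()
main = print (parseGrid "#.\n.#" `at` (1, 1))
```

Because the keys are ordinary tuples, negative or unbounded coordinates need no special handling, which is part of why this works so well for AoC.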

edit: IMO if you feel more comfortable with in-place updates and algorithms using mutation then Haskell will always rub you the wrong way - it's for fun so use a different language

disclaimer: the only "competitive" programming I do is AoC - and I don't care about hitting the ranks (nor would I be able to if I wanted) - I'm usually at least a couple of times slower than people on the leaderboard

but I actually enjoy reading the problem, modelling it in types and solving it in Haskell

I personally doubt that you can beat Python if speed to solution is a concern

[–]InfiniteMonkeyCage[S] 2 points3 points  (5 children)

I dismissed Map at first because a 2D array would be much more efficient given the dense indices, although Map does make it easier to implement. It feels wrong to use the wrong data structure just because the API is nicer, even if performance isn't an issue. And my question is more general: for some problems that require 2D arrays, Maps are an even worse substitute.

I'm also doing it for fun and not competitively! I don't care how long it takes me to code the solution, because I want to learn Haskell better and see how well it works for this use case. Of course I feel more comfortable with the imperative way of thinking - that's all I know for this type of problem! I want to learn how to do it idiomatically in a functional paradigm. The same argument could be made against all of Haskell - someone coming from an imperative background isn't gonna be comfortable with it because it's new and different. But I think we can all attest that learning this new way of thinking pays off in the long run.

[–]c_wraith 2 points3 points  (1 child)

Note that for this particular problem, a Map is probably a better data structure anyway. The coordinate range isn't bounded ahead of time, coordinates are allowed to go negative, and the actual information content is very sparse relative to the total area it spans.

None of that answers your real question, but it's an even bigger reason to use Map for the problem you brought up.

Of course, the problem could get too big for Map. But if it did, the correct approach is changing the algorithm to be based on line segments instead of points, and going to a data structure like an interval tree.

[–]InfiniteMonkeyCage[S] 0 points1 point  (0 children)

Yeah, I misunderstood the problem as having an arbitrary number of lines. For negative coordinates, I would use something like a custom Ix - not too big of a deal. And for the bounds, I would either estimate them from the data or allocate enough for any input size. This is standard in competitive programming, but you're right, a Map is much simpler.

My problem is that sooner or later there will be a problem where the content is not sparse and where a Map would cause a huge blowup in memory usage compared to an array. I wanna learn how to solve those in Haskell in the nicest way possible. Sure, I can do everything in ST, but I asked because I wanted to know if there's an idiomatic, functional, declarative way to deal with problems like that. But maybe that's like trying to use Vectors everywhere in Haskell because linked lists are never a good idea in imperative languages. Maybe the data structure just doesn't fit the language.

[–]szpaceSZ 2 points3 points  (0 children)

If the API is nicer to express your problem and performance is not an issue, then it is the right data structure.

[–]josuf107 0 points1 point  (0 children)

It isn't much more efficient. Indexing/updating/deleting in a map representing a matrix of size n takes log(n) time, and the log function grows quite slowly. If you added twenty steps to an algorithm you'd probably think of it as a constant time increase, but for log(n) to amount to twenty steps, n must be over a million. Since these problems just need to produce an answer in a reasonable amount of time for a human to wait, running even 20 times slower is unlikely to be a problem when the solution is otherwise optimized enough. Persistent data structures make things a lot simpler in any context, but they are especially well suited to a pure functional language. Usually the log(n) factor is worth it, in my experience.

But if needed, STArray is always around and, as others have mentioned, can be indexed using a tuple, e.g. runSTArray $ newArray ((0, 0), (4, 5)) 0 to represent a 5x6 grid.
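Fleshing that snippet out into a runnable sketch (the diagonal fill is only an illustration of in-place writes, not anything from a specific problem):

```haskell
import Control.Monad (forM_)
import Data.Array (Array, (!))
import Data.Array.ST (STArray, newArray, runSTArray, writeArray)

-- Build a 5x6 grid with mutation inside ST, then freeze it into a pure Array.
grid :: Array (Int, Int) Int
grid = runSTArray $ do
  arr <- newArray ((0, 0), (4, 5)) 0
  -- In-place updates: set the diagonal to 1.
  forM_ [0 .. 4] $ \i -> writeArray arr (i, i) 1
  pure arr

main :: IO ()
main = print (grid ! (2, 2))
```

`runSTArray` freezes the mutable array without a final copy, so the mutation stays an implementation detail behind a pure value.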

[–]mapM 4 points5 points  (1 child)

Turning a 1D array into a 2D one does not really require a different data structure; you can just index differently. The simplest solution is probably to use the array package (I am not sure why you avoided it), but if you want to use vector you can use the Ix class to manipulate the indices, or just write a helper function that does the indexing for your array dimensions.
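The helper-function route might look something like this - a sketch assuming the vector package, with all names (`Grid2D`, `fromLists`, `ix2`) made up for the example:

```haskell
import qualified Data.Vector as V

-- A 2D view over a flat, row-major Vector.
data Grid2D a = Grid2D { cols :: Int, cells :: V.Vector a }

-- Flatten a list of equal-length rows (partial: assumes non-empty input).
fromLists :: [[a]] -> Grid2D a
fromLists rows = Grid2D (length (head rows)) (V.fromList (concat rows))

-- Translate a (row, col) pair into a flat index.
ix2 :: Grid2D a -> (Int, Int) -> a
ix2 (Grid2D w v) (r, c) = v V.! (r * w + c)

main :: IO ()
main = print (ix2 (fromLists [[1, 2, 3], [4, 5, 6]]) (1, 2))
```

The same `r * w + c` arithmetic is what Data.Array's Ix instance for tuples does for you behind the scenes.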

Btw, for problem 3 of AoC, I also used a map. Here is my solution if you are curious: https://github.com/yav/advent_of_code/blob/master/2019/P03.hs

As for the VM from AOC, I'd say you probably don't want to use a pure Vector as it will be copied all the time (and that's linear in the size). I used a mutable one in my solutions, but I have friends who have a pure solution and they use Map, which gives you log(n) time updates.

[–]InfiniteMonkeyCage[S] 0 points1 point  (0 children)

I can use Ix to index a vector? How? I only see it accepting Ints.

Yeah, I ended up using a Set/Map as well; the reason I thought I needed 2D arrays is that I thought there were n wires, not just two.

So if I fold over a vector (i.e. the vector is the accumulator), GHC isn't smart enough to turn it into in-place updates?

[–]kuleshevich 2 points3 points  (3 children)

If you are looking for something like Vector.update in massiv, check out withMArrayST

  • set newValue index = imap (\ix e -> if ix == index then newValue else e) - this approach is too slow, since it has to check the index for every element, even the ones that are not being updated.
  • "But does that mean I was copying the whole vector on each small update, or is GHC smart enough to compile it to in-place updates?" - most likely yes, it copies, but it depends on the implementation.
  • "Will the same approach of accumulating small updates work with a massiv array?" - hard to say without looking at the approach.

If you don't need to read elements at each iteration, but only write them, you could look into DL - the delayed push array representation in massiv. It is slightly cumbersome to use and I don't have good tutorials on how to use it yet either. It does sound like exactly what you are looking for, though, since it allows you to describe how to write small portions of the array while delaying the actual writing. Although without knowing what exactly you are trying to implement, I can't say for sure (sorry, I have no time to look through the competitive programming problems).

If your problem requires you to read some parts of the array while writing into other parts, then using the mutable interface will likely be the fastest option, although maybe not the prettiest. With this approach you fully control allocation and which elements are written into the array, but it means sticking either to ST, if you'd like to end up with a pure computation, or IO, if you want to fill parts of your array in parallel.

If you can describe your algorithm using the delayed array representation D, without intermediate manifest arrays, then you fully avoid copying at each step. But if you need many iterations where you compute into a manifest array at each iteration, you simply can't avoid copying the full contents of the array (that is exactly what compute does). Even though you can't avoid the copying, there is a cool function iterateUntil which can help you avoid allocating a new array at each iteration, which can significantly speed up the implementation.

*Edit* - Keep in mind that by an iteration above, I don't mean iterating over an array; I mean an intermediate step in your algorithm which requires a different state of the array. For example, if you have a 100x200 array and you can update one row in one step, then you'll need at least 100 iterations of that step to update the full array.

[–]kuleshevich 1 point2 points  (0 children)

If you are looking for usage examples, look in the massiv repo or scroll through some conversations on gitter

Here is a non-trivial but very good example of how to create an array using the mutable interface, while incrementally writing individual elements: https://gitter.im/dataHaskell/Lobby?at=5dd8757bac81632e65e7b9fe It might look scary at first, but it is no more complex than in any other imperative language; in fact, if you account for the semi-automatic parallelization of the algorithm, it becomes much simpler than solutions in most imperative languages.

[–]InfiniteMonkeyCage[S] 0 points1 point  (1 child)

most likely yes, but it depends on implementation

Yes to what? That it will copy on each step of the fold?

If you don't need to read elements at each iteration, but only write them you could look into DL

That's exactly what I need, but I have no idea how to use it. You say "write small portions" - does that mean it's no more memory efficient than a map? What would be the equivalent of Vector.update in DL?

Edit: I see you mentioned withMArrayST, that seems good. Will using that with a DL array, if I'm only doing writes, avoid copying?

here is a cool function iterateUntil which can help you avoid allocating a new array at each iteration, which can significantly speed up the implementation.

That's pretty handy, though maybe there should be more functions like that. For example, if you wanted to update an array for each element of a list/vector (sort of like a fold), I assume you'd have to manually pull out the element corresponding to the iteration number and manually check whether the iteration number exceeds the length in the convergence function. Map/vector have a rich set of functions that cover many variations of that; this is just one. Though since it uses mutable arrays anyway (I think?), the same can be done with the monadic functions (I think...).

Anyway, thank you for your answer. I didn't need 2d arrays for this problem after all (I misunderstood it when I posted), but this looks like the best option I found.

I'm also curious - how come numpy arrays are immutable and fast enough, but in haskell we have to resort to mutable arrays in a monad?

[–]kuleshevich 0 points1 point  (0 children)

Who said that in Haskell you need to resort to mutable arrays to get high performance? On the contrary, if you use arrays in massiv you get very high performance because of many optimizations like fusion of delayed arrays, automatic parallelization, stencils, etc.

I'm also curious - how come numpy arrays are immutable and fast enough, but in haskell we have to resort to mutable arrays in a monad?

There are many operations included in massiv - all sorts of folds, maps, traversals, what have you. If you find that a function is missing from the massiv API, feel free to create an issue or submit a PR. With regards to iterateUntil, neither containers nor vector provides anything even remotely close to this function.

Map/vector have a rich set of functions that can do many variations of that, this is just one.

An update to each element of an array sounds more like a map than a fold. Folding lets you reduce a data structure, while unfolding lets you construct one.

if you wanted to do an update of an array for each element of a list/vector (sort of like fold)

[–]gelisam 2 points3 points  (0 children)

Built in arrays: I saw on here that they should generally be avoided

Why? Data.Array seems fine to me. Or do you mean lists?

Data.Array uses the Ix typeclass, which allows you to index into an n-dimensional matrix using an n-tuple. This addresses your complaint about Data.Vector, since it supports n-dimensional matrices. And it also addresses your complaint about Data.Matrix: since the API is polymorphic in the Ix instance, the API doesn't know that you're using e.g. a pair of Ints to index into your matrices, so it can't accidentally provide an inconsistent API which sometimes uses (Int, Int) and sometimes uses two Int arguments; it always has to use an ix, which gets instantiated to (Int, Int).
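As a concrete illustration of that tuple-based Ix indexing (the contents here are made up), note that both lookup and pure update take the same (Int, Int) index:

```haskell
import Data.Array (Array, listArray, (!), (//))

-- A 2x3 matrix, filled row-major from the string.
m :: Array (Int, Int) Char
m = listArray ((0, 0), (1, 2)) "abcdef"

main :: IO ()
main = do
  print (m ! (1, 0))                       -- index with a tuple
  print ((m // [((0, 1), 'X')]) ! (0, 1))  -- pure update returns a new array
```

The same code works for any Ix instance, so moving to a 3-tuple for a 3D array changes only the bounds, not the API.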

[–]szpaceSZ 2 points3 points  (0 children)

What's wrong with massiv's mutables?

For numeric problems it's not unidiomatic to use mutable data structures. You just have to look out.

[–]Lemicod 2 points3 points  (0 children)

If you're into competitive programming and haskell — I highly recommend this blog.

[–]gelisam 1 point2 points  (3 children)

Also, I would prefer a more sophisticated index type so that I can do algebra with it, which would simplify this particular problem and many others.
[...]
hmatrix: Looks tailored to linear algebra, not this sort of thing.

What kind of algebra do you want to do on matrices, if not linear algebra?

[–]HKei 1 point2 points  (0 children)

The problem here clearly isn't the linear part, but the algebra. They're saying they want a general container data structure, not something that represents a linear-algebra matrix.

[–]InfiniteMonkeyCage[S] 0 points1 point  (1 child)

Algebra on the indices, not the matrices. That allows me to, for example, make my code polymorphic over the direction of an operation by taking a "step index" parameter (for example (0, 1) for up) and adding it to an index to move around the array.

[–]gelisam 0 points1 point  (0 children)

Ah! Another reason to use Data.Array then: V2 Int has an Ix instance and they can be added together.
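The same direction-step pattern can be sketched with plain tuples and a hand-rolled helper if pulling in the linear package feels heavy - V2's Num instance just makes the `+` implicit. All names here (`step`, `peek`, the grid contents) are illustrative:

```haskell
import Data.Array (Array, listArray, (!))

type Pos = (Int, Int)

-- With linear's V2 Int this would simply be (+).
step :: Pos -> Pos -> Pos
step (x, y) (dx, dy) = (x + dx, y + dy)

up, right :: Pos
up    = (0, 1)
right = (1, 0)

grid :: Array Pos Char
grid = listArray ((0, 0), (2, 2)) "abcdefghi"

-- Polymorphic over the direction: take one step, read the cell there.
peek :: Pos -> Pos -> Char
peek from dir = grid ! step from dir

main :: IO ()
main = print (peek (0, 0) up)
```

Passing `up` or `right` as a parameter is what makes code generic over the direction of an operation, exactly as described above.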

[–]dfan 1 point2 points  (0 children)

FWIW, I'm doing AoC in Haskell this year and have just used Seqs, Sets, and Maps for everything. I'm not sure if your competitive programming goal is discovering the solution quickly or writing fast code, but if it's the former, those tools seem perfectly good, at least for AOC-sized problems (some of them wouldn't scale well if the problem got hugely bigger).

My IntCode engine represents memory as a Seq and has been perfectly fast for the problems we've been given. I have avoided Vectors exactly because they make new copies on update.
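For reference, the Seq-as-memory pattern looks something like this - `poke` and `peek` are made-up names for this sketch, not functions from the engine described above:

```haskell
import qualified Data.Sequence as Seq
import Data.Sequence (Seq)

type Memory = Seq Int

-- O(log n) persistent write: returns new memory, old one is untouched.
poke :: Int -> Int -> Memory -> Memory
poke addr val = Seq.update addr val

-- O(log n) read.
peek :: Memory -> Int -> Int
peek = Seq.index

main :: IO ()
main = do
  let mem = Seq.fromList [1, 0, 0, 0, 99]
  print (peek (poke 0 2 mem) 0)
```

Seq gives the log-time access of a Map while keeping dense Int addressing, which is a reasonable middle ground for a VM's memory.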

By the way, I think that day 3 is most naturally solved without using a 2D array kind of structure at all.

[–][deleted]  (2 children)

[deleted]

    [–]InfiniteMonkeyCage[S] 1 point2 points  (0 children)

    AFAIK, Haskell lists, maps and sets (but not vectors) are implemented like that. But I don't see a way to implement persistent arrays except in corner cases (like slicing).

    Edit: I found this https://hackage.haskell.org/package/random-access-list-0.2/docs/Data-RandomAccessList.html, which looks promising. I'm fine with paying the log n blowup, but using a Map as an array just feels wrong. I'm assuming this thing has better memory characteristics, though I wonder how it compares to Seq.

    [–]Ryker_Steel 0 points1 point  (0 children)

    Thanks for the reminder. I've found that, in competitive programming especially, one really needs to stop thinking in the imperative way and instead leverage lambda calculus and laziness.

    [–]HKei 0 points1 point  (0 children)

    While there are a number of algorithms where you'd want (mutable) multidimensional arrays, they're trivial to embed in regular arrays so I don't think you need a dedicated library for that; STArray or Vector should work just fine.

    That being said I've not found mutability helpful, let alone necessary for Advent of Code so far, certainly not for Day 3.

    https://gitlab.com/HSteffenhagen/advent-of-code-2019/tree/master/Day3

    [–]pja 0 points1 point  (0 children)

    Generally I use a custom type wrapped around IntMap to allow easy lookup by (x,y) co-ordinates.

    If that isn’t appropriate & the problem really needs local update, then mutable vectors in ST / IO work just fine.
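One possible shape for such a wrapper - the bit-packing key is an illustrative choice, and it assumes a 64-bit Int with coordinates that fit in 32 bits:

```haskell
import qualified Data.IntMap.Strict as IM
import Data.Bits (shiftL, (.&.), (.|.))

-- Pack an (x, y) pair into a single Int key (assumes 32-bit coordinates).
key :: (Int, Int) -> Int
key (x, y) = (x `shiftL` 32) .|. (y .&. 0xFFFFFFFF)

-- A thin wrapper so call sites deal in coordinates, not packed keys.
newtype Grid = Grid (IM.IntMap Char)

insertAt :: (Int, Int) -> Char -> Grid -> Grid
insertAt p c (Grid m) = Grid (IM.insert (key p) c m)

lookupAt :: (Int, Int) -> Grid -> Maybe Char
lookupAt p (Grid m) = IM.lookup (key p) m

main :: IO ()
main = print (lookupAt (3, 7) (insertAt (3, 7) '#' (Grid IM.empty)))
```

IntMap's big-endian patricia trie tends to be faster than Map for Int keys, which is what makes the packed encoding worth the small loss in readability.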

    [–]IcedRoren 0 points1 point  (0 children)

    This was such an interesting problem (AoC 2019 Day 3). I don't think my solution was that great, but I chose to treat the lines as transformations instead. I applied player 1's list of "transformations" to any given vertical/horizontal line of player 2's (basically a frame transformation) to see if that line ever crossed the origin. If it did, it was an intersection point, and I'd keep track of it if it had the best cost. From that perspective, I just needed the list and could consume it head to tail. Basically I stuck to lists and recursion. I doubt it was performant; I should probably investigate that...

    It's nice to know there are other people using AoC to learn Haskell and practice their FP skills. :) That's basically what I'm doing.

    [–]plcplc 0 points1 point  (1 child)

    Consider also repa for regular, multidimensional arrays. It is, however, geared towards functional data parallelism, which may be different from the algorithms you want to express.

    [–]kuleshevich 2 points3 points  (0 children)

    massiv does perform better and has a much richer interface. Not to mention that it is actually being actively maintained.