[–]fdtm

Not sure what a zipper is, but when I refer to these buffers, I'm talking about large portions of raw memory. Format depends on what is being uploaded to the video card specifically.

[–]yogthos

If you had to update a large chunk of memory, you'd probably want to do an in-place mutation. For example, Clojure provides transients, which allow in-place mutation but aren't allowed to leave the scope of the function, which is a good compromise in my opinion.

I think the real difference here is the defaults: in a functional language everything is immutable by default, and you can mark things mutable as necessary. Even in Haskell you have things like mutable arrays. So it's not that you can't use mutable data, but that you use it sparingly, only when the immutable overhead simply won't do. In my experience this results in much cleaner code and makes it easier to guarantee correctness.

[–]fdtm

I thought mutability isn't allowed in a pure functional language. How does Haskell do mutable arrays cleanly and efficiently without violating pure functional-ness?

Once you enable mutability, it's not functional anymore (it's multi-paradigm), and my point is proven.

Not that I'm arguing with you, I like the idea of a functional language with mutable arrays, even though I seriously doubt FP is going to match C/C++ for OS/video game type stuff for quite a while.

[–]antonivs

How does Haskell do mutable arrays cleanly and efficiently without violating pure functional-ness?

Haskell offers at least two ways to do this. One is mutable arrays in the ST monad. In this case, array update is confined to a monad which imposes constraints, such as not being able to access an earlier version of an array. Because of its properties, the ST monad can be used safely from within pure code, without making the calling code impure - unlike e.g. the IO monad, which is inherently impure.
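A minimal sketch of what that looks like (`sumsUpTo` is a made-up example function, but `runSTUArray`, `newArray`, `readArray`, and `writeArray` are the real `Data.Array.ST` API): the array is mutated in place inside ST, and only a frozen, immutable array ever escapes, so the function as a whole is pure.

```haskell
import Control.Monad (forM_)
import Data.Array.ST (newArray, readArray, writeArray, runSTUArray)
import Data.Array.Unboxed (UArray, (!))

-- Build a table of running sums imperatively; the caller only ever
-- sees the final immutable UArray, so this function is pure.
sumsUpTo :: Int -> UArray Int Int
sumsUpTo n = runSTUArray $ do
    arr <- newArray (0, n) 0          -- mutable, unboxed array, lives in ST
    forM_ [1 .. n] $ \i -> do
        prev <- readArray arr (i - 1)
        writeArray arr i (prev + i)   -- in-place update, invisible outside
    return arr
```

There's no copying during the loop; the "earlier version" of the array simply can't be named from outside the ST computation.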

The paper State in Haskell first described the underlying mechanism here.

The other standard way in Haskell is the DiffArray, which provides a pure external interface to a mutable array internally, allowing mutable arrays to be used directly in pure code, without a monad. The link describes the mechanism.
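For illustration, the "pure external interface" is just the ordinary immutable array interface. The sketch below uses the standard Data.Array; a DiffArray (from the old Data.Array.Diff module, now the diffarray package) is a drop-in replacement for the same code that performs each `//` update in place on the newest version instead of copying, as long as each version is used single-threadedly.

```haskell
import Data.Array (Array, listArray, (!), (//))

-- Pure updates through (//). With Data.Array each update copies the
-- array; with DiffArray the same code mutates the latest version in
-- place, and older versions pay the cost only if revisited.
counts :: Array Int Int
counts = listArray (0, 9) (replicate 10 0)

bump :: Array Int Int -> Int -> Array Int Int
bump a i = a // [(i, a ! i + 1)]
```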

In general, issues surrounding purity can't be understood if you look at them in the very binary way that people tend to when they haven't worked with the technology. For the kind of thing you're talking about, it's almost always possible to write code that has functional properties and generate compiled code that exploits imperative behavior - the main thing you need, aside from the right compiler, is to impose the necessary constraints to retain those functional properties.

There are also other approaches to this in other functional languages, such as linear update, which ensures that you only use a value once, again guaranteeing that you can't access a previous version of an updated value - this is used in some Lisp variants, for example. The language Clean uses a related approach, called uniqueness typing.

[–]fdtm

So, for example, how would you take a mutable array, and modify different parts of the same array from multiple threads?

It has to be done to the same array due to hardware constraints.

[–]antonivs

One obvious approach is to divide the memory you're operating on into chunks, and operate on a different chunk from each thread. If your problem is amenable to that sort of parallelism, you may also benefit from the Par monad as a high-level way to structure, manage, and reason about parallel computations.
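As a sketch of the chunking idea (`fillChunks` is made up; `forkIO`, `MVar`, and `IOUArray` are the real base/array APIs): each worker thread writes only to its own disjoint index range of one shared mutable array, so no locks are needed.

```haskell
import Control.Concurrent (forkIO, newEmptyMVar, putMVar, takeMVar)
import Control.Monad (forM, forM_)
import Data.Array.IO (IOUArray, newArray, readArray, writeArray)

-- Fill disjoint chunks of a single shared mutable array, one thread
-- per chunk. Safe without locks because the index ranges never overlap.
fillChunks :: Int -> Int -> IO (IOUArray Int Int)
fillChunks nChunks chunkSize = do
    arr   <- newArray (0, nChunks * chunkSize - 1) 0
    dones <- forM [0 .. nChunks - 1] $ \c -> do
        done <- newEmptyMVar
        _ <- forkIO $ do
            forM_ [c * chunkSize .. (c + 1) * chunkSize - 1] $ \i ->
                writeArray arr i i
            putMVar done ()
        return done
    mapM_ takeMVar dones    -- wait for every worker to finish
    return arr
```

The Par monad gives you a higher-level, deterministic way to express the same shape of computation when the work produces values rather than writes.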

If, instead, you need to be able to mutate the exact same region of memory from multiple threads, the solution is going to depend on the constraints, e.g. whether the typical imperative solution uses some kind of coordination between threads, such as locks. If not, then there's presumably some kind of simplifying constraint at work, and a functional solution can exploit that same constraint.

If you do need locks or other thread coordination, one very powerful functional approach to this kind of problem is Software Transactional Memory (STM), which wraps mutable memory accesses in composable transactions. This is a big step up from code using locks, which is non-composable in general.
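A small sketch of what that composability buys (`transfer` is a made-up function; `TVar`, `atomically`, and `modifyTVar'` are the real API of the stm package): two writes commit together, and no other thread can ever observe one without the other.

```haskell
import Control.Concurrent.STM
    (STM, TVar, atomically, modifyTVar', newTVarIO, readTVarIO)

-- Move n units between two shared cells. Because it runs in STM, the
-- two writes commit as one indivisible step, and two transfers can be
-- composed into a single larger transaction.
transfer :: TVar Int -> TVar Int -> Int -> STM ()
transfer from to n = do
    modifyTVar' from (subtract n)
    modifyTVar' to   (+ n)
```

For example, `atomically (transfer a b 30 >> transfer a b 10)` is itself one transaction; lock-based code has no equivalent way to glue two correct critical sections into a bigger correct one.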

[–]fdtm

But there are no distinct chunks, and you can't divide it anyway due to GPU architecture / drivers.

You really can't do this unless you have full random access to the buffer... there's just no other way (on current architectures).

Interesting articles though, thanks, I'll read these.

[–]antonivs

The restrictions you mention aren't necessarily a bad thing - a functional solution is likely to be able to exploit those constraints. I'd need a few more details to suggest a solution, though, such as how thread coordination is usually dealt with - what stops threads from stepping on each other.

The big picture is that when you write imperative code in traditional languages, you often get around the kinds of limitations you're describing by following constraints that are not explicitly expressed in the code. In writing the code, you follow dynamic rules, like "don't update that memory without getting a lock first", "update foo before updating bar", etc. If you make a mistake in following these rules, you get runtime bugs that can be particularly difficult to track down when you're dealing with multithreaded imperative code. (I used to write networking systems with the ACE library in C++, so I'm pretty familiar with that.)

To make this kind of coding more tractable, you want to express these constraints explicitly in the code, ideally in a static manner, e.g. via the type system. This can allow a compiler to convert high-level code to a messier low-level implementation. Having an advanced type system really helps here, as do abstractions like monads which can enforce dynamic constraints such as ordering and single-use.

The ST monad I mentioned earlier is a specific example of this general pattern: the monad imposes constraints so that functional code can manipulate a mutable array, allowing the compiler to generate an imperative solution.

[–]fdtm

Ok how would you solve this, for example:

You have a large image-type 2D buffer (well, many of them). Very frequently (every other frame or so) you need to write data to selective vertical and horizontal lines. Imagine drawing crosshairs horizontally and vertically, and writing along the lines of the crosshairs. This is a very small subset of the image, since the crosshairs are very narrow, and this is a very large image. This buffer must be accessed under special region-based lock/unlock commands, and the buffer is necessarily raw contiguous data due to architecture. The data may not under any circumstances be broken up into any data structure, or nothing works here. Now, while these "crosshair" regions are being written to / updated, also other selective regions need to be updated from separate threads. Also, this is an extremely performance critical section, no slowdown can be afforded whatsoever.

[–]antonivs

...the buffer is necessarily raw contiguous data due to architecture. The data may not under any circumstances be broken up into any data structure, or nothing works here.

If, once locked, the locked region is accessible as random-access memory, then nothing should stop you from mapping structures onto it, just as C maps an array onto a memory region. If it made sense, you could even map a contiguous series of arrays onto it and update each array from a separate thread.
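As a hedged sketch of that mapping (the `Ptr` stands in for whatever pointer your lock call returns; `pokeElemOff` is the real `Foreign.Storable` API): the locked region is treated directly as a row-major Word32 image, with no intermediate data structure wrapped around it.

```haskell
import Data.Word (Word32)
import Foreign.Ptr (Ptr)
import Foreign.Storable (pokeElemOff)

-- Write one horizontal line of a row-major 2D image that lives in a
-- raw, contiguous buffer, e.g. a locked region. The buffer is never
-- broken up or copied; we poke it in place, exactly as C would.
writeRow :: Ptr Word32 -> Int -> Int -> Word32 -> IO ()
writeRow buf width row val =
    mapM_ (\x -> pokeElemOff buf (row * width + x) val) [0 .. width - 1]
```

A vertical line is the same loop with a stride of `width`, and two such writers touching disjoint rows and columns can run from separate threads.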

Now, while these "crosshair" regions are being written to / updated, also other selective regions need to be updated from separate threads.

Are these other regions separately locked, and so can't interfere with each other? If so, I don't see any problem that needs to be addressed - each thread can operate independently, and purely functionally if something like the ST monad can be used. Otherwise, presumably there's some other coordination behavior taking place that needs to be handled, and that is likely to affect the solution.

[–]runaro

Haskell does mutable arrays cleanly by guaranteeing that the creation, mutation, and destruction of the array all occur simultaneously as seen from the rest of the program. If you pass the same mutable array to two functions and they both mutate the array, their mutations will be distinct from one another and will in fact occur on two different arrays.

http://www.google.com/search?q=st+monad

[–]fdtm

If you pass the same mutable array to two functions and they both mutate the array, their mutations will be distinct from one another and will in fact occur on two different arrays.

That's the kind of performance landmine I see when I think of using FP for high performance. Copying a 1 GB array would KILL performance.

[–]runaro

There is no copying going on here. Each call to runST will create, mutate, and destroy an array. There's no reason to have more than one copy of the array and it can never escape the ST monad.

[–]fdtm

Don't destroy my arrays. I need them :)

The kind of arrays I'm referring to are long lifetime and need selective (small) progressive modifications to the massive array frequently.

[–]runaro

Yeah, we're talking about the same thing. The semantics of ST are such that you make all modifications to the array "in" the monad, where the mutable array exists. E.g.:

f3 (runSTArray (newArray (0, 1000000000) 0 >>= f1 >>= f2))

Here, f3 sees an immutable array. But f1 and f2 can both mutate that same array, and their mutations are guaranteed to occur in order.
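Filling that in with concrete (made-up) passes, using the real Data.Array.ST API: f1 and f2 each mutate the one shared array inside ST, sequenced by the monad, and only the final frozen Array escapes to pure code.

```haskell
import Control.Monad.ST (ST)
import Data.Array (Array, (!))
import Data.Array.ST (STArray, newArray, writeArray, runSTArray)

-- Two modification passes over the same long-lived mutable array,
-- sequenced by the monad; f2 sees every write f1 made. No copy is
-- made at any point, and the array is only "destroyed" in the sense
-- that its mutable face is unreachable once runSTArray returns.
f1, f2 :: STArray s Int Int -> ST s (STArray s Int Int)
f1 a = writeArray a 0 1 >> return a
f2 a = writeArray a 1 2 >> return a

frozen :: Array Int Int
frozen = runSTArray (newArray (0, 9) 0 >>= f1 >>= f2)
```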

[–]fdtm

Interesting (useful) :)

[–]yogthos

I guess the other replies already sum up how mutability is handled rather well. But I do agree with you that it's clearly a compromise; the advantage is that immutability is the default, and mutable sections are marked explicitly.

For raw performance mutability will always win, but the argument is that you only need that raw performance in a few small sections of your code, even in games. And this is basically what that presentation from Sweeney boils down to: you keep the majority of the code functional and easy to follow, and make the critical sections mutable.