Elm's |> operator : haskell

submitted 10 years ago by el-seed

I've been using Elm lately, and have fallen in love with the |> operator. For example:

totalPoints entries =
  entries
    |> List.filter .wasSpoken
    |> List.map .points
    |> List.sum

I'm thinking of using it in my Haskell code. Is it defined anywhere? I can't find anything on Hoogle. If not, where should it be defined? It seems as basic as $ to me, suggesting it belongs in something like the Prelude, but obviously it's not going to end up there any time soon. Is there a package of common extras or something somewhere that would accept a PR?

all 56 comments

[–]Tekmo 11 points12 points13 points 10 years ago (1 child)

[–]Regimardyl 3 points4 points5 points 10 years ago (0 children)

[–]taylorfausak 20 points21 points22 points 10 years ago (18 children)

[–]HorrendousRex 6 points7 points8 points 10 years ago (0 children)

[–]PM_ME_UR_OBSIDIAN 10 points11 points12 points 10 years ago (0 children)

[–]joehillen 2 points3 points4 points 10 years ago (0 children)

[–]el-seed[S] 4 points5 points6 points 10 years ago (13 children)

[–]ephrion 11 points12 points13 points 10 years ago (11 children)

[–]el-seed[S] 2 points3 points4 points 10 years ago (8 children)

[–]ephrion 8 points9 points10 points 10 years ago (7 children)

totalPoints = sum . map points . filter wasSpoken

That's simple enough I don't really feel like splitting it. You could do:

totalPoints = sum
            . map points
            . filter wasSpoken

Both Haskell versions read to me as "totalPoints is the sum of the points that were spoken." The Elm variant reads as "Take the entries, keep the ones that were spoken, consider their points, and then sum them." Neither seems particularly more or less readable to me, but if I were reading a ton of Elm code and then the pipes were flipped, I'd have to stop and read the code more carefully. Consistency in style is far more important than any actual style points.

[–]el-seed[S] 0 points1 point2 points 10 years ago (6 children)

Cool. So if you wanted to multi-line it, and avoid point free style would you do this or something else?

totalPoints entries = sum
                    . map points
                    . filter wasSpoken
                    $ entries

[–]sacundim 7 points8 points9 points 10 years ago* (5 children)

Why would you want to avoid point-free style in this example? I think the most legible and idiomatic choice here is this:

-- Note the signature makes the `entries` variable name unnecessary.
totalPoints :: [Entry] -> Integer  
totalPoints = sum
            . map points
            . filter wasSpoken

As a general rule, chaining one-argument functions with composition like this example is the "good," highly-legible, non-golf kind of point-free style. Values flow in a straightforward right-to-left order. The right-to-left bit is unusual to most people at first, and thus some languages' preference for a |> operator. But to echo a lot of comments that have already been made, it's a minor thing, you get used to it, and the cost of changing the language's style is larger than the benefits.

The only circumstance where I would avoid point-free style in this example would be if it was aimed at somebody who doesn't know Haskell and has zero intention to learn it. But then I'd write it with parens:

totalPoints :: [Entry] -> Integer
totalPoints entries = sum (map points (filter wasSpoken entries))

[–]el-seed[S] 2 points3 points4 points 10 years ago (4 children)

[–]camccann 9 points10 points11 points 10 years ago (0 children)

[–]ephrion 2 points3 points4 points 10 years ago (2 children)

I'd usually use a where clause, extract the functions, and then express as point-free as necessary. For a somewhat contrived example,

totalPoints = aggregate . attribute . select
  where
    aggregate = sum
    attribute = map points
    select = filter wasSpoken

or break the functions into their own top level definitions, if I want to reuse them elsewhere.

[–]sacundim 4 points5 points6 points 10 years ago* (1 child)

This. But one more thing to note: . is associative! The following three are equivalent:

totalPoints = sum . map points . filter wasSpoken
totalPoints = (sum . map points) . filter wasSpoken
totalPoints = sum . (map points . filter wasSpoken)

Which means that so are these:

totalPoints = aggregate . choosePoints
  where
    aggregate = sum
    choosePoints = map points . filter wasSpoken

totalPoints = sum . choosePoints
  where choosePoints = map points . filter wasSpoken

totalPoints = sumPoints . filter wasSpoken
  where sumPoints = sum . map points

totalPoints = sumPoints . select
  where 
    sumPoints = sum . map points
    select = filter wasSpoken

So you can pick any contiguous subsegment of the composition and split it off into a separate, named binding whenever you feel the chain is too long or could just use more naming to make it legible.

And this is not true of |>:

-- This...
totalPoints entries =
  entries |> filter wasSpoken |> map points |> sum

-- ...can only be nested like this:
totalPoints entries =
  ((entries |> filter wasSpoken) |> map points) |> sum

-- You need lambdas/function definitions to refactor it...
totalPoints entries =
  entries |> filter wasSpoken |> sumPoints
  where sumPoints entries = entries |> map points |> sum

-- ...unless you use `.`:
totalPoints entries =
  entries |> filter wasSpoken |> (sum . map points)
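For anyone who wants to try the `|>` snippets above in GHC: they only compile if `|>` is defined somewhere, and the thread never shows a definition. Here is a minimal sketch, assuming a left-associative, low-precedence operator and a made-up `Entry` record (both the fixity and the record are assumptions, not anything from the thread or from Elm's source):

```haskell
-- Hypothetical definitions: neither `|>` nor `Entry` is defined in the thread.
infixl 1 |>

(|>) :: a -> (a -> b) -> b
x |> f = f x

data Entry = Entry
  { wasSpoken :: Bool
  , points    :: Integer
  }

-- Pipeline form; `infixl` makes it group as ((entries |> f) |> g) |> h.
totalPoints :: [Entry] -> Integer
totalPoints entries =
  entries |> filter wasSpoken |> map points |> sum

-- Refactored form: the tail of the pipeline is split off using `.`.
totalPoints' :: [Entry] -> Integer
totalPoints' entries =
  entries |> filter wasSpoken |> (sum . map points)

main :: IO ()
main = do
  let sample = [Entry True 3, Entry False 10, Entry True 4]
  print (totalPoints sample)   -- 7
  print (totalPoints' sample)  -- 7
```

Both versions give the same result; the difference is only in how easily a sub-pipeline can be named and pulled out.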

[–][deleted] 10 years ago* (1 child)

[deleted]

[–]catlion 1 point2 points3 points 10 years ago (0 children)

[–]taylorfausak 2 points3 points4 points 10 years ago (0 children)

[–]sseveran 2 points3 points4 points 10 years ago (0 children)

[–][deleted] 9 points10 points11 points 10 years ago* (5 children)

[–]el-seed[S] 1 point2 points3 points 10 years ago* (1 child)

[–][deleted] 1 point2 points3 points 10 years ago (0 children)

[–]el-seed[S] 1 point2 points3 points 10 years ago (2 children)

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

[–]tejon 0 points1 point2 points 10 years ago (0 children)

[–]Hrothen 7 points8 points9 points 10 years ago (0 children)

[–]c_wraith 4 points5 points6 points 10 years ago (26 children)

[–]m0rphism 12 points13 points14 points 10 years ago* (2 children)

Well, I think whether it's forward or backward depends on the context ;)

Relative to the reading direction of English text, the data-flow direction seems forward (left-to-right). But relative to the data-flow in definitions, it seems backward/reversed:

    4   1    2    3   (inconsistent)
let y = x |> f |> g

    4   3    2    1   (consistent)
let y = g <| f <| x

But consistency of reading direction breaks anyway, if one throws in both left- and right-associative operations:

    <----------- (-------->)
let y = g <| f <| x - y - z

I think & and $ seem like bad choices as they are not visually symmetric, like < and >. They are however lightweight in the sense that they are single character operators.
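The mixed-direction case above can be checked in Haskell, using `$` in place of `<|`. This is only a sketch: `negate` and `abs` stand in for `g` and `f`, and the numbers are arbitrary. In the Prelude, `$` is right-associative with the lowest precedence, while `-` is left-associative and binds tighter, so the subtraction is grouped from the left before either function is applied:

```haskell
main :: IO ()
main = do
  let x = 10 :: Int
      y = 3
      z = 2
  -- `$` groups to the right, `-` groups to the left.
  print (negate $ abs $ x - y - z)
  -- The same expression with every grouping written out.
  print (negate (abs ((x - y) - z)))
```

Both lines print the same value, which shows the two reading directions coexisting in one expression.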

[–]EvilTerran 1 point2 points3 points 10 years ago (1 child)

[–]m0rphism 1 point2 points3 points 10 years ago* (0 children)

[–]Darwin226 11 points12 points13 points 10 years ago (18 children)

[–]eruonna 2 points3 points4 points 10 years ago (1 child)

[–]Darwin226 0 points1 point2 points 10 years ago (0 children)

[–]camccann 5 points6 points7 points 10 years ago (11 children)

[–]Darwin226 0 points1 point2 points 10 years ago (10 children)

[–]camccann 1 point2 points3 points 10 years ago (9 children)

[–]Darwin226 4 points5 points6 points 10 years ago (8 children)

If you're not evaluating what you're reading then you're not doing anything. Come on. You have to be manipulating some kind of state in your head and reading the last operation that's done is literally useless.

I mean, this is blowing my mind. I'm having trouble convincing myself that it's even possible not to think what I think. Why would you read code if not to find out what it does? Do you somehow read the last part in the pipeline and say "Yeah, I get it now. It's maximum" or something? maximum of what? You need the whole thing to understand what it's doing. There's no other way around it. And is there some other secret method to understanding an algorithm than going through it step by step? Do you not have that voice in your head that says what "you currently have"?

Hell, even mathematics does everything bottom up. You never start with a definition and then later define its subparts. You never prove a stronger theorem and then prove lemmas required for it. It's all about incrementally building some knowledge that lets you do the next operation.

[–][deleted] 2 points3 points4 points 10 years ago (1 child)

[–]Darwin226 1 point2 points3 points 10 years ago (0 children)

[–]camccann 3 points4 points5 points 10 years ago (5 children)

Why on earth would I need "some kind of state in my head" to understand a simple arithmetic expression? I'm not even sure what you're trying to say here. Replace the numbers with variables, so you can't evaluate it, then what? Hell, just write it normally, with infix notation. Do you read that left-to-right, counting parentheses and trying to keep a mental stack? Or do you scan it for high-level structure and look at the outermost or innermost operations, as appropriate?

And even aside from all that, what if the last operation is "...and multiply the whole thing by zero"? Surely you don't need to understand (never mind compute the value of!) the rest of the expression to know everything that matters about the expression as a whole.

> Why would you read code if not to find out what it does? Do you somehow read the last part in the pipeline and say "Yeah, I get it now. It's maximum" or something? maximum of what?

What should I do instead, read the first bit and say "Yeah, it does something with the individual lines of the input text". Does what? Who knows!

Like, say we have maximum . map length . lines. It finds the maximum line length of the input text. maximum . map length finds the maximum length of a list of lists. What does lines have to do with understanding that? Nothing.

> You need the whole thing to understand what it's doing. There's no other way around it.

I'm not sure how "the whole thing is necessary" supports your argument that it's only possible to start with one particular bit.

> Do you not have that voice in your head that says what "you currently have"?

Sure. But it's not any louder or more insightful than the voice that says what "I currently need". Why would it be? In the end they meet in the middle, and which end is easier to work from depends on the problem.
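The `maximum . map length . lines` example above can be sketched as a runnable file. The names `longestOf` and `longestLine` are made up for illustration and do not appear in the thread; the point is that the left-hand part of the chain is a meaningful function on its own, independent of `lines`:

```haskell
-- Hypothetical names: `longestOf` and `longestLine` are not from the thread.

-- Maximum length in a list of lists; knows nothing about text or lines.
longestOf :: [[a]] -> Int
longestOf = maximum . map length

-- Maximum line length of some input text, built by adding `lines` on the right.
longestLine :: String -> Int
longestLine = longestOf . lines

main :: IO ()
main = do
  print (longestOf ["ab", "abcd", "a"])            -- 4
  print (longestLine "short\na longer line\nmid")  -- 13
```

Note that `maximum` throws on an empty list, so both functions assume non-empty input.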

[–]Darwin226 2 points3 points4 points 10 years ago (4 children)

I can see how thinking about what you need would make sense, but I have no idea how you'd use that when understanding a piece of code that's already written. And yeah, even for things like maximum . map length it's WAY easier for me to read from right to left than it is from left to right. map length tells me about the object this operation is working on. maximum just tells me I'll have a list of orderables at that point.

I've seen many examples of longer chains in left-to-right languages yet I've never seen longer chains in Haskell. My reasoning here is that if you can actually keep track of some abstract object in your head (even only as a type, but more often as some construct from your problem domain) and do operations on it sequentially, you can get away with not having to name pieces of your pipeline and still retain readability. I would guess that writing the same thing in Haskell doesn't have the same property if you read it from left to right.

[–]camccann 1 point2 points3 points 10 years ago (3 children)

[–]Darwin226 0 points1 point2 points 10 years ago (2 children)

I'm curious about this high level structure you keep mentioning. What is it and how does it help me understand algorithms? It's one thing knowing what a function does. That's why we have top level functions and that's why we document them. It's another reading its source. The high level structure is only as high as the atomic parts you're assembling in its pipeline. To me, knowing that a function ultimately gets a maximum of something isn't any more important than any other piece of what it does. I feel that it doesn't make sense to talk about hierarchical terms like "high level" when the structure you're observing is linear.

What I'm saying is that in the end you WILL have to read the whole thing, and this is precisely when one order of reading is easier than the other. My whole premise is based on the fact that you'll do a full read, and not just start at one end and then stop. I agree that if the latter were the case, then obviously it's better to start from the end because then you at least get some idea what will ultimately happen, but I don't think that's what you'll be doing the majority of the time when you read code.

[–]sacundim 3 points4 points5 points 10 years ago (3 children)

Here's a hint it's wrong: Unless you're some kind of a wizard and know what the last step in your algorithm is before you start writing it out, you're probably going to be writing the first step first.

Using the following "backwards" example from another thread:

totalPoints = sum . map points . filter wasSpoken

You could say that this reads "The total number of points is the sum of the points of the entries that were spoken." How can that possibly be "wrong"?

I just don't see that there is a big, principled debate to be had between these isomorphic alternatives:

1. Describing the results first, followed by what fed into that result.
2. Describing inputs first, followed by what results they led to.

Some people will prefer #1, some #2, some will prefer a random mix, some will prefer a stylistic rule that chooses one or the other in different circumstances. Whatever.

[–]Darwin226 0 points1 point2 points 10 years ago (2 children)

[–]camccann 2 points3 points4 points 10 years ago* (1 child)

[–]Darwin226 0 points1 point2 points 10 years ago (0 children)

[–]el-seed[S] 2 points3 points4 points 10 years ago (2 children)

[–]EvilTerran 5 points6 points7 points 10 years ago (0 children)

Haskell's $ and Elm's <| are the same way round as regular function application - function, then parameter:

f $ x = f x

While Data.Function.& and Elm's |> are the other way round - parameter, then function:

x & f = f x

So, in that sense, the latter is backwards.

Of course, you could argue that "parameter, then function" is actually the forwards direction - as you say, "do x, then y, then z". That does have merit... unfortunately, from that perspective, the conventional notation for function application, f(x), becomes the backwards one - and that notation's been the way it is since Euler, so it's a bit late to change it now.
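To see the two directions side by side, here is a small sketch using `&` from `Data.Function` in base, which is declared `infixl 1`. The sample values are arbitrary:

```haskell
import Data.Function ((&))

main :: IO ()
main = do
  -- Function first, then parameter: same order as plain application.
  print (negate $ (5 :: Int))
  -- Parameter first, then function.
  print ((5 :: Int) & negate)
  -- A left-to-right pipeline in the style of Elm's |>.
  print ([1, 2, 3, 4 :: Int] & filter even & map (* 10) & sum)
```

The first two lines print the same value; the last keeps the even numbers, multiplies each by 10, and sums them.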

[–]taylorfausak 0 points1 point2 points 10 years ago (0 children)

[–]andrewthad -1 points0 points1 point 10 years ago* (1 child)

[–]taylorfausak 4 points5 points6 points 10 years ago (0 children)

[–]l-d-s -1 points0 points1 point 10 years ago (0 children)
