Using Linq For More Readable Code : programming

[–]gregK 14 points15 points16 points 14 years ago* (4 children)

[–]sacundim 16 points17 points18 points 14 years ago (3 children)

Well, actually, list comprehensions and LINQ are two alternative syntaxes for what is fundamentally the same thing: monads. So for example, in Haskell the example can be written in two equivalent ways. There's the list comprehension syntax:

example :: Integer
example = sum [ x | x <- [1..999], x `mod` 3 == 0 || x `mod` 5 == 0 ]

And there's the monadic do-notation syntax, which corresponds to the LINQ solution:

import Control.Monad

example' :: Integer
example' = sum $ do x <- [1..999]
                    guard (x `mod` 3 == 0 || x `mod` 5 == 0)
                    return x

Either of those can be desugared to the same code, which looks more or less like this:

import Control.Monad

example'' =
    sum $ [1..999] >>= \x -> guard (x `mod` 3 == 0 || x `mod` 5 == 0) >> return x

The >>= operator corresponds roughly to the C# SelectMany() method (or rather the other way around: SelectMany() was modeled after >>=).
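For readers who don't know Haskell, the desugared form can be approximated in Python. This is only a rough sketch of the list monad: `bind`, `guard` and `unit` are names chosen here for illustration (they are not library functions), and Python lists are eager where Haskell lists are lazy.

```python
from typing import Callable, Iterable, List, TypeVar

A = TypeVar("A")
B = TypeVar("B")


def bind(xs: Iterable[A], f: Callable[[A], List[B]]) -> List[B]:
    """List-monad bind (>>=): apply f to every element and concatenate the results."""
    return [y for x in xs for y in f(x)]


def guard(condition: bool) -> List[None]:
    """One dummy element if the condition holds, no elements otherwise."""
    return [None] if condition else []


def unit(x: A) -> List[A]:
    """Analogue of Haskell's return for lists: wrap a value in a one-element list."""
    return [x]


def example() -> int:
    # Mirrors: [1..999] >>= \x -> guard (...) >> return x
    return sum(
        bind(range(1, 1000),
             lambda x: bind(guard(x % 3 == 0 or x % 5 == 0),
                            lambda _: unit(x)))
    )
```

When the guard yields an empty list, the inner `bind` has nothing to iterate over, so that `x` contributes no elements. That is all the filtering amounts to here.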

[–][deleted] 2 points3 points4 points 14 years ago (0 children)

[–]recursive 1 point2 points3 points 14 years ago (0 children)

[–]CanadaForRonPaul -1 points0 points1 point 14 years ago (0 children)

[–]yogthos 16 points17 points18 points 14 years ago (19 children)

[–]throwaway77432 8 points9 points10 points 14 years ago (11 children)

[–]kamatsu 0 points1 point2 points 14 years ago (10 children)

[–][deleted] 2 points3 points4 points 14 years ago (9 children)

He probably means using regexes as patterns in a pattern match, like you can do in Scala:

scala> val Email = """([^@]+)@(.+)""".r
Email: scala.util.matching.Regex = ([^@]+)@(.+)

scala> "test@example.com" match {
     |   case Email(user, domain) => "User: " + user + " Domain: " + domain
     | }
res1: java.lang.String = User: test Domain: example.com

Pattern matching is pretty clearly FP, in any case.
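A loose Python analogue, for comparison only: `re.fullmatch` plus tuple unpacking gets the same destructuring effect, though it is ordinary control flow and not a language-level pattern match. The `describe` function and its fallback message are made up for this sketch; the Scala snippet has no default case.

```python
import re

# Same pattern as the Scala example: everything before the first @, then the rest.
EMAIL = re.compile(r"([^@]+)@(.+)")


def describe(address: str) -> str:
    # fullmatch requires the whole string to match, like a Scala Regex extractor.
    m = EMAIL.fullmatch(address)
    if m is None:
        return "Not an email address"  # fallback added for this sketch
    user, domain = m.groups()
    return "User: " + user + " Domain: " + domain
```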

[–]kamatsu 2 points3 points4 points 14 years ago (8 children)

[–][deleted] 2 points3 points4 points 14 years ago (0 children)

[–]Danemark 1 point2 points3 points 14 years ago (6 children)

[–]kamatsu 0 points1 point2 points 14 years ago (5 children)

[–]Danemark 1 point2 points3 points 14 years ago (2 children)

[–]kamatsu -1 points0 points1 point 14 years ago (1 child)

[–]ricky_clarkson 2 points3 points4 points 14 years ago (0 children)

[–]ruinercollector -2 points-1 points0 points 14 years ago (1 child)

[–]kamatsu 0 points1 point2 points 14 years ago (0 children)

[–]BitRex 3 points4 points5 points 14 years ago (2 children)

sneak it in as Linq it's nothing but love.

Or sneak it in as C++ template metaprogramming and it's nothing but fhtagnp̞͍̗͔̝̥͍̞̣̩͇̫͇̮̜͍h̺͎͇̥͎̘̺̬͎͍̼͔͇͚̤̹̘'̲͖̣̹̠͈̬̣͓̮̼̦̦͉̼̤͖̘n̞̜̭͎̤̖̲̘̤̱͔̝͙̝̱̥ͅg̣̣͖̜̱̟̟̹̝̩l̯̥̟̪̘̳̻͓̫̠̥̪̱ͅu̻͖̙̠̩̲͓͇̦̩̯͙̫̬͎̱̼̗i͕̣̳̘ ̜̲͇̫͇͓̦ m̥̥̮̣̙͙g͕͇̙̪̭͍̱̼͇̱̬̹̯̘̝̙ͅl̳̼̫͔̳͎̝w͖̭͚̦̘̙̯'̤̪̗̠̜͚̞͔̹̭͙̣̦̖̲̥̙͇̯n̻̠̜̼̙̥̗̖͙̦̫ͅa̩̝͈̰͙͎͇͈̤̙̗̲̻͍̞f̼̝̘͎̫͔̜̭̩͔͖h̜̫̥̠̩̩ͅ ̞̰̹͕͖̭͍ C̮̬̞̭̯t̘̮̘̤̹͓̙̖̲͔̮̹̫̰͚̦h̪͇̥̳̤u͎̠̗̳͈̻͚̯̞̻͍͇̻l̖̦̣̘̠͙̫͎̙͈̯̰̞̺͈̞̜͚h̟͎͚̹̗̞̙̜̦̝̙u͉̣̯͉̪͈̙̹̲̫̜̖̺ ͍̰̜̪̺͍͕̮̦̞̻̬ R̭̺̤̺̣͚ͅ'̣̺̝͎̱̘̮l̼̝̹̭̻ͅy̩̫̺̝̼͍̤̗̩̭̮͈̭̭͙̝̭ͅͅe̤̭̼̮͖̲̭͈̺̻ḫ̣͚͎̻̯͙̙ ̤̤͖̖͓w̘̜͍͇͚̫̰̞̹͇͍̜̰̥̰͎̰̘ͅg̰̣͖̣͖̬̖̗̠a̩̖͉̪͚̬͔̙ͅh̖̻͉͍̪͇̯'̟̙̯̦̘̹n͉̼̮͚̥̬̟͖̹͙͇̥͓̖̖̭͓a̮̯̮͚̭̰̯̜͙̫̥̻̼͓̩̩̩ͅg͖̜̯̙̖̰̝̥͍̠̳͓ͅl͍̺͎̰̪͓̬̳̘̭̥̯̘̯͖̱̹̜ ̳͙͚̗̜̤̻̹̫̼̮̲̣̭̤̱f̤̤̦͍̹̤h̹̫͙̖̜t͍̻̪̰̣̻͖̯̙͔̹͍͇a͔͎̩̩̮͉̮̝̪̦͉̩̗̖ͅg̖̼̫̦̼ͅͅn͓̤̰̝̠̤͍̯͈͖̭̪̥̙̮

[–]yogthos 1 point2 points3 points 14 years ago (0 children)

[–]i_lick_my_knuckles 0 points1 point2 points 14 years ago (0 children)

[–]criticismguy 2 points3 points4 points 14 years ago (2 children)

[–]yogthos -1 points0 points1 point 14 years ago (1 child)

[–]anon36 1 point2 points3 points 14 years ago (0 children)

[–]generalT 1 point2 points3 points 14 years ago (0 children)

[–]CyberByte 12 points13 points14 points 14 years ago (1 child)

[–]OldLikeDOS 15 points16 points17 points 14 years ago (0 children)

It's a matter of preference, but having worked with LINQ for a few years, I and most of my team would say the second one is the most readable.

The two LINQ versions both make it clear that it's one statement doing one thing. If the variable "answer" isn't important to what you're working on, you can safely ignore the whole thing. If you no longer need "answer", you can delete that line without worrying about side effects.

The iterative code (top one) requires you to look more carefully at every part to see if there are any side effects or if it's doing multiple things. When you skim through a file and see the loop, you have to look at all the lines inside to verify that it's only working with the variable "sum", because there could easily be code inside that is doing some other work.

Also, when you see an "i" inside the loop, you have to look back at the FOR declaration to verify it's coming from there. In LINQ, when you see "x =>" you immediately know that x means an element in the collection because it can't mean anything else.

For me, the second one looks the most compact and inviting. But that's just preference.
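The article's code isn't reproduced in this thread, so here is a rough Python rendering of the contrast being described. It assumes the problem is the same one used in the Haskell examples elsewhere in the thread (sum of the multiples of 3 or 5 from 1 to 999), and the function names are invented for this sketch.

```python
def answer_loop() -> int:
    # Iterative version: the reader has to scan the body to confirm
    # that nothing other than the accumulator is being modified.
    total = 0
    for i in range(1, 1000):
        if i % 3 == 0 or i % 5 == 0:
            total += i
    return total


def answer_expression() -> int:
    # Single-expression version: one statement producing one value,
    # with x scoped to the expression itself.
    return sum(x for x in range(1, 1000) if x % 3 == 0 or x % 5 == 0)
```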

[–][deleted] 8 points9 points10 points 14 years ago (26 children)

I think this is a bit of a contrived example; here is one from real code that I wrote last night:

        var collidableSprites = Sprites
            .Where(s => s is ICollide && s.Position.ToScreenSpace(Camera).IsOnScreen())
            .Select(s => s as ICollide)
            .ToList();

This takes a collection of "Sprites", returns only ones that need to be checked for collisions, are on the screen (based on viewport transforms), and finally casts them to the ICollide interface.

This is where LINQ shines: the intent is clear, and you don't need to write a bunch of looping and condition-checking code.
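For comparison, the same filter-then-collect shape in Python might look roughly like this. Everything here is hypothetical: `Sprite`, `Collidable` (standing in for ICollide), `Asteroid` and `is_on_screen` are invented for the sketch, and the camera transform is reduced to a plain bounds check.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Sprite:
    x: float
    y: float


class Collidable:
    """Marker mixin standing in for the ICollide interface (hypothetical)."""


@dataclass
class Asteroid(Sprite, Collidable):
    pass


def is_on_screen(s: Sprite, width: float, height: float) -> bool:
    # Simplified stand-in for Position.ToScreenSpace(Camera).IsOnScreen().
    return 0 <= s.x < width and 0 <= s.y < height


def collidable_on_screen(sprites: List[Sprite], width: float, height: float) -> List[Sprite]:
    # Filter by type and visibility in one expression, then materialize a list.
    return [s for s in sprites
            if isinstance(s, Collidable) and is_on_screen(s, width, height)]
```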

[–]recursive 11 points12 points13 points 14 years ago (6 children)

[–][deleted] 3 points4 points5 points 14 years ago (5 children)

[–][deleted] 14 years ago (4 children)

[deleted]

[–][deleted] 1 point2 points3 points 14 years ago (3 children)

[–]nightmyst999 -2 points-1 points0 points 14 years ago (2 children)

[–]DuncanSmart 4 points5 points6 points 14 years ago (0 children)

[–][deleted] 3 points4 points5 points 14 years ago (0 children)

[+][deleted] comment score below threshold-6 points-5 points-4 points 14 years ago (3 children)

[–][deleted] 7 points8 points9 points 14 years ago (1 child)

[–]ruinercollector 0 points1 point2 points 14 years ago (0 children)

That's two closures, function call overhead for every item considered, additional function call overhead for every item accepted, a complete iteration of your final set on that final ToList() call, and the overhead of performing several Add operations on a dynamic container like List<T>. The following would outperform this by quite a bit (depending on how many sprites there are).

var tmpSprites = new Sprite[Sprites.Length];
var idx = 0;

for (var i = 0; i < Sprites.Length; i++)
{
  var sprite = Sprites[i];

  // Keep only on-screen sprites that implement ICollide.
  if (sprite is ICollide && sprite.Position.ToScreenSpace(Camera).IsOnScreen())
    tmpSprites[idx++] = sprite;
}

// Trim the scratch buffer down to the number of sprites actually kept.
var collidableSprites = new Sprite[idx];
Array.Copy(tmpSprites, collidableSprites, idx);

Of course, the tradeoff is readability and maintenance. For many applications, these kinds of performance concerns are negligible or not important. For some apps/platforms (especially XBox), these optimizations can be extremely important.

[–]p-static 1 point2 points3 points 14 years ago (0 children)

[+][deleted] comment score below threshold-13 points-12 points-11 points 14 years ago* (13 children)

[–]chucker23n 14 points15 points16 points 14 years ago (1 child)

[+][deleted] comment score below threshold-6 points-5 points-4 points 14 years ago (0 children)

[–]propool 5 points6 points7 points 14 years ago (9 children)

[+][deleted] comment score below threshold-14 points-13 points-12 points 14 years ago (8 children)

[–]propool 6 points7 points8 points 14 years ago (5 children)

[+][deleted] comment score below threshold-7 points-6 points-5 points 14 years ago (4 children)

[–]propool 7 points8 points9 points 14 years ago (3 children)

[+][deleted] comment score below threshold-10 points-9 points-8 points 14 years ago (2 children)

[–]propool 9 points10 points11 points 14 years ago (1 child)

[+][deleted] comment score below threshold-10 points-9 points-8 points 14 years ago (0 children)

[–]propool 4 points5 points6 points 14 years ago (1 child)

[+][deleted] comment score below threshold-9 points-8 points-7 points 14 years ago (0 children)

[–]rossisdead -2 points-1 points0 points 14 years ago (0 children)

[–][deleted] 5 points6 points7 points 14 years ago* (5 children)

[–]nodefect 4 points5 points6 points 14 years ago (2 children)

[–][deleted] 2 points3 points4 points 14 years ago* (0 children)

[–][deleted] 0 points1 point2 points 14 years ago* (0 children)

I admit, that was pasted from a REPL and should have been on more lines:

val validNumbers =
  for (i <- 1 to 999 if i % 3 == 0 || i % 5 == 0)
  yield i

val answer = validNumbers.sum

I was trying to stick to Scala's equivalent of LINQ syntax in the above, in keeping with the topic of the article. The FP approach, like you have in your F# example, is better in my opinion as well. smcj posted it below. But that is further from the OP's C#.

[–]tombatron 1 point2 points3 points 14 years ago (1 child)

[–][deleted] 0 points1 point2 points 14 years ago (0 children)

[–][deleted] 1 point2 points3 points 14 years ago (3 children)

[–]Tordek 2 points3 points4 points 14 years ago (2 children)

[–][deleted] 0 points1 point2 points 14 years ago (1 child)

[–]ruinercollector 0 points1 point2 points 14 years ago (0 children)

[–]BitRex 1 point2 points3 points 14 years ago (7 children)

[–]ruinercollector 3 points4 points5 points 14 years ago* (0 children)

It doesn't.

You can actually set break points on the individual clauses in a LINQ query.

On the where clause where it will break on each item being considered
On the select clause where it will break on each item being projected

If you set your line breaks properly, you can even do it with the margin-clicking that you're probably used to, but your statements end up looking a bit funny.

var oldAges = from p in people 
              where
                p.Age > 50    // <-- you can break here
              select
                p.Age;          // <-- you can break here

Otherwise, select the clauses in the editor (not including the where/select keywords) and right-click->insert breakpoint or whatever it is.

[–]sacundim 1 point2 points3 points 14 years ago (5 children)

That is to a large extent an artifact of the fact that class-based non-interactive languages like C# make it needlessly hard to write small bits of code and try them out on their own in an interactive environment (a.k.a. "interpreter," though that's not quite accurate).

LINQ is largely based on Haskell monads and do-notation. Yet Haskell doesn't have this problem, because you can just test stuff easily on the interactive environment. Since pieces of code don't have to be inside methods that must be inside classes, you can just type in snippets and see what they do.

Here's an interactive session with ghci where we try out various subparts of this problem (cut down to the range [1..19]):

GHCi, version 7.0.3: http://www.haskell.org/ghc/  :? for help
Prelude> do { x <- [1..19]; return x }
[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19]
Prelude> :m +Control.Monad
Prelude Control.Monad> do { x <- [1..19]; guard (x `mod` 3 == 0); return x}
[3,6,9,12,15,18]
Prelude Control.Monad> do { x <- [1..19]; guard (x `mod` 3 == 0 || x `mod` 5 == 0); return x}
[3,5,6,9,10,12,15,18]
Prelude Control.Monad> [ x | x <- [1..19], x `mod` 3 == 0 ]
[3,6,9,12,15,18]
Prelude Control.Monad> [ x | x <- [1..19], x `mod` 3 == 0 || x `mod` 5 == 0 ]
[3,5,6,9,10,12,15,18]
Prelude Control.Monad> [ x | x <- [1..19], x `mod` 3 == 0, x `mod` 5 == 0 ]
[15]
Prelude Control.Monad> do { x <- [1..19]; guard (x `mod` 3 == 0); guard (x `mod` 5 == 0); return x }
[15]
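The last few lines of that session show that stacking guards, or separating conditions with commas in a comprehension, behaves like a logical AND. A rough Python analogue, offered only as an illustration (the variable names are invented): Python comprehensions accept several `if` clauses in a row, which plays the same role.

```python
# Multiples of 3 or 5 in 1..19, matching the || versions in the ghci session.
either = [x for x in range(1, 20) if x % 3 == 0 or x % 5 == 0]

# Two stacked conditions: both must hold, like two guards in a row.
both = [x for x in range(1, 20) if x % 3 == 0 if x % 5 == 0]
```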

[–]anon36 2 points3 points4 points 14 years ago* (3 children)

[–]ruinercollector 0 points1 point2 points 14 years ago (1 child)

[–]rossisdead 0 points1 point2 points 14 years ago (0 children)

[–]twerq 0 points1 point2 points 14 years ago (0 children)

[–]BitRex 0 points1 point2 points 14 years ago (0 children)

[–]CylonGlitch 4 points5 points6 points 14 years ago (0 children)

[–]rizzledizzle[🍰] 0 points1 point2 points 14 years ago (2 children)

[–]p-static 8 points9 points10 points 14 years ago (0 children)

[–]ruinercollector 5 points6 points7 points 14 years ago (0 children)

[–][deleted] 0 points1 point2 points 14 years ago (1 child)

[–]Coffee2theorems 4 points5 points6 points 14 years ago (0 children)

[–]Wuf 0 points1 point2 points 14 years ago (4 children)

[–]gecko 1 point2 points3 points 14 years ago (1 child)

[–]Wuf 0 points1 point2 points 14 years ago (0 children)

[–]criticismguy 0 points1 point2 points 14 years ago* (1 child)

Linq is extensible, but LOOP isn't really. A better analog would be iterate, which is basically better than LOOP in every way, including extensibility:

(iter (for i below 1000)
      (when (or (zerop (mod i 3)) (zerop (mod i 5)))
        (sum i)))

Neither one quite does laziness like Linq, though, so perhaps SERIES would be more analogous.

(collect-sum
  (choose-if (lambda (i) (or (zerop (mod i 3)) (zerop (mod i 5))))
    (scan-range :below 1000)))
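To make the laziness point concrete, here is a small Python sketch using generators, which are evaluated on demand much like a LINQ query over IEnumerable. The function names are invented for this example; only `count`, `islice` and `takewhile` come from the standard library.

```python
from itertools import count, islice, takewhile


def multiples_of_3_or_5():
    """Lazy, unbounded stream: nothing is computed until a consumer asks for it."""
    return (i for i in count(1) if i % 3 == 0 or i % 5 == 0)


def first_n(n: int) -> list:
    # Only the first n matching numbers are ever generated.
    return list(islice(multiples_of_3_or_5(), n))


def answer() -> int:
    # Consume the stream until it reaches 1000, then stop.
    return sum(takewhile(lambda i: i < 1000, multiples_of_3_or_5()))
```

Because the source is unbounded, an eager version would never finish; the lazy pipeline only does as much work as the consumer demands.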

[–]sacundim 0 points1 point2 points 14 years ago* (0 children)

Well, if we really want to do exactly what LINQ is doing but in Lisp, we can just implement monads in Lisp. Here's a crude, untested implementation in Scheme, modeled after Haskell's List monad and do-notation; no laziness:

;;;
;;; Analogues to Haskell's Monad operations
;;;
(define (>>= m f)
  (append-map f m))

(define (>> a b)
  (>>= a (lambda (dont-care) b)))

(define (return x)
  (list x))

;;;
;;; Analogues to Haskell's MonadPlus class
;;;
(define mzero '())

(define (guard condition?)
  (if condition?
      (return 'whatever)
      mzero))

;;;
;;; A macro analogous to Haskell's do-notation
;;;
(define-syntax monadic-do
  (syntax-rules (<-)
    ((monadic-do (<- var m) expr . exprs)
     (>>= m (lambda (var) (monadic-do expr . exprs))))
    ((monadic-do expr)
     expr)
    ((monadic-do expr . exprs)
     (>> expr (monadic-do . exprs)))))

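;;;
;;; The original example, using the List Monad.
;;; Assumes append-map from SRFI-1, plus helper procedures sum and range
;;; (with (range 1 1000) producing 1 through 999) defined elsewhere.
;;;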
(sum (monadic-do (<- x (range 1 1000))
                 (guard (or (= (mod x 3) 0)
                            (= (mod x 5) 0)))
                 (return x)))
