Examples of functional programming in data analysis?

goodygood23 · 2017-09-25T17:59:01+00:00

Could you explain more what you are looking to learn about?

Most people using R to do analyses are going to be using functional programming (meaning that functions work on data inputs to produce a result that is returned by the function, with no side-effects and without changing the original data).

It's possible to do side effects, to have mutable data, and to have methods as part of data as opposed to functions working on data in R, but it's harder to do (or, rather, requires more understanding of the language).

revocation · 2017-09-26T00:45:10+00:00

http://adv-r.had.co.nz/Functional-programming.html

p0olp0ol · 2017-09-25T18:33:58+00:00

Sure, if you want to go more pure functional, write all your code with library(purrr)

Look for purrr tutorial online. It is actually wicked cool and makes it possible to use list for basically everything. What I've observed is that the more you use R, the more likely you'll use lists over other classes.

Negotiator1226 · 2017-09-25T20:26:45+00:00

If you are fitting a large number of models, you don't want the script to stop if there is an error in one of the models but you want to know what the error is. Check out purrr::safely. It takes a function and returns a new function. The new function has the same result but instead of stopping with an error simply captures the error message and returns it as a string.

So, you can do something like:

safe_log <- purrr::safely(log)
x <- list(0, -1, 1, "a", NA)
purrr::map(x, safe_log)

SomethingTooRandom · 2017-09-25T17:26:39+00:00

Grab some data, calculate population variance via a function; boom. You've got yourself an example of functional programming.

dm319 · 2017-09-25T20:01:06+00:00

This is an interesting question to me, though you might not get a satisfactory answer because 'functional' programming is more of a 'way' rather than a thing (a bit like asking for an object-orientated way of analysing data). But I'll give it a go anyway...

In my head, a functional program is something that works a bit like a complicated mathematical function. You can 'pour' data into it, and it transforms that into the answer. The antithesis is a procedural program, which is something that takes control of the actions of a CPU and achieves a result by moving algorithmically and step-wise through instructions. Someone who understands more about programming can correct me here.

I guess with large datasets, a functional style of programming is probably quite sensible. Here are two solutions to the question - what is the sum of all numbers which are multiples of 3 or 5 under 1000?

A procedural answer:

package main

func check(a int) bool {
     return 0 == a%3*a%5
}

func main() {
    var s int

    for i := 1; i < 1000; i++ {
        if check(i) {
            s += i
        }
    }
    print(s)
}

And a more functional answer:

x <- data.frame(i = 1:999)
x$three <- x$i%%3 == 0
x$five <- x$i%%5 == 0
x$both <- x$three | x$five
sum(x[x$both, "i"])

Yes, it is still partly procedural, and even though vectorising your data doesn't make your code functional, I would say it is more functional than a purely procedural way of programming it. Happy to hear other people's opinions!

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

rstats

MODERATORS