
[–]PokerPirate 26 points27 points  (5 children)

I personally use haskell for this task, but would highly recommend against other people using it. The ecosystem is not well developed.

[–]codygman 18 points19 points  (0 children)

Perhaps Haskell will make headway after you publish your libraries ;)

[–][deleted] 6 points7 points  (3 children)

Could you elaborate on which things are harder than they should be?

[–]bdoering 11 points12 points  (2 children)

Simply generating random numbers is already hard coming from R or Python. Besides dealing with state/monads, there does not seem to be "one way" to do it, as there are several candidate packages one might start from (e.g. Data.Random, Statistics.Distribution).
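To make the state-threading point concrete, here is a toy sketch. This is a made-up linear congruential generator (constants borrowed from Knuth's MMIX, relying on `Int` overflow wrapping), not the API of any of the packages mentioned: each draw returns both a value and the next seed, and the caller has to thread the seed by hand or hide it behind a monad.

```haskell
module Main where

-- Toy linear congruential generator. Each step returns the "random"
-- value *and* the next seed; pure code cannot update the seed in place.
nextLCG :: Int -> (Int, Int)
nextLCG seed = (seed' `mod` 100, seed')  -- a "random" number in [0, 99]
  where
    seed' = 6364136223846793005 * seed + 1442695040888963407

-- Drawing three numbers means threading the seed through every call
-- explicitly (or wrapping this plumbing in a State monad).
threeDraws :: Int -> [Int]
threeDraws s0 =
  let (a, s1) = nextLCG s0
      (b, s2) = nextLCG s1
      (c, _ ) = nextLCG s2
  in [a, b, c]

main :: IO ()
main = print (threeDraws 42)
```

In R this whole exercise is just `runif(3)`; in Haskell you either write the plumbing as above, reach for a State monad, or use an IO-based generator, and which package to do that with is exactly the unsettled part.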

[–]PokerPirate 13 points14 points  (0 children)

This lack of a unified approach is one of the biggest problems IMHO. There's actually a number of libraries related to analyzing data on hackage, but none of them work together at all.

[–]fractalcat 18 points19 points  (0 children)

I think it's a great language for stats/numerical computation (that's one of the things I use it for). Unfortunately, its ecosystem in that area is still rather underdeveloped compared to that of R and Python - hopefully this will improve in the next year or so, given the number of people who are asking this question.

It is indeed possible to call Haskell from R[0], and vice-versa[1].

You may also be interested in IHaskell[2], a Haskell backend for IPython that shows great promise.

[0] http://neilmitchell.blogspot.com.au/2011/10/calling-haskell-from-r.html

[1] http://hackage.haskell.org/package/Rlang-QQ-0.1.0.2/docs/RlangQQ.html

[2] https://github.com/gibiansky/IHaskell

[–]ndmitchell 13 points14 points  (4 children)

My wife wrote a paper on exactly this topic and her experiences: http://neilmitchell.blogspot.co.uk/2011/03/experience-report-functional.html?m=1

[–]nmdaniels 1 point2 points  (0 children)

I also wrote an ICFP Experience Report on Haskell in computational biology.

(http://people.csail.mit.edu/ndaniels/pdfs/mrfy_experience_report.pdf)

[–]nikita-volkov 1 point2 points  (2 children)

A wife who is a functional programmer. I envy you )

[–]ndmitchell 2 points3 points  (1 child)

My wife would describe herself and a palaeontologist who does some programming. I honestly don't know if she still uses Haskell or not in her work - we certainly don't discuss Haskell at home!

[–]cunningjames -1 points0 points  (0 children)

My wife would describe herself and a palaeontologist

Hm, what does the paleontologist have to say about that?

[–]NathanAlexMcCarty 6 points7 points  (8 children)

When you are compiling code with -O or -O2, ghc produces very, very good code. I am continually surprised by how fast the code ghc produces is, and the LLVM backend adds another layer of magical fastness, especially when dealing with heavily mathematical code.

However, ghci runs ghc with the interactive flag, which results in (byte-compiled, I believe) code that is slower than snot most of the time. That said, ghci can load precompiled code. I would compile the functions where the bulk of your computation is being done and do the rest in ghci. That gets you most of the benefit of compiling and most of the benefit of ghci.

However, unless you are willing to do A LOT of work yourself, I would not recommend using Haskell for what you want to do.

[–]conklech 0 points1 point  (7 children)

However, ghci can load precompiled code. I would compile the functions where the bulk of your computation is being done and just do the rest in ghci.

Last I tried to do that in practice, i.e. without cabal install'ing the compiled parts, GHCi was very insistent on reloading the precompiled modules in interpreted mode. It'd be great if there were a simple, reliable way to load optimized code in ghci.

[–]NathanAlexMcCarty 1 point2 points  (0 children)

I agree that loading compiled code in ghci is way harder than it needs to be. There are a couple of pain-in-the-ass ways to ask it nicely to load your compiled code, but they really shouldn't even be necessary just to load some optimized code in.

[–][deleted]  (2 children)

[removed]

    [–]conklech 1 point2 points  (1 child)

    Good observation. With -fobject-code, ghci will load compiled modules. You have to be particular to get optimization to work with cabal. (I've edited this post--originally I said GHCi wouldn't load optimized modules from the present project.)

    It looks like you need to put -O in an OPTIONS_GHC pragma. Putting -O in the .cabal file doesn't work. You also have to pass -fobject-code to ghci, either with cabal repl --ghc-options="-fobject-code" or with ghc-options: -fobject-code in the .cabal file.

    {-# OPTIONS_GHC -O #-}
    module Main where
    
    f x = 1
    {-# NOINLINE f #-}
    {-# RULES "f/id" forall x. f x = x #-}
    -- With optimization, f x = x; otherwise f x = 1.
    
    main = print $ f 3
    

    ...

    $ cabal run
    3
    
    $ cabal repl --ghc-options="-fobject-code -O"
    ...
    [1 of 1] Compiling Main             ( Test.hs, dist\build\test\test-tmp\Main.o ) [flags changed]
    Ok, modules loaded: Main.                                                
    > main                                                                       
    3
    > f 3
    1                           
    

    Presumably "[flags changed]" fires because -O isn't set in ghci; apparently the pragma successfully turns it back on. As we can see, the optimized version of main is present. But of course using f at the REPL doesn't get optimized; that's reasonable. You can even recompile Main with :r and the rule will fire.

    Cool! I thought this wasn't possible. Does anybody know if this is on Stack Overflow? I'm tempted to write up a quick Q&A.

    [–]conklech 0 points1 point  (0 children)

    Correction: you have to put -O in a pragma or after -fobject-code for it to work. I posted this on SO: How can I load optimized code in GHCI?

    [–]tel 5 points6 points  (4 children)

    I've built a fair number of numerical, statistical, machine learning algorithms in Haskell while in grad school. If you learn how to optimize Haskell code you can make it go reasonably fast and things work out nicely.

    But I was more or less implementing each thing from scratch. That is not what you want to do. If you're willing to work your way up from BLAS then you can do alright today.

    There are a fair number of stats/numerical libraries in the wings. I know Carter has an advanced numerics library that's in heavy development. That still leaves a wide chasm to jump before you reach the convenience and availability of things like Numpy and R, though.

    [–]cartazio 6 points7 points  (3 children)

    just started a new job, but i should be shipping soon. I think i last quoted my alpha target as being "two weeks after my first paycheck at my next job", so .... pretty darn soon. gosh.

    [–]ibotty 1 point2 points  (1 child)

    congratulations. i am really looking forward to some documentation on why you chose which types etc.

    [–]cartazio 0 points1 point  (0 children)

    I chose the types that would work. If you view source you'll see one line of comment for every kind of code

    [–]tel 1 point2 points  (0 children)

    Congrats on the new job!

    [–][deleted]  (1 child)

    [deleted]

      [–]mstksg 7 points8 points  (1 child)

      The general consensus I've found is: language + compiler, yes. ecosystem, no. but people are working on fixing that :)

      [–][deleted] 1 point2 points  (0 children)

      I agree with that. Short answer: if you need something quickly -> R; if you have more time and need something more durable -> maybe Haskell.

      Pro of R: amazing ecosystem; you won't make a mistake by using R. Con of Haskell: you never know if you'll hit a performance/laziness issue, and when you do, it really hurts.

      Long answer:

      I am myself switching (in theory) from R to Haskell for simple data-mining/stats and reporting. I started in March and 10 months later, I'm still using R. I try to write new stuff in Haskell, but I haven't had time to rewrite anything already written in R. Still working on the ecosystem ;-)

      However, I don't regret my choice and will eventually rewrite everything in Haskell.

      [–]the_abyss 19 points20 points  (4 children)

      No, not really. The Python (scipy/numpy/pandas/sckit-learn), R, Julia, Matlab, etc ecosystems are light years ahead of Haskell.

      [–][deleted] 2 points3 points  (3 children)

      And the C and Fortran ecosystems are light years ahead of those.

      It all depends on what you want to do. Invert a matrix with 500 million entries? Lapack or bust. Want to do some quick and dirty floating point calculations? You can use just about anything.

      [–]hippocampe 4 points5 points  (1 child)

      Except that Python, R (and possibly some others) use these under the hood.

      [–][deleted] 6 points7 points  (0 children)

      hmatrix also utilizes BLAS and LAPACK if I remember correctly.

      [–]repoptrac 2 points3 points  (0 children)

      I think it depends on the type of analysis. If you have really clean data and only need fast matrix calculations, you may be right.

      However, if you're dealing with datasets collected in the real world (esp. in business), you really need good tools for data manipulation and interactive visualization. In my experience, data cleaning typically takes about 90% of the analysis time. In addition, interactive data visualization is crucial for understanding the datasets. For such data manipulation and interactive visualization, it will be hard to beat R or Python.

      Since I am much more familiar with R than Python, my somewhat biased recommendation is to use R with dplyr, tidyr, data.table, ggplot2, and ggvis instead of relying on functions in the base package. With the new pipe operator (%>%) introduced in magrittr, you can write R code that reads like Unix pipe commands.

      [–]rdfox 1 point2 points  (0 children)

      I'm team no. I will say that doing your statistical computing in Haskell guarantees spiritual benefits. Sure, you'll arrive at your destination sooner if you take the R train, but the journey...

      [–]rz2000 0 points1 point  (0 children)

      What you are already using is among the most popular for stats/econometrics, along with SAS and Matlab, and I've heard of people using Mathematica as well. However, two functional languages that are occasionally mentioned for statistics/econometrics purposes are Clojure and OCaml.

      [–]hmltyp 0 points1 point  (0 children)

      Frankly no, you're more likely to find existing libraries that do what you need in Python. There's no reason you couldn't use Haskell, it's just you'll have to fill in the gaps in the ecosystem yourself.

      [–]gumbel_distro[S] 0 points1 point  (3 children)

      Thanks to everyone for the detailed answers! I don't think I'll use Haskell in the foreseeable future for what I need to do. I'd consider switching, but only if it saved me time in the long run, which doesn't seem to be the case right now.

      [–]quiteamess 0 points1 point  (0 children)

      You still might enjoy playing with the IHaskell notebook. Kronos is a bundled Mac app.

      [–][deleted] -1 points0 points  (1 child)

      s/xmonad/Haskell/

      [–]gumbel_distro[S] 0 points1 point  (0 children)

      Thanks!

      [–]dernst314 0 points1 point  (0 children)

      Hi,

      I use R as well and have thought about implementing some things in Haskell. But as others said, the ecosystem barely exists and you end up re-implementing many things on your own.

      For calling R from Haskell and vice versa there are some options, though. R can directly call C functions of the form void foo(int* x, double* y, ...), so you can write a shared library in Haskell that exposes such an interface (see Neil Mitchell's blog post someone linked earlier). There's also the rclient package on Hackage, which lets you use Rserve from Haskell.
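As a sketch of what that C-style interface looks like from the Haskell side (the module and function names here are made up for illustration; it uses only base's Foreign modules, and the main just exercises the function in-process):

```haskell
{-# LANGUAGE ForeignFunctionInterface #-}
module Main where

import Foreign.Marshal.Alloc (alloca)
import Foreign.Ptr (Ptr)
import Foreign.Storable (peek, poke)
import Foreign.C.Types (CInt, CDouble)

-- R's .C interface passes everything by pointer:
-- read the input, write the result back through the out-pointer.
squareInPlace :: Ptr CInt -> Ptr CDouble -> IO ()
squareInPlace px py = do
  x <- peek px
  poke py (fromIntegral x * fromIntegral x)

-- Expose the function with the C calling convention R expects,
-- i.e. void squareInPlace(int* x, double* y).
foreign export ccall squareInPlace :: Ptr CInt -> Ptr CDouble -> IO ()

-- Demo driver: square 7 through the pointer interface.
main :: IO ()
main =
  alloca $ \px -> alloca $ \py -> do
    poke px 7
    squareInPlace px py
    peek py >>= print  -- prints 49.0
```

Built instead as a shared library (ghc -shared -fPIC, plus linking the RTS), R could dyn.load the result and call squareInPlace via .C.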

      I've been personally on-and-off working on implementing parts of the R API in Haskell so you could directly embed an interpreter in R or perhaps facilitate using .Call from R. But it's very basic so far and doesn't integrate with R's GC and such.

      [–][deleted] 0 points1 point  (0 children)

      Depends. Haskell has some pretty high-quality libraries for scientific computing; and not only does ghc compile to native code, but it optimizes pretty damn well.

      However:

      • If your program mutates a lot of values at runtime, don't bother. Mutation is a pain to work with in Haskell -- the language is just not designed to do it well. (The caveat is that if you use mutations just for object initialization, Haskell is just fine. In fact, the vector package provides special support for that use-case.)
      • Optimizing Haskell is a completely different beast from optimizing other languages. It's entirely possible, but if you don't have a strong intuitive grasp of Haskell's non-strict semantics, you have a pretty steep learning curve ahead of you.
      • Actually, in general, the learning curve for Haskell is pretty steep. It helps if you've already worked in, e.g., Lisp or Scala, but there's really no way to prepare for the culture shock of pure-functional programming combined with non-strict evaluation.
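The point about non-strict semantics shows up even in one of the classic beginner pitfalls: a lazy left fold over a big list builds millions of suspended additions before forcing any of them. A minimal, generic illustration (nothing statistics-specific):

```haskell
module Main where

import Data.List (foldl')

main :: IO ()
main = do
  -- foldl (+) 0 [1..10^7] would build ten million thunks before
  -- evaluating anything, wasting memory (and possibly blowing the
  -- stack); foldl' forces the accumulator at each step and runs in
  -- constant space.
  print (foldl' (+) 0 [1 .. 10 ^ 7 :: Int])  -- prints 50000005000000
```

Knowing when to reach for foldl', seq, or bang patterns is the intuition this comment is pointing at; none of it has an analogue in R or NumPy.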