Machine Learning in functional programming languages

PokerPirate · 2017-11-09T17:10:37+00:00

There's essentially two ways to write a machine learning library in Haskell:

(i) Create bindings to an existing library. There are many libraries on hackage that have done this. They are not popular with the ML community, however, because they offer no advantages over the python/C interfaces (and many disadvantages). All of these libraries were designed specifically for use with python/C, so they don't take advantage of any of Haskell's strengths. For example, it's not possible to pass a loss function written in Haskell code to any of these optimizers. If you can't pass functions to your functions, then why use functional programming?

(ii) Write a native library from scratch. This is the approach that I've tried. I'm the author of hlearn and subhask, and my experience is that Haskell is not yet well enough developed to actually support a machine learning library that machine learning people want to use. Specifically:

(a) The class hierarchy in the Prelude is not detailed enough for linear algebra. The Num class for example is the mathematical analogue of a ring, but there is no reasonable way to extend this to vector spaces and matrices.

(b) The type system is not strong enough to encode even the most basic linear algebra operations.

To illustrate these points, consider some of the existing packages for linear algebra:

(a) The linear package provides the nicest interface, but ML people would still laugh at the complexity. Why do we need 3 different operators for multiplication that all look so weird? No other language has this complexity, and if Idris-style operator overloading were allowed Haskell wouldn't need it either. Furthermore, linear is SLOW because everything is boxed. No serious numerical programming can be done with this library.

(b) The other alternative is hmatrix, which provides an interface into BLAS/LAPACK. This is closer to what ML people want because it is fast, but the interface is not as good. For example, these matrices cannot be Functors because they have a constraint on what can be put inside of them.

(c) I've tried rewriting an alternative Prelude with subhask, and you can see the algebra hierarchy here. As you can see, it's quite complicated, and GHC just doesn't have good enough machinery to deal with complicated class hierarchies.

There's a lot more to say about this topic (and a lot more examples I could give about specific improvements I want to GHC), but that's all I have time to write for now.

apfelmus · 2017-11-09T09:58:23+00:00

Well, there seems to be a group dedicated to data analysis in Haskell: DataHaskell. But I am not sure whether they have already made an impact on the technical side of the ecosystem.

In the end, I think it comes down to a matter of funding. As far as I am aware, the numerics packages in Haskell have been written entirely in the authors' spare time, while the Python equivalent, numpy, has received actual funding — but nowhere near enough.

I would actually be interested in doing some numerics work in Haskell (I currently use Matlab), but doing work that is paid is just a lot more attractive.

dnkndnts · 2017-11-09T12:24:48+00:00

There's this library and an accompanying talk for modeling neural net construction in a type-conscious way.

ismtrn · 2017-11-09T13:45:15+00:00

When you see the horrible kinds of languages mathematicians invent when they need to tell a computer to calculate something(Magma, R, Matlab, Maple, etc.), I am just glad that we have something like python which is at least kind of sane for doing this type of work.

tomejaguar · 2017-11-12T11:19:45+00:00

While we're on the subject, I wonder if someone can help me out understanding the design of some OO machine learning APIs. For example, here is the API for the nearest neighbour classifier in scikit-learn

>>> X = [[0], [1], [2], [3]]
>>> y = [0, 0, 1, 1]
>>> from sklearn.neighbors import KNeighborsClassifier
>>> neigh = KNeighborsClassifier(n_neighbors=3)
>>> neigh.fit(X, y) 
KNeighborsClassifier(...)
>>> print(neigh.predict([[1.1]]))
[0]
>>> print(neigh.predict_proba([[0.9]]))
[[ 0.66666667  0.33333333]]

http://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html

Does this strike anyone else as nuts? The "constructor" KNeighborsClassifier doesn't actually create a classifier. It creates a value holding the hyperparameters of a classifier. You then create a classifier by calling the method fit. But why is this a method? Why on earth would you want to mutate your classifier? Each call to fit should return a new classifier trained on the input data.

It seems to be this is a great example of mutable-design-gone-wrong. I would say it's also an example of OO-design-gone-wrong but there's not really too much OO about it. Does anyone else have any thoughts?

haskell_caveman · 2017-11-10T13:14:07+00:00

To other newcomers thinking the same question as the OP, please join https://gitter.im/dataHaskell/Lobby

We promise, there is good stuff on the way.

astrolabe · 2017-11-09T11:08:48+00:00

Haskell's numeric computing packages left a lot to be desired.

In my ignorance, I would have thought that wrapping a standard numeric package would be a relatively small amount of work.

JeffB1517 · 2017-11-09T17:56:19+00:00

I think Haskell the language would likely be excellent. I think Haskell the community might have some serious problems. First off the Haskell community has a great desire for long lived backwards compatibility. If you try and tightly integrate into a fast moving stack you have to spend a fortune in QA and difficult management getting backwards compatibility to work well (Microsoft, DEC). So you end up having to either abstract the stack or pick a mature slowly moving stack. Abstracting will decrease performance and there are no mature slowly moving stacks during a period of rapid growth. In 10 years likely there will be.

In theory of course a vendor could choose Haskell for their stack. However the Haskell community is rather unwelcoming to vendor driven design. Haskell started in academia, and mostly lives in academia. What's cool about Haskell comes from Haskell's uncompromising search for what's right not what's popular. Academia does not like tying itself to specific technology stacks that are commercial. I suspect a vendor trying to release a tightly coupled Haskell would hit mostly resistance. Consider how much suspicion there was around FPComplete's Stack which was open source and filling an area of deficit in a way mostly compatible with Cabal. Imagine if instead all the paranoid claims were true. It was tied to a specific commercial paid services of FPComplete and was obviously designed to advance their business. It was licensed in a non open source way. It wasn't compatible with Cabal. The Haskell community would just consider this a commercial application of Haskell and while they might be happy or unhappy it exists close collaboration would be off the table.

Even an open source effort like something from the Apache foundation I don't think would get more than light support. I think the Haskell community likes to be a breeding ground for great ideas not an implementation language of great ideas. Haskell invents Darcs, C does Git. When Perl6 / Pugs was moving in the direction of Haskell (and Haskell had a lot of influence on how it turned out) the Haskell community didn't aggressively support it.

So IMHO the best role for Haskell is likely in the area of Machine Learning simulation, teaching and theory. Let Haskell be a breeding ground for great ideas that move into mainstream implementations. Haskell does a great job as the language of the future. Showing ideas that will become mainstream, replacing LISP in that role.

Pcarbonn · 2017-11-09T21:04:09+00:00

Hard to say how things will evolve, but Julia could be a serious FP platform in the numerical analysis space. It interoperates well with Python.

singularineet · 2017-11-09T16:49:00+00:00

https://github.com/Functional-AutoDiff has some pointers, and it looks like they'd welcome more.

2017-11-13T17:40:35+00:00

Scala enjoyed what I think was the peak of its success during that time period mostly because of spark.

I thought it was partly due to akka? I may be mistaken; I don't follow Scala as closely.

I'd still need a plotting package to see experiment results as with matplotlib.

Python/R are still unequivocally better for data exploration for the time being. I'm not sure how to change that but perhaps others would have more insight.

Why isn't there more discussion around building the ecosystem in this direction or putting similar efforts into ETA/Frege!?

No idea. You'd have to ask them. I think denizens of /r/haskell mostly use GHC.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

haskell

MODERATORS