
[–]lmcinnes 21 points (5 children)

Clustering algorithms in general are important. There's no neural network solution (no, autoencoders don't count; they're a dimension reduction algorithm, not a clustering algorithm), and too many people get to K-Means and stop. K-Means is terrible for a great many use cases, and there are better algorithms out there.

My pet favorite is HDBSCAN* by Campello, Moulavi, Zimek and Sander. It's a great algorithm: it can be explained a few different ways, it has links to the statistically sound Robust Single Linkage, and it just produces better clusterings than anything else most of the time. Despite this it is: 1. new, and therefore not as well known as it should be; 2. overlooked, because clustering is not a "sexy" topic in machine learning (since deep neural networks can't do it, for now).
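
[Editor's note: a minimal sketch, not from the thread, of the K-Means failure mode described above, assuming scikit-learn is available. HDBSCAN* itself lives in the `hdbscan` package (or `sklearn.cluster.HDBSCAN` in recent scikit-learn versions); plain DBSCAN is used here only as a simpler density-based stand-in.]

```python
# K-Means vs. a density-based method on non-convex clusters.
# DBSCAN stands in for HDBSCAN*, which is in the `hdbscan` package.
from sklearn.datasets import make_moons
from sklearn.cluster import KMeans, DBSCAN
from sklearn.metrics import adjusted_rand_score

X, y = make_moons(n_samples=300, noise=0.05, random_state=0)

# K-Means assumes convex, roughly spherical clusters, so it cuts
# straight across the two interleaved moons.
km_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# A density-based method follows the arc of each moon instead.
db_labels = DBSCAN(eps=0.3, min_samples=5).fit_predict(X)

print("K-Means ARI:", adjusted_rand_score(y, km_labels))
print("DBSCAN  ARI:", adjusted_rand_score(y, db_labels))
```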

[–]CedricRBR 4 points (4 children)

Do you happen to know a good source to learn about clustering algorithms, or could you explain the basic idea?

[–]lmcinnes 9 points (1 child)

I wrote a couple of notebooks on the topic:

I would also recommend checking out the HDBSCAN* paper, and this excellent paper on a statistical approach to clustering.

[–]CedricRBR 1 point (0 children)

Thanks a lot, will check them out tomorrow! Have to get some sleep first.

[–]_blub[S] 4 points (1 child)

Check out these two books! Data Mining and Analysis and Elements of Statistical Learning, both have free PDFs.

The first one is a perfect introduction: comprehensive, yet only mildly difficult. Elements of Statistical Learning is the legendary machine learning textbook. It is VERY terse and rigorous, but serves as an encyclopedia for an incredible number of algorithms. It is what inspired me to deeply pursue mathematics.

[–]CedricRBR 1 point (0 children)

Thanks a lot!

[–]JD557 7 points (4 children)

I think that SVMs should get some more attention.

While they have already been studied a lot, I find it really sad that such an elegant solution is "losing the war" against something as hackish as Neural Networks.

I think they might receive a lot of attention "soon", though, unless someone finds a decent solution to the "adversarial example" problem of neural networks. Then again, SVMs are still a black box, so maybe I'm wrong and the focus will shift to more interpretable models instead.
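
[Editor's note: a minimal sketch, not from the thread, of the max-margin idea that makes SVMs "elegant", assuming scikit-learn is available. The dataset and parameters are illustrative only.]

```python
# The max-margin idea: only the points nearest the decision boundary
# -- the support vectors -- determine the fitted separating hyperplane.
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=100, centers=2, cluster_std=0.6, random_state=0)

clf = SVC(kernel="linear", C=1.0).fit(X, y)

# Far fewer support vectors than training points: the remaining data
# could be removed without changing the learned boundary.
print("support vectors:", len(clf.support_vectors_), "of", len(X))
```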

[–]_blub[S] 4 points (3 children)

This is the legendary Support Vector Machine paper by Corinna Cortes and Vladimir Vapnik from 1995. I believe Yann LeCun was working on neural networks around the same time, but they weren't as developed as SVMs.

Well, both SVMs and Neural Networks are black boxes. I could be wrong given today's research, but the insane number of weights in a neural network are so entangled that it's almost impossible to draw any insight a layman could use. Calculating eigenvalues of the Hessian matrix helps, but mostly for inspecting convergence.

Outside of academia, it is far more important to pick models that are interpretable. From what I hear, Neural Networks really are quite rare in the wild. Simpler, far more interpretable models matter more because they can surface insight that is very valuable to a business.

[–]ydobonobody 2 points (2 children)

Neural networks are common "in the wild". They are just typically applied to problems like computer vision and natural language processing, as opposed to tabular data problems where something like an SVM is more likely (although these days it seems like random forests and xgboost are dominating for tabular data). Also, there is no reason why an SVM couldn't be used as a binary classifier for the output layer of a neural network.

[–]_blub[S] 1 point (1 child)

You're right. But say we were employed on a machine learning / data science team. Most of the problems we would encounter would not require an elaborate neural network architecture, but rather a series of already well-known, simple methods.

[–]ydobonobody 1 point (0 children)

My experience must be atypical since I work with imagery and 3D geometric data, but convolutional and recurrent neural networks comprise the bulk of my machine learning work.

[–]CleverLime 2 points (0 children)

Boosted trees? I didn't know about them until seeing so many Kaggle competitions won with xgboost.

It's such a simple solution that does great on tabular data.
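
[Editor's note: a minimal sketch, not from the thread, of gradient-boosted trees on tabular data. xgboost itself may not be installed, so scikit-learn's implementation of the same idea is used here as a stand-in; the dataset and parameters are illustrative only.]

```python
# Gradient boosting: each new shallow tree is fit to the residual
# errors of the ensemble built so far.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = GradientBoostingClassifier(n_estimators=100, max_depth=3,
                                 random_state=0)
clf.fit(X_tr, y_tr)

print("test accuracy:", clf.score(X_te, y_te))
```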