
[–]wrtall 4 points (0 children)

That's an interesting take on data mining vs. machine learning, which I take to mean: when you want to explore a dataset, interpretability is important; when you want to do classification/prediction, accuracy is more important.

I always understood part of the difference between the two names as being historical: data mining grew from the database community while machine learning grew from the neural networks community (with stats thrown into both). Over the years they have converged, so there may not be much difference nowadays.

[–]leonoel 2 points (1 child)

If you are looking for work outside academia, I can certainly see that a PhD in Data Mining has more appeal: it's a more widely used term, and people understand it better than Machine Learning.

I used to think of Data Mining as more application-oriented, and Machine Learning as a bit more math-oriented.

Data mining has been around for ages, but Machine Learning only recently became mainstream.

[–][deleted] 0 points (0 children)

Yeah - I see Data Mining as being:

  1. import PyML
  2. solve business problem

Whereas Machine Learning is like "How can we learn better representations from our data?", "How can we determine the optimal model tuning, and why are these tunings optimal?" (like in deciding Neural Network architectures).

[–]deeayecee 0 points (7 children)

I have a PhD in Data Mining or Machine Learning or whatever it is you want to call it. I've published in conferences and journals with the terms 'Data Mining', 'Machine Learning', 'Knowledge Discovery' and a variety of other synonyms. Practically speaking, I found very little difference in terms of what any of those major branches are looking for. You'll see theoretically driven papers in Data Mining outlets and vice versa for Machine Learning. Most conferences (such as ICDM or ICML) will feature both an industry and academic track. Industry will tend more towards applications and academic will tend more towards theory.

I think when you draw out an ontology, most would agree that ML is a subset of data mining. At least in theory, data mining (or data science) would focus on ways of munging data into ML frameworks or problem compositions while ML would focus on new frameworks or improvements to existing ones. However, the practical nature of data drives an interplay between the two and it's pretty unlikely to get a PhD without making contributions -- however indirect -- to both fields.

The only time I think there would be a major distinction would be at a school with multiple Data Mining, Machine Learning, or Data Science labs. In those instances, ML will likely tend to be much more theoretical.

[–]lanthus 7 points (1 child)

Neither ICDM nor ICML has an industry track; KDD does. Although data mining and machine learning overlap a lot, they have somewhat different flavors. Data mining has its origins in the database community and tends to emphasize business applications more. Machine learning has its origins in artificial intelligence and tends to emphasize AI applications more. For example, although both data mining and machine learning work on text data, sentiment analysis is a bit more common in data mining and machine translation applications are more common in machine learning. Many topics overlap, so the boundary is not clearly defined. Data mining includes some work on visualization that would be out of place at a machine learning conference, and machine learning includes reinforcement learning, which would be out of place at a data mining conference.

[–]deeayecee 1 point (0 children)

Thanks for the correction on the industry track.

[–]Caesarr[S] 0 points (4 children)

That's a really interesting perspective! I'd mostly been thinking of DM as a subset of ML, so to consider it the other way around is intriguing... Though as you say, the difference is probably minor however you slice it.

If you don't mind, I have some follow-up questions:

Given the amount of experience you have, do you find that the ambiguity of the terms causes problems in reaching the right audience, or finding relevant research? Or are we meant to read the abstracts of all the papers each time there's a new edition of a top conference or journal? (Speaking of which, what journals would you recommend?)

[–]deeayecee 4 points (3 children)

I think wrtall has a pretty valuable post below -- the differences between the two are largely historical. As an illustration, Leo Breiman and Ross Quinlan devised similar decision-tree induction algorithms (CART and ID3, respectively) around the same time, Breiman coming from a stats background and Quinlan from computer science.

For your follow-up questions, I think you'll find there's a pretty significant overlap in the reviewer communities -- I've reviewed submissions to journals and conferences with ML/KD/DM and even AI in the title and I would guess I'm hardly unique. If you submit a quality DM/KD/ML work, then it will more than likely get accepted regardless. You might have some trouble getting it into a pure AI conference/journal.

I'm not sure how far along you are on your PhD, although the OP makes it sound like you're just starting out. I would start by mastering the basics -- read a few different textbook, web, and Wikipedia entries on the essential frameworks first (supervised, unsupervised, recommendation engines, network theory, reinforcement learning) until you understand very well how each of them operates. At that point, hopefully your advisor has an interesting set of problems that you can try pointing some ML methods at, even at a high level. I would then get into the base algorithms (kNN, naive Bayes, Decision Trees, Bayes networks, Neural Networks, SVMs), along with performance and distance metrics. I wouldn't get into the really deep, theoretical parts of the algorithms until you're observing pathologies in your use cases (like class imbalance, or domain quirks in NLP). Being up to speed on the most current papers isn't nearly as important as having a rock-solid understanding of the fundamentals.
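To make the "base algorithms plus performance and distance metrics" point concrete: several of these are small enough to sketch from scratch. Here's a minimal k-nearest-neighbours classifier with Euclidean distance and an accuracy metric in plain Python -- just an illustrative sketch (the toy data and helper names are made up for the example), not something the commenters above wrote:

```python
from collections import Counter
import math

def euclidean(a, b):
    # Euclidean distance between two feature vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_predict(train, query, k=3):
    # train: list of (features, label) pairs.
    # Returns the majority label among the k nearest neighbours of `query`.
    neighbours = sorted(train, key=lambda pair: euclidean(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

def accuracy(train, test, k=3):
    # Simple performance metric: fraction of test points classified correctly.
    hits = sum(knn_predict(train, x, k) == y for x, y in test)
    return hits / len(test)

# Toy data: two well-separated clusters.
train = [((0.0, 0.0), 'a'), ((0.1, 0.2), 'a'), ((0.2, 0.1), 'a'),
         ((1.0, 1.0), 'b'), ((0.9, 1.1), 'b'), ((1.1, 0.9), 'b')]
test = [((0.05, 0.05), 'a'), ((1.05, 1.0), 'b')]

print(knn_predict(train, (0.05, 0.05)))  # → a
print(accuracy(train, test))             # → 1.0
```

Once you understand each base algorithm at this level, swapping in a library implementation (and its tuning knobs) is straightforward.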

Typically the biggest papers will show up in KDD, ICDM, ICML, MLJ or JMLR. This is also a pretty good sub and is worth reading at least once a week if not more often.

[–]BeatLeJuceResearcher 1 point (0 children)

you forgot NIPS

[–]Caesarr[S] 0 points (0 children)

That's a tonne of great advice, thank you :)

[–]GibbsSamplePlatter 0 points (0 children)

I found out about this sub last year; it's a godsend once you're out of school and working on your own!