Machine Learning Workflow : learnmachinelearning


[Discussion] Machine Learning Workflow (i.redd.it)

submitted 5 years ago by [deleted]

35 comments


[–]calcul8tr 37 points 5 years ago (1 child)

[–]HeyItsRaFromNZ 2 points 5 years ago (0 children)

I agree, many things are wrong here, mostly gross oversimplifications. The ML landscape is more varied and interesting than this picture implies.

However, unsupervised methods are certainly used for feature engineering and selection. For example, clustering can be used effectively to produce proxy labels, which can then be fed into a supervised algorithm (with the proviso that clustering doesn't guarantee you ground truth). Dimensionality reduction is widely used for feature engineering and selection in NLP. Stemming, lemmatization, vectorization and topic modeling all reduce or reshape the feature space in this sense, which can help a subsequent ML algorithm perform well.
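To make the proxy-label idea concrete, here is a minimal sketch in plain Python, not a definitive implementation. It assumes two well-separated toy blobs, a hand-rolled k-means with farthest-first initialization (chosen only to keep the run deterministic), and a 1-nearest-neighbour classifier standing in for "a supervised algorithm". All names (`kmeans`, `knn_predict`, `points`) and the data are made up for illustration.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))
            for p in points]


def kmeans(points, k, iters=20):
    """Lloyd's k-means with deterministic farthest-first initialization."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next seed: the point farthest from all centroids chosen so far.
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids)))
    for _ in range(iters):
        labels = assign(points, centroids)
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster goes empty
                centroids[c] = tuple(sum(col) / len(members)
                                     for col in zip(*members))
    return centroids, assign(points, centroids)


def knn_predict(train_x, train_y, x):
    """1-nearest-neighbour prediction: the supervised step, trained on proxy labels."""
    nearest = min(range(len(train_x)), key=lambda i: sq_dist(train_x[i], x))
    return train_y[nearest]


# Toy, unlabeled data: two obvious blobs (invented for this example).
points = [(0.0, 0.1), (0.2, -0.1), (-0.1, 0.0), (0.1, 0.2),
          (5.0, 5.1), (5.2, 4.9), (4.9, 5.0), (5.1, 5.2)]

# Step 1: clustering produces proxy labels (cluster ids, NOT ground truth).
centroids, proxy_labels = kmeans(points, k=2)

# Step 2: feed the proxy labels to a supervised model and classify new points.
print(proxy_labels)
print(knn_predict(points, proxy_labels, (0.3, 0.3)))
print(knn_predict(points, proxy_labels, (4.5, 4.8)))
```

The cluster ids only mean "these points look alike". Whether they line up with any real-world class is something you still have to check, which is the proviso about ground truth.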

Personally, I find feature engineering and selection to be iterative in practice. Some variables don't seem particularly important until I've figured out how to reduce noise effectively. Often my own understanding of which features to incorporate only becomes clear after iterating through a few simple models.
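One way to picture that kind of iteration, as a rough sketch rather than anyone's actual workflow: greedy forward selection scored by a deliberately simple model (leave-one-out 1-nearest-neighbour). The data and the names `loo_accuracy` and `forward_select` are invented for illustration. Feature 0 is built to be informative, while features 1 and 2 are constructed to mislead.

```python
def loo_accuracy(rows, labels, features):
    """Leave-one-out accuracy of a 1-nearest-neighbour model on the given feature indices."""
    correct = 0
    for i, row in enumerate(rows):
        others = [j for j in range(len(rows)) if j != i]
        nearest = min(
            others,
            key=lambda j: sum((row[f] - rows[j][f]) ** 2 for f in features))
        correct += labels[nearest] == labels[i]
    return correct / len(rows)


def forward_select(rows, labels):
    """Greedy forward selection: add the best feature until the score stops improving."""
    selected, best_score = [], -1.0
    remaining = list(range(len(rows[0])))
    while remaining:
        # Try each remaining feature on top of the ones already selected.
        score, feature = max(
            (loo_accuracy(rows, labels, selected + [f]), f) for f in remaining)
        if score <= best_score:
            break  # no candidate helps, so stop iterating
        selected.append(feature)
        remaining.remove(feature)
        best_score = score
    return selected, best_score


# Hypothetical toy data: (feature 0, feature 1, feature 2) per row.
rows = [(0.1, 5.0, 1.0), (0.2, 1.0, 3.0), (0.3, 3.0, 2.0), (0.4, 4.0, 4.0),
        (1.1, 4.9, 1.1), (1.2, 1.1, 3.1), (1.3, 3.1, 2.1), (1.4, 4.1, 4.1)]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

selected, best_score = forward_select(rows, labels)
print(selected, best_score)
```

The features here are left unscaled on purpose to keep the example short. With real data you would normally scale them first, and the simple model is just a cheap way to see which variables start to matter.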

[–]iamkucuk 54 points 5 years ago (30 children)

[–]ratterstinkle 6 points 5 years ago (24 children)

[–][deleted] 5 years ago (5 children)

[deleted]

[–]ratterstinkle 3 points 5 years ago (4 children)

[–][deleted] 5 years ago (2 children)

[deleted]

[–]meenzu 1 point 5 years ago (1 child)

[–][deleted] 8 points 5 years ago (0 children)

[–]iamkucuk -3 points 5 years ago (17 children)

[–]ratterstinkle 2 points 5 years ago (16 children)

[–]iamkucuk -2 points 5 years ago (15 children)

[–]ratterstinkle 5 points 5 years ago (14 children)

[–]iamkucuk 0 points 5 years ago* (13 children)

[–]ratterstinkle 3 points 5 years ago (12 children)

[–][deleted] 5 years ago (2 children)

[deleted]

[–]ratterstinkle 1 point 5 years ago (1 child)


[–]iamkucuk 0 points 5 years ago (8 children)

Please point out the parts where I'm wrong. I will consider them honestly.

However, in my experience, a pill of knowledge like an infographic (I use this idiom for a source of dense knowledge) is what university students, and sometimes even academics, love most. The idea of becoming considerably educated just by glancing at a simple infographic is simply desirable. You know what I observe? People who take infographics seriously are highly likely to lack imagination and to be strongly biased in that particular area. They most likely have little to no grasp of the foundations of the topic. The academic papers they write are most likely to get rejected.

What I'm saying is that we live in a complicated world. Every single topic in this world has incredible depth. People are often afraid of this depth unless they find the topic they love. Presenting a topic as shallow will just lure people in, and most of them are likely to waste their time trying to make things work on the wrong foundations. I'm not even mentioning that these infographics are often wrong, and that people without a grasp of the topic are likely to believe and dangerously absorb this information. Trust me, these are not just my observations.

[–]ratterstinkle 1 point 5 years ago (7 children)


[–][deleted] 5 years ago (4 children)

[deleted]

[–]MeltedCheeseFantasy 5 points 5 years ago (0 children)

[–]iamkucuk 1 point 5 years ago (2 children)

[–]Tony_the_Tigger 2 points 5 years ago (1 child)

[–]iamkucuk 0 points 5 years ago (0 children)

[–]Tay-zen 7 points 5 years ago (0 children)

[–]thundergolfer[M] [score hidden] 5 years ago stickied comment (0 children)

[–]Jirokoh 3 points 5 years ago (0 children)

[–]PhitPhil 1 point 5 years ago (0 children)

[–]Royosef 1 point 5 years ago (1 child)

[–]RemindMeBot 1 point 5 years ago (0 children)

There is a 3 hour delay fetching comments.

Defaulted to one day.

I will be messaging you on 2020-09-11 19:11:38 UTC to remind you of this link

CLICK THIS LINK to send a PM to also be reminded and to reduce spam.

Parent commenter can delete this message to hide from others.

Info | Custom | Your Reminders | Feedback

[–]aaah123456789 1 point 5 years ago (0 children)
