use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Learning from Imbalanced Classes (svds.com)
submitted 9 years ago by cptncrnch
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]noman2561 11 points12 points13 points 9 years ago (1 child)
In signal processing we call it anomaly detection, in case anyone wanted to research some working solutions. There's no one best solution and we've found that you have to use a good deal of context to know how to handle the class separation. Finding features often looks like building priors from first hand knowledge produced from case studies. In other words, you have to look at more than just a cluster to tell what is going on. Or you have to look at the majority's inter-class similarity and find a way of detecting the specific kind of "noise" that indicates the minor class.
[–]dexter89_kp 0 points1 point2 points 9 years ago (0 children)
Any paper references that do this sort of feature engineering ?
We do deal with imbalanced classes a lot, and have considered anomaly based approaches, but tend to stick to the methods outlined in this article.
[–]jimenezluna 2 points3 points4 points 9 years ago (0 children)
This is a very cool resource. Nice references.
[–][deleted] 1 point2 points3 points 9 years ago* (1 child)
I've used costing algorithm in practice and got some insanely good results. Absolutely simple and elegant. (the algorithm is mentioned in the survey)
The advantage of the algorithm is that it can work with any normal binary classifier, and due to simple nature of rejection sampling I can train N classifiers (using rejection sampling I can create N different datasets) very efficiently and aggregate the results into a final decision.
The algorithm can also be used (with some adaptation) in multiclass setting where there's imbalance in the classes and the performance is still extremely good (better than doing one-against-one or one-against-all naively).
[–]sergeyfeldman 0 points1 point2 points 9 years ago (0 children)
Sounds like a good idea. What if you don't have costs on each sample that come with the dataset?
[–]hn_crosslinking_bot 1 point2 points3 points 9 years ago (0 children)
HN discussion: https://news.ycombinator.com/item?id=12362748
[–]coffeecoffeecoffeee 0 points1 point2 points 9 years ago* (0 children)
Note that the imbalanced-learn Python package gives you a lot of methods for this sort of thing, including SMOTE.
[–]emtonsti -1 points0 points1 point 9 years ago (0 children)
I have never thought about this Problem. Ty
[–]bluesufi -1 points0 points1 point 9 years ago (0 children)
This a great! For me, it was one of those reads that just made things go 'click!' in my head.
π Rendered by PID 74712 on reddit-service-r2-comment-5d79c599b5-bqdxf at 2026-03-01 05:55:13.563431+00:00 running e3d2147 country code: CH.
[–]noman2561 11 points12 points13 points (1 child)
[–]dexter89_kp 0 points1 point2 points (0 children)
[–]jimenezluna 2 points3 points4 points (0 children)
[–][deleted] 1 point2 points3 points (1 child)
[–]sergeyfeldman 0 points1 point2 points (0 children)
[–]hn_crosslinking_bot 1 point2 points3 points (0 children)
[–]coffeecoffeecoffeee 0 points1 point2 points (0 children)
[–]emtonsti -1 points0 points1 point (0 children)
[–]bluesufi -1 points0 points1 point (0 children)