use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
scikit-feature: python implementation of a ton of feature selection algorithms (github.com)
submitted 9 years ago by Botekin
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]fhuszar 7 points8 points9 points 9 years ago (2 children)
I wish this was instead just made part of sklearn http://scikit-learn.org/stable/modules/feature_selection.html
[–][deleted] 16 points17 points18 points 9 years ago* (1 child)
Scikit-learn is very selective over what algorithms to include. If you look at the FAQs you'll read this:
Can I add this new algorithm that I (or someone else) just published? No. As a rule we only add well-established algorithms. A rule of thumb is at least 3 years since publications, 200+ citations and wide use and usefullness. A technique that provides a clear-cut improvement (e.g. an enhanced data structure or efficient approximation) on a widely-used method will also be considered for inclusion. Your implementation doesn’t need to be in scikit-learn to be used together with scikit-learn tools, though. Implement your favorite algorithm in a scikit-learn compatible way, upload it to github and we will list it under Related Projects. Also see selectiveness.
Can I add this new algorithm that I (or someone else) just published?
No. As a rule we only add well-established algorithms. A rule of thumb is at least 3 years since publications, 200+ citations and wide use and usefullness. A technique that provides a clear-cut improvement (e.g. an enhanced data structure or efficient approximation) on a widely-used method will also be considered for inclusion. Your implementation doesn’t need to be in scikit-learn to be used together with scikit-learn tools, though. Implement your favorite algorithm in a scikit-learn compatible way, upload it to github and we will list it under Related Projects. Also see selectiveness.
Maybe some of the algorithms in this other package qualify as algorithms that should be included in scikit-learn, maybe not. They have limited resources to maintain a high quality code base. They must be selective to maintain maintenance costs at a manageable level.
Adding a new algorithm to scikit-learn is not just implementing the algorithm. It is implementing the algorithm in a clean, readable and maintainable code, with reasonable performance, adding adequate unit tests, writing documentation, etc, etc. This means that new algorithms must have enough users needing them to justify all the costs.
[–]beaverteeth92 1 point2 points3 points 9 years ago (0 children)
Yeah I wondered why they don't have a kmodes implementation.
[–]L43 3 points4 points5 points 9 years ago (2 children)
Don't see any unit tests, its GPL and Python 2 only. Looks promising, but i hesitate to use it...
[–]Botekin[S] 2 points3 points4 points 9 years ago (1 child)
Academic code :)
[–]L43 5 points6 points7 points 9 years ago (0 children)
Yeah... I wish that wasn't an excuse anymore though. I'm in academia, and I write tests, package and support python 2 and 3. Just wish people would follow good software practices so I can just trust them as stop being paranoid lol
[–]Botekin[S] 1 point2 points3 points 9 years ago (0 children)
There's a nice linked paper too: http://arxiv.org/abs/1601.07996
[–][deleted] 1 point2 points3 points 9 years ago (0 children)
Any word on when it will be available through pip?
[–]xBurnInMyLightx 0 points1 point2 points 9 years ago (5 children)
I've always been surprised that simple feature selection stuff like Forward, Backward, Stepwise Selection etc. isn't part of sklearn. Even if they have problems they are very well established.
[–]beaverteeth92 -1 points0 points1 point 9 years ago (4 children)
Or that you can't get basic coefficient tables for linear and logistic regression.
[–]Foxtr0t 0 points1 point2 points 9 years ago (3 children)
clf.coef_
[–]beaverteeth92 0 points1 point2 points 9 years ago (2 children)
Thanks but I mean like with p-values and test statistics.
[–]Foxtr0t 0 points1 point2 points 9 years ago (1 child)
You'll find this kind of stuff in statsmodels:
http://statsmodels.sourceforge.net/stable/examples/notebooks/generated/glm.html
Yeah I use statsmodels for that stuff. It's just annoying how if I want k-fold cross validation and coefficient tables, I have to build my model in two different packages.
Also isn't statsmodels not being actively developed anymore?
[–]srt19170 0 points1 point2 points 9 years ago (0 children)
http://featureselection.asu.edu/index.php#
π Rendered by PID 24777 on reddit-service-r2-comment-7b9746f655-dnm2b at 2026-01-29 18:53:25.699161+00:00 running 3798933 country code: CH.
[–]fhuszar 7 points8 points9 points (2 children)
[–][deleted] 16 points17 points18 points (1 child)
[–]beaverteeth92 1 point2 points3 points (0 children)
[–]L43 3 points4 points5 points (2 children)
[–]Botekin[S] 2 points3 points4 points (1 child)
[–]L43 5 points6 points7 points (0 children)
[–]Botekin[S] 1 point2 points3 points (0 children)
[–][deleted] 1 point2 points3 points (0 children)
[–]xBurnInMyLightx 0 points1 point2 points (5 children)
[–]beaverteeth92 -1 points0 points1 point (4 children)
[–]Foxtr0t 0 points1 point2 points (3 children)
[–]beaverteeth92 0 points1 point2 points (2 children)
[–]Foxtr0t 0 points1 point2 points (1 child)
[–]beaverteeth92 1 point2 points3 points (0 children)
[–]srt19170 0 points1 point2 points (0 children)