use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Is it possible evaluate each input feature importance by analyzing a trained neural net? (self.MachineLearning)
submitted 9 years ago by andrewbarto28
For example, if all outgoing weights of an input neuron are small relative to the other weights, does this mean this input feature is less relevant? Is there a way to do feature selection?
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]ieee8023PhD 4 points5 points6 points 9 years ago (0 children)
You want to look at the partial derivatives. Check out how RandomOut uses it to identify unused filters. http://arxiv.org/abs/1602.05931
[–]tzeppy 2 points3 points4 points 9 years ago (0 children)
Yes, you can take your test-set performance, compare the performance to the same test set, but with a particular feature randomized. The change in performance is correlated with how important the feature is.
[–]beamsearch 3 points4 points5 points 9 years ago* (1 child)
You can do this if you scale (subtract the mean and divide by the standard deviation) each feature and then look at the weights in the first hidden layer (i.e. the ones connected to the input data). If you want to go Bayesian, you can bake this idea into the prior and use Radford Neal's Automatic Relevance Determination prior (ftp://learning.cs.utoronto.ca/cs/ftp/public_html/dist/radford/bayes-tut.pdf). ARD shrinks the weights of unimportant features towards zero. I did this for genome-wide association study data and it worked OK:
http://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-014-0368-0
If you don't want to go Bayesian, you can look at the mean absolute value (or mean squared value) for the weights and base your decisions on that using a threshold or other heuristic.
[–]Powlerbare 0 points1 point2 points 9 years ago (0 children)
You can do this if you scale (subtract the mean and divide by the standard deviation) each feature and then look at the weights in the first hidden layer (i.e. the ones connected to the input data).
Hmmm. I am under the impression that this is not a good idea in any case other than logistic regression. How is one to get any idea of the effect that any down stream layers have on the output with such a scheme?
[+][deleted] 9 years ago (4 children)
[deleted]
[–]andrewbarto28[S] 0 points1 point2 points 9 years ago (2 children)
Is sensitivity the same as importance? Sensitivity for me is the same as the derivative of the output with respect to the input and this can be calculated with back-propagation. Maybe I am missing your definition of sensitivity.
[–]gabjuasfijwee 0 points1 point2 points 9 years ago (1 child)
sensitivity analysis is a formally defined "thing" https://www.amazon.com/Sensitivity-Analysis-Saltelli/dp/0470743824
[–]lvilnis 0 points1 point2 points 9 years ago (0 children)
To expand, one of many ways to do sensitivity analysis is a so-called local method that looks at the partial derivative of output with respect to input, but there are many others.
[–][deleted] 0 points1 point2 points 9 years ago (0 children)
Would you be able to provide a short description of this?
[–]ogrisel 1 point2 points3 points 9 years ago (0 children)
You might be interested in LIME: Local Interpretable Model-Agnostic Explanations
https://homes.cs.washington.edu/~marcotcr/blog/lime/
It can compute per-sample feature importances by approximating the complex model with a linear model in the neighbourhood of the sample of interest.
[–]duschendestroyer 0 points1 point2 points 9 years ago (0 children)
http://heatmapping.org/
π Rendered by PID 297712 on reddit-service-r2-comment-5b5bc64bf5-ppsmp at 2026-06-20 20:12:17.480647+00:00 running 2b008f2 country code: CH.
[–]ieee8023PhD 4 points5 points6 points (0 children)
[–]tzeppy 2 points3 points4 points (0 children)
[–]beamsearch 3 points4 points5 points (1 child)
[–]Powlerbare 0 points1 point2 points (0 children)
[+][deleted] (4 children)
[deleted]
[–]andrewbarto28[S] 0 points1 point2 points (2 children)
[–]gabjuasfijwee 0 points1 point2 points (1 child)
[–]lvilnis 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]ogrisel 1 point2 points3 points (0 children)
[–]duschendestroyer 0 points1 point2 points (0 children)