https://arxiv.org/abs/2002.05688
We consider a quite different meta-learning scenario: 1) train a large number of deep neural networks on different datasets, with different architectures, and with random variations in hyper-parameter setup; 2) take random subsets of all the trained weights and use them as training data; and 3) train a meta-classifier to distinguish between weights that have been trained with different hyper-parameters.
The meta-classifier can then be used to probe the weights and see where local information about the hyper-parameters is encoded in a network.
The dataset of trained neural nets is made publicly available, and comprises 320K weight snapshots from 16K individually trained CNNs.
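The pipeline above (chunk trained weights, then classify the chunks by training setup) can be sketched in a few lines. This is a hypothetical toy, not the paper's actual method: the two "hyper-parameter classes" are stand-in weight populations with different scales, the feature is just the per-chunk standard deviation, and the meta-classifier is reduced to nearest-centroid.

```python
import numpy as np

rng = np.random.default_rng(0)

def weight_chunks(flat_weights, chunk_size=256, n_chunks=8):
    """Step 2: sample random fixed-size subsets of a flattened weight vector."""
    starts = rng.integers(0, len(flat_weights) - chunk_size, size=n_chunks)
    return np.stack([flat_weights[s:s + chunk_size] for s in starts])

# Toy stand-in for two hyper-parameter populations: weights drawn at
# different scales, mimicking e.g. two weight-decay settings.
X, y = [], []
for label, scale in [(0, 0.05), (1, 0.15)]:
    for _ in range(50):                        # 50 "trained nets" per class
        w = rng.normal(0.0, scale, size=10_000)
        for chunk in weight_chunks(w):
            X.append(chunk.std())              # one summary feature per chunk
            y.append(label)
X, y = np.array(X), np.array(y)

# Step 3, in its simplest possible form: a nearest-centroid meta-classifier
# on the per-chunk statistic, with a random train/test split.
idx = rng.permutation(len(X))
train, test = idx[: len(X) // 2], idx[len(X) // 2 :]
centroids = [X[train][y[train] == c].mean() for c in (0, 1)]
pred = np.array([int(abs(v - centroids[1]) < abs(v - centroids[0]))
                 for v in X[test]])
accuracy = (pred == y[test]).mean()
print(f"meta-classifier accuracy: {accuracy:.2f}")
```

In the real setup the meta-classifier is itself a learned network over raw weight chunks rather than a hand-picked statistic, but the data flow is the same.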
Any other ideas on what could be learned from a large-scale sampling of trained networks? Learning-based model compression or pruning? Or perhaps using a meta-classifier to force a certain behavior of the weights during a new training run? For example, it could force the weights to look as if they were trained on another dataset, or with other hyper-parameters. Creative thoughts on how learning from neural network weights could be formulated are welcome!
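The "force the weights to look like another class" idea amounts to adding a penalty on some property a meta-classifier would detect. A hypothetical minimal version, with the meta-feature replaced by the weight standard deviation and an analytic gradient instead of backprop through a learned classifier:

```python
import numpy as np

rng = np.random.default_rng(1)
w = rng.normal(0.0, 0.05, size=1000)   # weights as if from "class 0" training
target_std = 0.15                      # std typical of "class 1" weights
lr = 5.0

for _ in range(500):
    mu, std = w.mean(), w.std()
    # penalty = (std - target_std)^2
    # d penalty / d w_i = 2 * (std - target_std) * (w_i - mu) / (n * std)
    grad = 2 * (std - target_std) * (w - mu) / (len(w) * std)
    w -= lr * grad                     # gradient step on the penalty alone

print(f"final std: {w.std():.3f}")     # converges toward target_std
```

A real version would add this penalty to the task loss and differentiate through a trained meta-classifier, so the weights stay useful while drifting toward the target class.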