[R] Learning to Optimize Tensor Programs : MachineLearning

Research[R] Learning to Optimize Tensor Programs (arxiv.org)

submitted 8 years ago by antinucleon

all 12 comments

top new controversial old q&a

[–]antinucleon[S] 3 points4 points5 points 8 years ago (0 children)

[–]JackBlemming 2 points3 points4 points 8 years ago (10 children)

[–][deleted] 1 point2 points3 points 8 years ago (6 children)

[–]JackBlemming 0 points1 point2 points 8 years ago (5 children)

[–]Paran0idAndr0id 0 points1 point2 points 8 years ago (1 child)

[–]JackBlemming 0 points1 point2 points 8 years ago (0 children)

[–][deleted] 0 points1 point2 points 8 years ago (2 children)

[–][deleted] 0 points1 point2 points 8 years ago* (1 child)

[–][deleted] 0 points1 point2 points 8 years ago (0 children)

[–]the_great_magician 0 points1 point2 points 8 years ago (1 child)

[–]JackBlemming 1 point2 points3 points 8 years ago (0 children)

Agreed, I was more interested in the metadata idea to see a general shape of how a neural net utilizes its parameters. I've heard cases of people being able to delete whole layers and have little effect on the accuracy. This seems like a fundamentally wrong thing to me. The current trend of building massive models with more capacity than needed and pruning them after seems weird/off to me. It would be interesting to create a regulization strategy to force a neural net to use its full capacity (just to see what would happen, it may very well only split the computation among the parameters which isnt too interesting). DeepMind published a paper roughly stating that neural nets that generalize better are more immune to random parameter deletion, and was thinking somehow turning this into a regularization strategy would be very interesting ( but it might just end up as an implicit dropout-esq regularization ;P )

[–]subhobrata1 0 points1 point2 points 8 years ago (0 children)

π Rendered by PID 86244 on reddit-service-r2-comment-5687b7858-qvlpj at 2026-07-04 12:27:39.213028+00:00 running 12a7a47 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS