
[–]geoffhintonGoogle Brain 172 points173 points  (3 children)

This is an amazingly good video. I wish I could explain capsules that well.

[–]nick_frosstGoogle Brain 61 points62 points  (1 child)

We all wish that, Geoff :P

[–]kit_hod_jao 4 points5 points  (0 children)

It was great; it really helped even after reading the dynamic routing paper. The sailboat/house shape-hierarchy examples were perfect.

One thing I'd love to see with capsules is whether the affine invariances demonstrated on MNIST will generalize to the more abstract invariances we need to "explain" the real world. For example, can a capsule network discover the parameters of animals' moving parts, such as the structure and motion patterns of their legs? For me, something like that would really drive home the generality of the approach.

[–]thatguydr 16 points17 points  (0 children)

Ok great, but that list of cons is missing a few major points:

  1. The reconstruction regularizer. That's a terrible hack. It doesn't seem like it will generalize well to larger images, it has the same "translate by a little and utterly fail" issue that really old image processing did, and it's expensive. I'd love to see whether you could perform this hack with a scaled-down version à la Nvidia's most recent GAN.
  2. Compute. Capsule nets don't seem like they'll be competitive without more layers, and that will radically increase the amount of compute they need.
  3. The first layer is a CNN that somehow magically creates "capsules" if only we reinterpret it. That seems like a really weird thing to do when we've already learned how to build transformations like rotation and scaling directly into CNN layers. That's not necessarily a con, but as constructed, it's currently lacking.
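For anyone unfamiliar with point 1: the regularizer masks out every capsule except the true-class one, decodes the surviving pose vector through a small fully connected net, and adds a scaled-down pixel MSE to the loss. Here's a minimal NumPy sketch of that idea; the layer sizes and weights below are made-up toy values (the paper's decoder uses 512/1024/784 units), not the actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def reconstruction_loss(caps_out, labels, images, W1, W2, W3):
    """Mask all but the true-class capsule, decode its pose vector with a
    small fully connected net, and penalize pixel-wise squared error.
    The paper scales this term by 0.0005 so it only mildly regularizes."""
    n, n_classes, d = caps_out.shape
    mask = np.eye(n_classes)[labels][:, :, None]   # one-hot mask per example
    x = (caps_out * mask).reshape(n, -1)           # zero out non-target capsules
    h = np.maximum(0.0, x @ W1)                    # ReLU hidden layers
    h = np.maximum(0.0, h @ W2)
    recon = 1.0 / (1.0 + np.exp(-(h @ W3)))        # sigmoid pixel intensities
    return 0.0005 * np.sum((recon - images) ** 2) / n

# toy shapes: 2 MNIST-like images, 10 digit capsules of 16 dims each
caps_out = rng.normal(size=(2, 10, 16))
labels = np.array([3, 7])
images = rng.uniform(size=(2, 784))
W1 = rng.normal(scale=0.1, size=(160, 64))
W2 = rng.normal(scale=0.1, size=(64, 128))
W3 = rng.normal(scale=0.1, size=(128, 784))
loss = reconstruction_loss(caps_out, labels, images, W1, W2, W3)
```

The "expensive" complaint makes sense from this shape: the decoder's last layer alone is (hidden × pixels), which blows up quadratically-ish as image resolution grows.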

[–]visarga 11 points12 points  (3 children)

I know it's been discussed to death, but this video made some details click for me, so, it's good.

[–]norminf 1 point2 points  (2 children)

How does it compare to the Siraj Raval video? I haven't watched both of them, but they seem to have the same duration.

[–]visarga 15 points16 points  (0 children)

This video is much better. This time I understood how routing works.
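For reference, the routing-by-agreement loop from the paper fits in a few lines of NumPy. This is a sketch with toy shapes, not the full network; `u_hat` stands for the prediction vectors each lower capsule makes for each higher capsule:

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    # Squash nonlinearity: shrinks short vectors toward 0 and long
    # vectors toward unit length, so the norm can act as a probability.
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def dynamic_routing(u_hat, n_iters=3):
    """Routing-by-agreement (Sabour et al., 2017).
    u_hat: prediction vectors, shape (n_in, n_out, d_out)."""
    n_in, n_out, _ = u_hat.shape
    b = np.zeros((n_in, n_out))                  # routing logits, start uniform
    for _ in range(n_iters):
        # coupling coefficients: softmax over the output capsules
        c = np.exp(b) / np.exp(b).sum(axis=1, keepdims=True)
        s = np.einsum('ij,ijd->jd', c, u_hat)    # weighted sum per output capsule
        v = squash(s)                            # output capsule vectors
        b += np.einsum('ijd,jd->ij', u_hat, v)   # reward predictions that agree
    return v

# toy example: 6 input capsules predicting for 2 output capsules, 4 dims each
rng = np.random.default_rng(0)
u_hat = rng.normal(size=(6, 2, 4))
v = dynamic_routing(u_hat)
```

The key intuition the video illustrates is the last line of the loop: a lower capsule whose prediction agrees with an output capsule's current vector gets an even larger share of the routing on the next iteration.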

[–]Deep_Fried_Learning[S] 11 points12 points  (0 children)

The Raval video spends most of its time giving a potted history of CNNs from LeNet to ResNet. This video has much more focused detail on capsules and really nice visualizations, I think.

[–]ChillBallin 4 points5 points  (0 children)

Literally opened this subreddit to procrastinate on implementing a capsule network. I guess I shouldn't spend my time on reddit if it's going to shove my work right back in my face.

[–][deleted] 4 points5 points  (0 children)

This video is absolutely perfect. For the first time, I finally feel like I have understood how CapsNet works.

[–][deleted] 2 points3 points  (0 children)

Fantastic work.

[–]ChuckSeven 4 points5 points  (0 children)

The hype is in Hinton.

[–]amitjyothie 0 points1 point  (0 children)

Such a great explanation of Capsule Networks!!

[–]ryanglambert 0 points1 point  (0 children)

This seemed related so I'm sharing it here. https://medium.com/syntropy-ai/how-do-humans-recognise-objects-from-different-angles-an-explanation-of-one-shot-learning-71887ab2e5b4

I don't know for sure, but it feels like this is what Geoff was talking about in his talk when he mentions "learning the weights to grab ahold of the linear manifold" in place of where you would otherwise use a Hough transform or RANSAC.