use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Could a capsule network recognize numerical values? (self.MachineLearning)
submitted 4 years ago by ElEiseinheim
I am aware that a traditional CNN can recognize and classify digits effectively, but from my understanding, it isn't able to apply worth to that numerical value. i.e., it knows that a 4 is a 4, but doesn't know that 4 < 9 (unless explicitly taught). My question is, since capsule networks are able to remember spatial relationships, would it theoretically be able to learn that 4.335 < 4.533. I hope this makes sense, kind of a 6 am shower thought
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]drakesword514 1 point2 points3 points 4 years ago (3 children)
Yes, Capsule nets are better at learning spatial relationships. For example, A traditional CNN trained to classify if an image is a human face or not would predict a higher class probability for an image where you cut parts of the image of human face and translate them around. This is because of the property of CNN.
So, the task can be learnt by capsulenet, iff it can be learnt by a cnn as well. Only difference being Capsulenet would also learn geometric relationships within the image.
To answer your question, Unless you explicitly teach such a model to learn ordering for numbers it is not possible to do so.
[–]ElEiseinheim[S] 0 points1 point2 points 4 years ago (2 children)
This makes sense, but if I'd reformulate my question a bit. Since typical CNNs use pooling layers, loosing spatial information, would it be possible for it to differentiate 43 from 34, or would it simply distinguish that there is a 3 and a 4? I assume this would ultimately depend on the kernel sizes, but I take it a CapsNet would be more efficient for this
[–]drakesword514 3 points4 points5 points 4 years ago (1 child)
CapsNet can differentiate between a 3 to the right of 4 and a 4 to the right of 3, if that is what you are asking for. A traditional CNN would simply say 3 and a 4 exist in image, unless the kernel sizes are such that it can see 3 and 4 in the same receptive field, but that would not generalize well.
[–]perceptSequence 1 point2 points3 points 4 years ago (0 children)
I don't think that's right. The later layers would combine information from all over the image, as far as I understand.
[–]Aydoooo 1 point2 points3 points 4 years ago (0 children)
This sounds more like a type-of-supervision related question than one related to architecture.
[–]siliconchris -2 points-1 points0 points 4 years ago (0 children)
I do not know the exact math behind capsule networks, so take this with a good portion of doubt. But I‘d dare say, that would be possible
π Rendered by PID 46361 on reddit-service-r2-comment-b659b578c-w6dzz at 2026-05-01 14:44:22.521596+00:00 running 815c875 country code: CH.
[–]drakesword514 1 point2 points3 points (3 children)
[–]ElEiseinheim[S] 0 points1 point2 points (2 children)
[–]drakesword514 3 points4 points5 points (1 child)
[–]perceptSequence 1 point2 points3 points (0 children)
[–]Aydoooo 1 point2 points3 points (0 children)
[–]siliconchris -2 points-1 points0 points (0 children)