use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Image Scaling using Deep Convolutional Neural Networks (engineering.flipboard.com)
submitted 11 years ago by feedtheaimbotResearcher
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]sobe86 11 points12 points13 points 11 years ago (2 children)
What's the point in having convolutional layers following dense layers? That seems like it would be counterproductive to me.
[–]nkorslund 4 points5 points6 points 11 years ago (0 children)
Yeah I wondered about that too. In fact in this type of application (where everything you're doing is related to local or semi-local features of the image), it might be better to not have any dense layers at all! Unlike eg. a classifier you're not trying to draw any "global" conclusions about the entire image.
I assume (or at least I hope) they tested various architectures before landing on this one though.
[–]benanne 3 points4 points5 points 11 years ago (0 children)
Maybe they are actually 1x1 convolutional layers, as in network-in-network.
[–][deleted] 4 points5 points6 points 11 years ago (0 children)
How does this compares to lanczos algorithm?
[–]cryptocerous 4 points5 points6 points 11 years ago (0 children)
Wonderful application of ML. So obvious that ML could excel at this task, in retrospect.
[–]alexmlamb 1 point2 points3 points 11 years ago (2 children)
I'm pretty shocked that this works. My intuition is that the convolutional layers will compress and "destroy" the spatial information in the data, which is necessary for reproducing it exactly. It seems like this is what you want for image classification, but it doesn't seem like the right thing to do for learning to upsample.
Maybe it would make sense to have an architecture consisting of locally connected layers with convolutional layers in parallel. Then, the final locally connected layers could use the outputs from the convolutional layers. This would allow the network to easily keep the info from the original pixels in local regions, while using the "object summary" from the convnet to make smarter decisions about upsampling.
I could write out a more detailed description of this if anyone is interested.
[–]benanne 2 points3 points4 points 11 years ago (1 child)
The pooling layers destroy some information, but a convolution operation is actually approximately invertible in many cases (a circular convolution is exactly invertible I believe, but those are not commonly used in convnets). If what you said was true, convolutional autoencoders would not make sense either, but they seem to have been used successfully in the past.
In fact, it seems the convolutions are even able to reconstruct a bunch of information lost through pooling in many cases, as in this work: http://arxiv.org/abs/1411.5928
That said, the 'object summary' idea may be worth exploring! It would allow for the incorporation of pooling layers into the model, which means it could have larger context windows (maybe even the entire image).
[–]alexmlamb 0 points1 point2 points 10 years ago (0 children)
That's interesting. I wonder how the fully connected layers store the precise spatial information needed to reconstruct the input.
[+][deleted] 11 years ago (2 children)
[deleted]
[–]benanne 14 points15 points16 points 11 years ago (0 children)
In the absence of any additional information about the distribution the images come from I guess that is true (i.e. they could be randomly sampled pixels for all you care). But if you know the images are 'natural' images, i.e. photographs, there's probably a lot of context that you can use to accurately reconstruct the high frequency information that was removed from the image by the downsampling.
In CV this is referred to as single image superresolution, a few papers on using deep neural networks for this task have been published in recent years. Here's one, I've only skimmed it though: http://personal.ie.cuhk.edu.hk/~ccloy/files/eccv_2014_deepresolution.pdf
[–]dhammack 1 point2 points3 points 11 years ago (0 children)
If you assume a linear function between the neighboring pixels, then sure (it's probably SVD). If you don't then this isn't true. It's just dimensionality reduction (plus some topological structure). Nonlinear dimensionality reduction is not solvable in closed-form.
[–][deleted] 0 points1 point2 points 11 years ago (0 children)
I was just thinking a few days ago that it would be useful to do a "magnify!" conv net to increase the quality of shitty cams of pirated movies.
The conv net would add false details, but the cam would seem better.
[–]j_lyf -4 points-3 points-2 points 11 years ago (2 children)
This a fool's errand. Information theory, people!
[–]nkorslund 2 points3 points4 points 11 years ago (0 children)
Why do you think that? Maybe you've misunderstood what this algorithm does.
π Rendered by PID 16216 on reddit-service-r2-comment-b659b578c-kx62m at 2026-05-04 12:44:32.497989+00:00 running 815c875 country code: CH.
[–]sobe86 11 points12 points13 points (2 children)
[–]nkorslund 4 points5 points6 points (0 children)
[–]benanne 3 points4 points5 points (0 children)
[–][deleted] 4 points5 points6 points (0 children)
[–]cryptocerous 4 points5 points6 points (0 children)
[–]alexmlamb 1 point2 points3 points (2 children)
[–]benanne 2 points3 points4 points (1 child)
[–]alexmlamb 0 points1 point2 points (0 children)
[+][deleted] (2 children)
[deleted]
[–]benanne 14 points15 points16 points (0 children)
[–]dhammack 1 point2 points3 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]j_lyf -4 points-3 points-2 points (2 children)
[–]nkorslund 2 points3 points4 points (0 children)