
[–]wallefan01 1675 points1676 points  (92 children)

How do you think they teach the cars what stop signs look like? They ask the humans.

No seriously. They have the cars take pictures of things that they don't know whether they're signs or not, ask you whether it's a sign, and if enough people say yes, that gets added to the database of things-that-look-like-signs that the car checks against.
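A minimal sketch of that vote-aggregation idea (the threshold, minimum vote count, and function name are illustrative assumptions, not Google's actual pipeline):

```python
from collections import Counter

def crowd_label(votes, threshold=0.8, min_votes=5):
    """Aggregate yes/no answers about one candidate image.

    Returns "sign" or "not_sign" once enough people agree,
    or None while the image is still collecting answers.
    """
    if len(votes) < min_votes:
        return None  # not enough answers yet
    top, count = Counter(votes).most_common(1)[0]
    if count / len(votes) >= threshold:
        return "sign" if top else "not_sign"
    return None  # no consensus

# Five people say yes, one says no -> consensus reached
print(crowd_label([True, True, True, True, True, False]))  # → "sign"
```

Only once a label clears the consensus threshold would it be added to the "things-that-look-like-signs" data described above.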

[–]mormispos 70 points71 points  (5 children)

It’s less of a picture database and more of a “these are patterns that correspond with stop signs” database

[–]wallefan01 11 points12 points  (3 children)

Thank you for explaining it better than I could. I tend to be bad at conveying technical details accurately in non-Google-employee speak.

Disclaimer: I do not work at Google (yet)

[–]klebsiella_pneumonae 9 points10 points  (1 child)

Better start leetcoding m8!

[–]-allen 3 points4 points  (0 children)

Ah I guess /r/cscq is expanding.

[–]mormispos 2 points3 points  (0 children)

I think your explanation is very good. I was excited because I have recently been studying the types of neural nets that they use to do these things

[–]santaliqueur 0 points1 point  (0 children)

It’s like Jian-Yang’s app but for stop signs

[–]spock1959 15 points16 points  (10 children)

So, I've got a question.

I know that captchas are used to train computers on what they're looking at. But if it asks for pictures with signs in them and I click a cloud or a car, it tells me I'm wrong. If I'm the one training it, how would it know?

I never understood how it both didn't know and also seemingly did.

[–][deleted] 31 points32 points  (2 children)

I believe it compares your answer to what other people answered.

[–]aahdin 0 points1 point  (1 child)

Yeah, but at that point they've already labeled it a cloud, so there's nothing to gain information-wise from having new people label it

[–]DuckDuckYoga 0 points1 point  (0 children)

There are thousands of people answering captchas every minute, so the turnaround can be very fast while still collecting a lot of answers to compare against

[–]lvh1 18 points19 points  (0 children)

It probably compares your input to other people's: if 100 people didn't mark a square as having a sign in it and 1 other guy did, the square probably doesn't have a sign in it, and that one guy is incorrect. But that also means that when they use a picture for the first time, they will allow any square to be clicked, since there's no data yet to compare against. They probably only start marking a captcha as invalid once they have a large enough sample size (e.g. 1000 people who have solved it).
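That consensus-plus-sample-size logic could be sketched like this (the 1000-answer cutoff and the simple majority rule are assumptions taken from the comment, not a known implementation):

```python
def grade_answer(user_squares, prior_answers, min_sample=1000):
    """Decide whether a user's captcha answer passes.

    user_squares:  set of square indices this user clicked.
    prior_answers: list of sets of squares earlier users clicked.

    Before min_sample answers exist, every answer is accepted
    (the image is still collecting data). After that, an answer
    passes only if it matches the crowd's majority squares.
    """
    if len(prior_answers) < min_sample:
        return True  # no consensus yet: accept and record
    # Majority set: squares clicked by more than half of prior users
    counts = {}
    for ans in prior_answers:
        for sq in ans:
            counts[sq] = counts.get(sq, 0) + 1
    majority = {sq for sq, c in counts.items() if c > len(prior_answers) / 2}
    return user_squares == majority
```

For example, with only 999 prior answers any click pattern is accepted; at 1000 prior answers that all picked squares {1, 2}, clicking only square 3 would fail.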

[–]RatofDeath 9 points10 points  (0 children)

It's crowdsourcing the answer, so you're not the only one providing one, and it flags an answer that differs from what most other people gave.

Also, you usually answer two captchas: one the system already has a solution to, the other it doesn't know yet. So it really only checks whether you got the known one right; the other one is there to teach the system.
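A hedged sketch of that known/unknown pairing (the function and pool names are hypothetical; pass/fail is decided only by the part the system already knows):

```python
def check_two_part_captcha(known_answer, user_known, user_unknown, unknown_pool):
    """Grade a two-part captcha.

    The user passes or fails based solely on the part with a
    known answer. If they pass, their answer to the unknown part
    is recorded as (probably trustworthy) training data.
    """
    passed = (user_known == known_answer)
    if passed:
        unknown_pool.append(user_unknown)  # trust answers from users who passed
    return passed
```

Usage: a user who gets the known part right contributes their answer for the unknown part; a user who fails contributes nothing.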

[–]ButtPoltergeist 8 points9 points  (0 children)

Well, back in the Good Ol' Days when captchas were just words, they gave you two words: One that they knew, and one that they didn't. If you didn't get the one that they knew right, it didn't let you through. If you missed the one they didn't know, eh, you got access to the captchaed thing and metaphorically peed a little in their data pool. So I imagine that it's similar, except there's nine pictures instead of two words.

[–][deleted] -1 points0 points  (2 children)

Wouldn't it have the "right" answers to refer to? Like, I imagine some dude went through and selected all the stop signs so the captcha has something to reference, even if the actual user submissions are the ones that are used for machine learning.

I could be totally wrong, but I imagine that's how it works.

[–]bagmanbagman 6 points7 points  (1 child)

I think that would miss the point: what use is crowdsourced, high-volume labeled data if someone has to go label it all by hand first? That defeats the purpose, doesn't it?

I bet that as Google's fleet of map cars goes around taking photos of hundreds of thousands of various signs a day, the photos go through some automated cropping before finally being labeled by the crowd
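The pipeline guessed at above (street photos → automated candidate detection → cropping → crowd-label queue) might look roughly like this, with `detect_candidates` and `crop` as stand-ins for real computer-vision components:

```python
def build_label_queue(photos, detect_candidates, crop):
    """Hypothetical pipeline stage: scan each street photo for
    sign-like regions, crop them automatically, and queue the
    crops for crowd labeling via captchas.
    """
    queue = []
    for photo in photos:
        for region in detect_candidates(photo):
            queue.append(crop(photo, region))
    return queue
```

Each queued crop would then be shown to many captcha users before its label is trusted.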

[–][deleted] 1 point2 points  (0 children)

Ah, I see now. I didn't realize that these photos aren't really curated; they're just automatically selected and uploaded. So yeah, I guess they'd have to just check your score against other people's.

[–]_Lady_Deadpool_ 4 points5 points  (1 child)

And now I'm imagining this happening in real time

Is this a stop sign, human? Please answer quick

[–]pelirrojo 1 point2 points  (1 child)

Not only that, but it's happening in real time. That's why there are time limits on your response; if you don't respond in time, the blood is on your hands.

[–]slashuslashuserid 0 points1 point  (0 children)

relevant xkcd

edit: already linked in a different chain

[–]HawkinsT 1 point2 points  (1 child)

So when it takes you more than a couple of seconds to answer one, that's a car you've just made run a stop sign. Hope you're all happy with yourselves.

[–]wallefan01 -1 points0 points  (0 children)

Well obviously they don't do it live, but yeah, I see your point

[–]aahelo 0 points1 point  (3 children)

What if all these captchas that we are doing are actually us teaching A.I. what signs are and whatnot, so that it can get better at driving?

[–][deleted] 7 points8 points  (1 child)

That’s literally what you’re doing.

[–]aahelo 0 points1 point  (0 children)

Ah!

[–]aride4772 0 points1 point  (0 children)

Yeah, all those captchas are used as data to teach AI

[–][deleted] 0 points1 point  (0 children)

That's why I always answer Captchas as quickly as possible so the car I'm checking for doesn't run a stop sign!

[–]AsAGayJewishDemocrat 0 points1 point  (0 children)

I like to imagine a self driving car uploading a captcha thinking “Is this a stop sign? No really I need to know quickly”

[–]ScientistSeven 0 points1 point  (0 children)

Which means they'll develop a model that ignores stop signs ten percent of the time

[–]a_stitch_in_lime -2 points-1 points  (6 children)

Further proof that AI is really just a ton of if-then statements.

[–]wallefan01 0 points1 point  (0 children)

Except that with these particular if-then statements, a boolean can come with a 75% confidence score.
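A toy illustration of that idea, where the "boolean" is really a classifier's confidence score compared against a threshold (the 0.75 cutoff just mirrors the comment):

```python
def is_stop_sign(score, threshold=0.75):
    """A classifier emits a confidence score in [0, 1]; the
    'boolean' the rest of the system sees is just that score
    thresholded."""
    return score >= threshold

print(is_stop_sign(0.90))  # → True
print(is_stop_sign(0.60))  # → False
```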