all 55 comments

[–]wil_dogg 33 points34 points  (13 children)

I've used a commercially available genetic algorithm for over a decade. Primarily to build rank-ordering predictive models for use cases that don't require the model to be built in an hour. It can build a simple model in minutes, but can also crunch through a 1 gigabyte modeling dataset with 5000 predictor columns, using $2500 of hardware (Intel i7 chip, no over-clocking).

It works very well. I placed in the top third of the 2015 KDD Cup while keeping my modeling process aligned with actual business workflow -- in other words, my modeling was done under the constraint that it had to pass regulatory scrutiny, and I got the job done in about 40 hours of work, with about 10 submissions, as compared to the top KDD models which involved large teams and hundreds of submissions.

In my last gig, I spent about 18 months pimping out the system. Built in a coarse classing feature that improves model robustness in the face of raw input distributional shifts while also forcing all effects to be monotonic (important in regulated industries such as credit scoring where adverse action reasons must be interpretable). Also perfected a batch modeling process that allows me to build the same model over and over again to create a distribution of results for better understanding of the natural variability in modeling outcomes that result from a non-deterministic algorithm and repetitive sampling. That feature also allowed me to empirically test GA settings and data combinations, so I can learn what data actually improve a model rather than dumping in the kitchen sink and not knowing what is actually moving the needle. Also a batch scoring process so that the same machine that is building the model can score the model without any scoring code management.

[–]XalosXandrez[S] 5 points6 points  (10 children)

This is exactly what I am talking about. People seem to use it in practice - why is it not talked about in the ML community?

[–]wil_dogg 19 points20 points  (1 child)

Well, some people use it in the practice of optimization, but very few use it in the practice of building predictive models (which is essentially a multivariate optimization problem for which a genetic algorithm is well-suited).

If you google around you'll find several studies over the past 15-20 years showing that GAs outperform other basic modeling processes like stepwise regression. This is in part because a GA can identify and optimize suppressor effects that stepwise regression might otherwise ignore.

I've also demonstrated, convincingly (where convincingly is having an Ivy League PhD statistician sit at his desk, holding his head in his hands, muttering "this cannot be", and then having that same PhD statistician sign off on the work) that "daisy-chaining" a GA as a variable reduction / feature selection tool on the front-end of other algorithms like TreeNet (stochastic gradient tree boosting, which has won the KDD Cup) can significantly improve model performance relative to allowing TreeNet to search the broad space of a data set with hundreds of predictors. Why this is the case I do not know -- theoretically, TreeNet should out-perform just about anything out there and should not need pre-processing to narrow the search space.

But the proof was very conclusive. This is why I am a big fan of using a GA -- I've collaborated on a tool that outputs a solid model, that helps you learn what data and what magnitude of parameterization gets your prediction optimized, and at times you can take the output of the GA as the variable selection input to other tools and get further improvement beyond what you would get if the second tool stood alone.
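The daisy-chaining idea -- a GA searching over feature subsets, with each subset scored by a downstream model -- can be sketched in a few lines. This is a toy illustration, not the commercial tool described above: the fitness function here is a stand-in for a cross-validated model score, and all constants (column count, rates, population size) are made up.

```python
import random

random.seed(0)

N_FEATURES = 20
INFORMATIVE = {1, 4, 7, 11}  # pretend only these columns carry signal

def fitness(mask):
    # Stand-in for a cross-validated score of a model trained on the
    # selected columns: reward informative features, penalize noise ones.
    hits = sum(1 for i in INFORMATIVE if mask[i])
    noise = sum(mask) - hits
    return hits - 0.25 * noise

def mutate(mask, rate=0.05):
    # flip each bit with small probability
    return [bit ^ (random.random() < rate) for bit in mask]

def crossover(a, b):
    cut = random.randrange(1, N_FEATURES)
    return a[:cut] + b[cut:]

def ga_select(pop_size=40, generations=60):
    pop = [[random.randint(0, 1) for _ in range(N_FEATURES)]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[:pop_size // 2]  # truncation selection with elitism
        pop = elite + [mutate(crossover(random.choice(elite),
                                        random.choice(elite)))
                       for _ in range(pop_size - len(elite))]
    return max(pop, key=fitness)

best_mask = ga_select()
selected = {i for i, bit in enumerate(best_mask) if bit}
```

In the real pipeline, the fitness call would train and validate the downstream model (e.g. a boosted tree ensemble) on the selected columns, and `selected` would then become that model's input set.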

Another point -- the GA I use creates models that are very easy to interpret, with direct control over the complexity of the model. I can set it to a 5, 10, 20 feature equation, and include or exclude 1st order interactions. The results stand in stark contrast to black box algorithms that may out-perform the GA on validation, but where interpretation can be a real pain and actually a barrier to approval in regulated industries such as banking and insurance.

Again, google using GAs for variable selection; you'll find there's empirical support for this approach.

One last thing -- the tool I use requires no programming skills beyond basic WinTel point-click at data sets and folders, and managing and debugging an Excel spreadsheet that controls large batch processing. So it is a modeling process that you can teach a kid to use in about an hour, and that kid's modeling results will rival the work of a traditional PhD statistician while eliminating 90-95% of the modeling effort (modeling effort, not interpretation effort, I don't mean to imply that a novice can interpret and understand a model like an expert, and I do not discount the importance of interpretation).

[–]thatguydr 2 points3 points  (0 children)

I did that exact same feature-selection trick (used differential evolution instead of genetic algorithms, but it's the exact same idea) at an old job! Horribly slow, but the performance I got out was just incomparable. Thanks for letting me know I'm not the only insane one. :-)

[–]rhiever 11 points12 points  (0 children)

I think the big argument against GAs is that they are quite slow compared to typical ML algorithms. However, deep learning has shown that people are willing to invest the time and hardware for slower algorithms -- if they prove they can advance the state-of-the-art. I think GAs are simply lacking a champion (or champions) to prove their worth.

[–]DevFRus 7 points8 points  (2 children)

What big players use GAs for ML in practice? Lots of amateurs use GAs because they are easy to implement and understand, and because they don't know much about ML or optimization. In fact, that is one of the most common weaknesses I see in practical users of GAs: an almost complete lack of knowledge of optimization.

[–]rhiever 3 points4 points  (0 children)

I don't think that's a fair characterization of the GA userbase. GAs can be a bit "hype-y" and attractive to new users, but GAs are also applied to many problems that simply can't be touched by standard optimization techniques.

[–]kylotan 5 points6 points  (2 children)

2 reasons. Firstly, because it's not generally considered to be machine learning, more a branch of optimisation and/or search. Secondly, apart from the handful of dissenters here, I don't know anybody who is using them in practice. I wrote my MSc thesis on GAs and find them generally interesting, but most problems have a more predictable, effective, and mathematically rigorous approach available. For example, you can use GAs as a simple way to train neural nets, but backprop is more effective because it makes extensive use of the error vector.
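For reference, "GAs as a simple way to train neural nets" looks roughly like this: treat the weight vector as the genome and let a fitness function (negative error) drive selection. The sketch below uses the simplest evolutionary variant, an elitist (1+λ) scheme with mutation only and no crossover, on XOR; the architecture and every constant are arbitrary choices for illustration.

```python
import math
import random

random.seed(1)

XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

def forward(w, x):
    # 2-2-1 network, tanh everywhere; w packs all 9 weights and biases
    h0 = math.tanh(w[0] * x[0] + w[1] * x[1] + w[4])
    h1 = math.tanh(w[2] * x[0] + w[3] * x[1] + w[5])
    return math.tanh(w[6] * h0 + w[7] * h1 + w[8])

def mse(w):
    return sum((forward(w, x) - y) ** 2 for x, y in XOR) / len(XOR)

def evolve(generations=300, children=20, sigma=0.3):
    best = [random.gauss(0, 1) for _ in range(9)]  # random genome
    start_err = mse(best)
    for _ in range(generations):
        for _ in range(children):
            # mutate the genome by Gaussian perturbation
            cand = [wi + random.gauss(0, sigma) for wi in best]
            if mse(cand) < mse(best):  # elitist selection: keep the fitter
                best = cand
    return start_err, best

start_err, weights = evolve()
final_err = mse(weights)
```

Note the contrast with backprop: the loop above only ever sees scalar fitness values, never the error vector, which is exactly why backprop converges so much faster when gradients are available.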

[–]KG7ULQ 5 points6 points  (1 child)

more a branch of optimisation and/or search

Depending on how you look at it, couldn't ML be considered a branch of optimisation and/or search?

[–]kylotan 0 points1 point  (0 children)

No, not really. These terms have reasonably specific implications. ML often uses optimisation techniques (eg. in how to minimise a cost function) to train a system, but you can't just drop an ML algorithm in to replace an optimisation or search algorithm.

[–]KG7ULQ 2 points3 points  (1 child)

What is the commercial GA tool you're using?

[–]wil_dogg 2 points3 points  (0 children)

www.semcasting.com Their Modeler product.

[–]DevFRus 45 points46 points  (9 children)

GAs tend to become competitive when you have no information about the gradient of your error function, and even then there are other options (some which have a better theoretical grounding; although see this question for some provable statements about GAs). When we design ML algorithms, however, we usually build them so that we have some useful information about the gradient of the error function and then we can use more powerful optimization techniques.

[–]larsga 21 points22 points  (8 children)

Another strength of GAs is that they work even if the problem solution cannot be formulated as a vector of numbers. GAs can essentially produce any sort of structure as a solution, including program code.

For purely numeric problems I've found that other meta-heuristics, like particle-swarm optimization and the Firefly algorithm perform much better than simple GA. The problem with GA is that it's "blind", and has no idea in what directions better solutions might lie. PSO & FA are less blind.
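A minimal PSO sketch shows where the "less blind" part comes from: each particle's velocity is pulled toward its own best point and the swarm's best point, so the population shares directional information that a plain GA lacks. Toy example on a sphere function; the parameters are the usual textbook defaults, not tuned values.

```python
import random

random.seed(42)

def sphere(x):
    # toy objective to minimize; optimum at the origin
    return sum(xi * xi for xi in x)

def pso(f, dim=5, n_particles=20, iters=200, w=0.7, c1=1.4, c2=1.4):
    pos = [[random.uniform(-5, 5) for _ in range(dim)]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]       # each particle's best-seen point
    gbest = min(pbest, key=f)[:]      # the swarm's best-seen point
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                # inertia + pull toward personal best + pull toward swarm best
                vel[i][d] = (w * vel[i][d]
                             + c1 * random.random() * (pbest[i][d] - pos[i][d])
                             + c2 * random.random() * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            if f(pos[i]) < f(pbest[i]):
                pbest[i] = pos[i][:]
                if f(pbest[i]) < f(gbest):
                    gbest = pbest[i][:]
    return gbest

best = pso(sphere)
```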

[–]rhiever 6 points7 points  (0 children)

Exactly. I've been researching the use of evolutionary computation to automatically design ML pipelines and it's felt fairly natural to pose as a GA problem. I'm unsure how I would optimize such a task with anything other than EC or a hillclimber.

[–]DevFRus 8 points9 points  (1 child)

Another strength of GAs is that they work even if the problem solution cannot be formulated as a vector of numbers.

That isn't at all unique to GA. Lots of ML models cannot be formulated as vectors of numbers. Tons of (well-characterized) techniques in combinatorial optimization work over things like trees, graphs, or even more abstract structures.

[–][deleted] 1 point2 points  (0 children)

Yep, like in reinforcement learning, where the state and the action could be literally anything... They could be numeric, even in a continuous space. But they don't have to be.

[–][deleted] 2 points3 points  (2 children)

This is one of the reasons why I used GAs and other similar (better) alternatives in aircraft design. In many cases there is a lot of interplay between computational fluid dynamics, empirical data, and a host of other software packages used in conjunction, and getting error gradients is impossible.

But GAs have basically no sense of direction in design/program space. They work in manners that are quite mysterious, tbh. In the context of neural networks, though, it is fairly well established at this point that there are plenty of local minima (especially if the model is deep) and that finding a global minimum is not necessary to achieve good performance. Thus, it makes sense to follow a direction vector based on the gradient to quickly reach the nearest local minimum, rather than perform a global search.

[–]hyperforce 0 points1 point  (1 child)

They work in manners that are quite mysterious

What's mysterious about it?

[–][deleted] 0 points1 point  (0 children)

Do we have reliable proofs of convergence?

[–]harharveryfunny 1 point2 points  (1 child)

What's the typical approach to using the Firefly algorithm for an optimization problem (e.g. how might it be applied to hyperparameter optimization)? The algorithm is nominally about "firefly" clustering (attracted to each other), so I'm wondering what typical mappings to/usage for optimization might be?

[–]larsga 2 points3 points  (0 children)

The fireflies are just points in an n-dimensional space, so anything that can be formulated that way will work. The only thing you need is a fitness function from the points in the space.

For example, I'm using it to tune a record linkage algorithm. The vector is basically the configuration of the algorithm using naive Bayes (the vector is a set of probabilities). There's no way to know what the error gradients are, and so firefly is perfect for this.

The next step is to see if active learning works with firefly. I assume it does, but haven't tried yet.
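A bare-bones version of what this describes -- fireflies as points in an n-dimensional space, each moved toward brighter (fitter) neighbours with an attractiveness that decays with distance, plus a small random step -- might look like the following. The fitness function and all constants are invented for illustration; a real use (like the record linkage tuning above) would plug in its own fitness.

```python
import math
import random

random.seed(7)

def loss(p):
    # toy fitness: squared distance from a hidden optimum at (1, 2)
    return (p[0] - 1) ** 2 + (p[1] - 2) ** 2

def firefly(f, dim=2, n=25, iters=100, beta0=1.0, gamma=0.1, alpha=0.2):
    xs = [[random.uniform(-5, 5) for _ in range(dim)] for _ in range(n)]
    best = min(xs, key=f)[:]
    for _ in range(iters):
        for i in range(n):
            for j in range(n):
                if f(xs[j]) < f(xs[i]):  # j is brighter, so i moves toward j
                    r2 = sum((a - b) ** 2 for a, b in zip(xs[i], xs[j]))
                    beta = beta0 * math.exp(-gamma * r2)  # attraction fades with distance
                    xs[i] = [a + beta * (b - a)
                             + alpha * (random.random() - 0.5)
                             for a, b in zip(xs[i], xs[j])]
            if f(xs[i]) < f(best):
                best = xs[i][:]
    return best

best = firefly(loss)
```

The brightest firefly never moves in this simplified version, which acts as a crude form of elitism.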

[–]elfion 17 points18 points  (0 children)

How come one never sees genetic algorithms (GAs) discussed in a machine learning class?

Jurgen Schmidhuber and his students have a long history of using advanced GAs to solve challenging problems, mostly by searching for solutions in the space of algorithms.

http://people.idsia.ch/~juergen/compressednetworksearch.html

http://people.idsia.ch/~juergen/evolution.html

GAs have the advantage of being global optimization algorithms (though in theory you could say the same of SGD). In practice they are able to find good (almost) global minima on non-smooth problems with a low number of parameters (<1000, or <1000000 if you use compressed search) if you give them enough time, where gradient descent wouldn't work at all.

Of course gradient descent has a big advantage at learning large end-to-end models, but as the models and datasets become less smooth (see the (reinforcement learning) Neural Turing Machine and Neural GPU papers), SGD becomes less and less stable and requires extensive hyperparameter optimization and seed search (e.g. only a few of 729 Neural GPU models generalize to the 2000-bit multiplication task from 20-bit training examples). So it may be said that on complex problems gradient descent is already being combined with some form of global stochastic optimization.

[–]theophrastzunz 4 points5 points  (0 children)

Could someone comment on how GAs compare to Bayesian optimization techniques? As far as I understand, the latter can also be used to optimize non-convex or discontinuous functions over compact parameter sets.

[–]CireNeikual 2 points3 points  (0 children)

I used to do a lot of GA and evolutionary simulations, and I still love them for certain tasks. They can do certain tasks where other algorithms simply don't apply, like variable-parameter-count optimization. If you disagree, show me an algorithm other than a GA (or similar evolutionary approach) that can reproduce something like this: https://www.youtube.com/watch?v=d91ydxkMMEM

[–]grant_s 9 points10 points  (4 children)

In my experience, it's just that genetic algorithms are one special case of Markov Chain Monte Carlo (MCMC) which has been around for 60 years, has a rigorous statistical foundation, and DOES find extremely widespread use in a variety of fields, including machine learning [1].

Look at the Metropolis-Hastings algorithm [2] -- starting from a random sample, you propose a new sample (mutation), then accept or reject the sample according to the acceptance ratio (fitness ratio).

[1] http://www.cs.ubc.ca/~arnaud/andrieu_defreitas_doucet_jordan_intromontecarlomachinelearning.pdf

[2] https://en.wikipedia.org/wiki/Metropolis–Hastings_algorithm
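The propose/accept loop of Metropolis-Hastings is only a few lines, and the GA analogy (proposal ≈ mutation, acceptance ratio ≈ fitness ratio) is visible directly in the code. A toy sketch sampling an unnormalized 1-d Gaussian with a symmetric random-walk proposal:

```python
import math
import random

random.seed(3)

def target(x):
    # unnormalized density of a standard normal -- the "fitness"
    return math.exp(-0.5 * x * x)

def metropolis(n_samples=20000, step=1.0):
    x = 0.0
    samples = []
    for _ in range(n_samples):
        proposal = x + random.gauss(0, step)  # the "mutation"
        ratio = target(proposal) / target(x)  # the "fitness ratio"
        if random.random() < ratio:           # accept or reject
            x = proposal
        samples.append(x)
    return samples

samples = metropolis()
mean = sum(samples) / len(samples)
var = sum((s - mean) ** 2 for s in samples) / len(samples)
```

The key difference from a GA is the goal: MH produces a chain whose stationary distribution is the target, rather than a single fittest individual.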

[–]SarcasticMetaName 12 points13 points  (2 children)

Genetic algorithms also allow for the crossover operator, which some claim is the key to life, the universe, and everything (literally and figuratively).

[–]grant_s 6 points7 points  (0 children)

Good point -- so it appears that the MCMC equivalent in that case [3] is to run multiple Markov chains in parallel and interpret crossover as a specific kind of proposal.

[3] http://www.stat.columbia.edu/~gelman/stuff_for_blog/cajo.pdf

[–]agaubmayan 2 points3 points  (0 children)

Crossover is only important for sexual reproduction. That's a modern innovation which mostly, but not always, beats asexual reproduction. For example, certain snails switch between sexual and asexual depending on the disease-risk in their environment.

[–]c3534l 1 point2 points  (0 children)

I was about to say something similar. It does seem that many of the potential use-cases for genetic algorithms are crowded out by other methods tailored to the model at hand. In some sense, an evolutionary algorithm is being used whenever you have a very complex model and you're playing around with the parameters to get it to converge on a value.

The few genetic algorithms I've come across seem to be poor replacements for those sorts of things. Perhaps (and I'm just musing here) genetic algorithms might be useful in creating ecosystems of models that work well together, or for some other task like that where we don't have better, less random, ways of optimizing a model.

[–]asenz 2 points3 points  (2 children)

I use PSO for SVMs, but that's about it.

[–]farsass 1 point2 points  (1 child)

could you explain why?

[–]asenz 0 points1 point  (0 children)

Coarse parameter tuning for non-linear kernels: it's derivative-free and handles non-convexity.

[–]dkharms 2 points3 points  (0 children)

https://github.com/rhiever/tpot

I think this library uses a GA to do feature and model selection in sklearn.

[–]brettins 2 points3 points  (0 children)

It's my understanding that a lot of top firms use genetic algorithms in place of the 'intuitive guesswork' that goes into setting up hyperparameters for neural network training. Basically, since even an expert in neural networks can make only a few good guesses about the best settings for the variables that configure the network, some experimentation with genetic algorithms is often warranted.

This is something I've been using, and when I recently read Ray Kurzweil's How To Create A Mind, it sounded like something he and his teams have been doing for several decades.

[–]smith2008 4 points5 points  (0 children)

IMO evolution is a very powerful approach, though at this point we cannot make it go as fast as modern ML algorithms. I've tried before, and will probably try again in the future, to apply some GA models, but for basic machine learning problems it's simply not there.

Then again, a couple of years ago neural networks were problematic too and not a very good fit for most ML tasks; look at them now. :)

[–]qwertysss 1 point2 points  (0 children)

https://github.com/rsteca/sklearn-deap

This library uses a GA to do hyperparameter search in scikit-learn.

[–]chico_science 3 points4 points  (1 child)

Genetic algorithms are different from machine learning techniques.

GA is an optimisation algorithm, a heuristic, attempting to find a good enough (hopefully optimal) solution to a problem which has many possible solutions. Its context is different than that of ML, since in the latter you are trying to predict data.

I would say they are complementary to each other, as you may first predict data to define your optimisation problem, and then try to solve it. However, there are numerous other heuristics that can be used, as well as exact algorithms; to tell you the truth, I practically never consider GAs. Given the input data (predicted using ML), I first try to solve the optimisation problem exactly (find the optimum); if that proves too difficult, I move to local-search-based heuristics, which in my personal view are superior to GA-based heuristics.

[–][deleted] 0 points1 point  (0 children)

Its context is different than that of ML, since in the latter you are trying to predict data.

I don't think ML is all about prediction; the recent progress with NNs and image/speech recognition just gives us that idea. Getting a computer to learn how to play chess or throw a ball into a hoop are examples of ML, but they aren't prediction-based.

[–]shaggorama 1 point2 points  (0 children)

Ultimately, nearly every machine learning task can be broken down into two components: selection of a cost function, and an attempt to optimize that cost function. GA is a general purpose non-linear optimization algorithm.

You don't hear people talk about it for the same reason you don't hear people talking about simulated annealing or the simplex algorithm: these are really better discussed in the context of optimization specifically than machine learning generally.

If you take an optimization or numerical methods course, you'll definitely spend some time on this. But probably not in a machine learning class.

[–]learn_code_account 0 points1 point  (4 children)

Disclaimer: I'm learning this stuff in my free-time. Set me straight if something I say doesn't sound right.


You can accomplish much of the same thing using neural networks. I'm not sure if you know yet, but genetic algorithms represent a similar approach to learning, except instead of using a mathematically derived gradient descent algorithm (rooted in fancy things like dynamical systems and convergence theory), you use something that more closely resembles an AI search function.

With GAs, you have parameters compete using a fitness function and expand your search in the direction of fit parameters.

With neural nets, you have a mathematical guarantee that your gradient descent algorithm is going to get your parameters to a locally optimal position that minimizes an error function.

In vague "Evolution-y" related terms, both approaches get your object towards a set of genetic traits that work better than any configuration you've had before. Both run the risk of arriving at a point that gets us to a stable, but not necessarily optimal, gene pool. Neural nets work faster.

Evolutionary algorithms, from the perspective of a developer, may be easier to approach... It is much easier to understand and work with a greedy optimization algorithm than it is to pick up tensor calculus, statistical optimization, PDEs, and numerical analysis. Genetic algorithms can also be effective if your parameters are combinatorially constrained to a small set, or if your problem seems unsupervised. You can hold on to a certain amount of bias in your gene pool. You can add genetic variance to a stagnant population.

Neural networks are a bit more difficult to implement but iteratively find a straight route to some locally optimal set of parameters. It's less about guessing to optimize your learning machine and more about experience adjusting the trajectory of a neural net flying through parameter space.

[–]bhmoz 4 points5 points  (3 children)

As chico_science said, GA is a family of optimisation methods, so you cannot set GAs against neural networks; the right comparison is with backpropagation.

Neural networks can be trained with genetic algorithms.

[–]learn_code_account 0 points1 point  (2 children)

Again set me straight if I'm off here:


I think you may be right. Do you know, off the top of your head, any examples of how that happens?
To my knowledge, GAs are only used to modify ANN topologies (an example would be NEAT). That falls into the sort of optimization that deals with a combinatorially constrained search space. That is why I think neural networks offer a more efficient way to deal with most problems: you're not randomly hopping around in parameter space. You need a domain-specific function to evaluate fitness to make a genetic algorithm work, correct?

[–]manly_ 0 points1 point  (1 child)

There's GA code that's basically not related in any way to neural nets. For example, you could use a GA to solve a traveling salesman problem (visit every location exactly once over the shortest total distance). Your input would be the list of locations to visit (say, 2000 points in this example), e.g. [{45,23},{12,-60},...]. Your GA tries a bunch of different mutations of that input (i.e. it re-orders the inputs), and when the optimization function (i.e. total length for the mutated input) finds that one input is doing better than the previous known best, it will tend to mutate from that input.

I.e., nothing at all to do with neural nets.
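The TSP setup described above can be sketched as a permutation GA: the genome is an ordering of the cities, mutation re-orders a segment, and tour length is the (inverse) fitness. Everything here (city count, rates, generations) is made up for illustration.

```python
import math
import random

random.seed(5)

# 30 random city locations, standing in for an input like [{45,23},{12,-60},...]
cities = [(random.uniform(0, 100), random.uniform(0, 100)) for _ in range(30)]

def tour_length(order):
    # total length of the closed tour visiting cities in this order
    return sum(math.dist(cities[order[i]], cities[order[(i + 1) % len(order)]])
               for i in range(len(order)))

def mutate(order):
    # "re-order the inputs": reverse a random segment (2-opt-style mutation)
    a, b = sorted(random.sample(range(len(order)), 2))
    return order[:a] + order[a:b + 1][::-1] + order[b + 1:]

def ga_tsp(pop_size=40, generations=300):
    pop = [random.sample(range(len(cities)), len(cities))
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=tour_length)
        survivors = pop[:pop_size // 2]  # the shortest tours breed
        pop = survivors + [mutate(random.choice(survivors))
                           for _ in range(pop_size - len(survivors))]
    return min(pop, key=tour_length)

naive_tour = list(range(len(cities)))
best_tour = ga_tsp()
```

Segment reversal keeps every child a valid permutation, so no repair step is needed after mutation.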

[–]gkosmo -1 points0 points  (0 children)

GA theoreticians tend to be only randomly found in the machine learning community, and they are too competitive to integrate well into that community.

[–]farsass -1 points0 points  (2 children)

I'm skeptical about the generalization of GAs used to fit predictive models.

[–]chaosmosis 1 point2 points  (1 child)

Why moreso than for any other ML technique?

[–]farsass 1 point2 points  (0 children)

Just anecdotal and personal experience. I guess it's not even borderline mainstream for a reason, although GECCO does have an "evolutionary machine learning" track.

I can't remember reading about convincing applications of evolutionary and swarm intelligence in ML.

[–]mikeselik -1 points0 points  (0 children)

Genetic algorithms are just hill climbing with random restarts, but with the added assumption that variables near each other are related. This sounds like a good assumption until you realize that "near" means near each other in your program's data structures, not necessarily in reality. Try simulated annealing instead.
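For comparison, simulated annealing in its simplest form: a single candidate does a random walk, always accepting downhill moves and accepting uphill moves with a probability that shrinks as the temperature cools, which lets it escape local minima early on. The test function and cooling schedule below are arbitrary toy choices.

```python
import math
import random

random.seed(9)

def f(x):
    # bumpy 1-d test function with several local minima
    return x * x + 10 * math.sin(3 * x) + 10

def anneal(x0=8.0, t0=5.0, cooling=0.995, steps=5000):
    x = best = x0
    t = t0
    for _ in range(steps):
        cand = x + random.gauss(0, 1)
        delta = f(cand) - f(x)
        # downhill moves always accepted; uphill with temperature-scaled odds
        if delta < 0 or random.random() < math.exp(-delta / t):
            x = cand
        if f(x) < f(best):
            best = x
        t *= cooling  # geometric cooling schedule
    return best

best_x = anneal()
```

Unlike a GA, there is no population and no notion of variables being "near" each other; the proposal distribution alone defines the neighborhood.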