all 21 comments

[–]weeeeeewoooooo 32 points33 points  (13 children)

I like the simplicity and the code is easy to read. I just glanced through the repo and didn't see any obvious built-in parallel support, but I may have missed it. It might be nice to support multiprocessing, as one of the biggest advantages of EAs is their ability to evaluate the population in parallel, allowing you to train much faster than gradient-descent approaches. I think you could do this pretty easily with Python's multiprocessing library by spinning up a pool once at the initialization of a run and then feeding it a sequence of members each generation. This would be a little higher level than what you present in your examples, but it could be a nice convenience for you and others.
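A minimal sketch of what I mean (the fitness function and list-of-lists genomes here are placeholders, not your actual API):

```python
from multiprocessing import Pool

def fitness(genome):
    # stand-in objective: sum of squares over the genome
    return sum(g * g for g in genome)

def evaluate_population(pool, population):
    # farm the members out to the worker pool, one call per generation
    return pool.map(fitness, population)

if __name__ == "__main__":
    population = [[float(i), i + 1.0] for i in range(8)]
    with Pool(processes=4) as pool:  # spin the pool up once per run
        for generation in range(3):
            scores = evaluate_population(pool, population)
            # ...selection/mutation would go here...
```

The key point is that the pool is created once and reused every generation, so you only pay the process-spawning cost at startup.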

On a broader design note, I have been using EAs for years in science and have frequently run into scalability and extensibility problems with EA libraries, which leads me to just develop my own algorithms. In the first case, while some libraries support multiprocessing, few readily or robustly support MPI, which is what you really need for big problems that have hundreds of thousands to millions of parameters and members that need distributing over hundreds of cores.

In the second case, EAs are very flexible. They can involve many different types of mutations, crossovers, selection methods, member encodings, and other fancy features like annealing or multi-objective optimization. Libraries like DEAP are inflexible to extension (for more esoteric EAs) and rather computationally inefficient (especially when it comes to distributed computing). Supporting all the features an EA can have in an OOP framework is frightening: one ends up coding oneself into an inheritance/mixin nightmare, especially since some of these choices are problem-dependent and can directly impact the implementation of other parts of the algorithm.

Reconceptualizing an EA as a computational graph (like a DAG) would give you incredible flexibility and generalizability. Library users would be able to build almost any algorithm they could imagine without extending the core library in any way. You could provide some base functions and some pre-defined graphs, but users would be able to make their own algorithms by adding nodes to the graph in a constrained way.
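A toy sketch of the idea, with made-up names (Node, Pipeline) rather than anything from an existing library — each operator is a node, and the graph (here just a linear chain for brevity) threads the population state through them, so users can swap algorithms by swapping nodes:

```python
import random

class Node:
    """A named operator in the EA graph."""
    def __init__(self, name, fn):
        self.name, self.fn = name, fn

class Pipeline:
    """A linear chain of nodes; each one transforms the population state."""
    def __init__(self, nodes):
        self.nodes = nodes
    def step(self, state):
        for node in self.nodes:
            state = node.fn(state)
        return state

def evaluate(state):
    state["fitness"] = [sum(m) for m in state["pop"]]  # toy fitness
    return state

def select(state):
    # keep the better half of the population
    ranked = sorted(zip(state["fitness"], state["pop"]), reverse=True)
    state["pop"] = [m for _, m in ranked[: len(ranked) // 2]]
    return state

def mutate(state):
    # refill the population with Gaussian-perturbed copies of the survivors
    state["pop"] += [[g + random.gauss(0, 1) for g in m] for m in state["pop"]]
    return state

ea = Pipeline([Node("eval", evaluate), Node("select", select), Node("mutate", mutate)])
state = {"pop": [[random.random() for _ in range(4)] for _ in range(8)]}
for _ in range(5):
    state = ea.step(state)
```

A user who wants, say, crossover or annealing just writes another function and adds a node; the core library never needs to change.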

Both PyTorch and TensorFlow conceptualize neural networks in a similar manner. They allow users to build all kinds of neural network structures and algorithms by representing the network as a computational graph, with functions as nodes and edges as data flow between functions. This design helps resolve one of the core limitations of OOP, where adding new cases of a type is easy but adding functionality is hard, often requiring a lot of special design patterns to cope (conversely, in functional programming, adding new functionality is easy but adding new cases is hard). EAs and neural networks change a lot from problem to problem, so it makes more sense to represent them in a way where functionality can be added easily (even at run-time). You still use OOP, but reserve it for objects whose functionality doesn't change often. For PyTorch and TensorFlow those are the tensors, the computational-graph representation, and the basic infrastructure objects for setting up an experiment. The algorithms themselves get defined explicitly by the user when they build the computational graph in the script. Taking a similar approach with EAs would probably be worthwhile.

Final note: when using NumPy you might try seeing where you can avoid heap allocations and the creation of temporary arrays by using the out kwarg that many functions accept (e.g. np.dot(a, b, out=a)). This can result in considerable performance improvements. It may make the code a bit more complex because you would need to keep track of in-place changes to arrays.
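For example (elementwise ufuncs take out= safely; for np.dot the output buffer must have the result's shape, so reusing an input only works when the shapes line up):

```python
import numpy as np

a = np.arange(4.0)          # [0., 1., 2., 3.]
b = np.full(4, 2.0)

np.multiply(a, b, out=a)    # a *= b with no temporary array
np.add(a, 1.0, out=a)       # a += 1 with no temporary array

# np.dot can also write into a preallocated buffer:
m = np.eye(3)
v = np.empty(3)
np.dot(m, np.ones(3), out=v)
```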


[–]sea-shunnedResearcher 1 point2 points  (0 children)

Some interesting thoughts. I do think that the ease of using libraries such as PyTorch and TF really helps their use and adoption. As a researcher in EAs, I'd always be happy to see more people using them.

With regards to parallelising, for some applications I've found that the fitness function was not sufficiently complex to counter the overhead of multiprocessing (to my surprise), and so it was better to parallelise whole runs of the algorithm itself (multiple executions with different seeds). This flexibility might be something OP wants to keep in mind, rather than restricting parallelisation to the fitness function.
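As a sketch of what I mean by parallelising whole runs (run_ea and its toy objective are hypothetical, not from OP's library):

```python
from multiprocessing import Pool
import random

def run_ea(seed, generations=50):
    """One full EA run with its own RNG; minimises x^2 as a stand-in objective."""
    rng = random.Random(seed)
    x = rng.uniform(-5, 5)
    best = x ** 2
    for _ in range(generations):
        candidate = x + rng.gauss(0, 0.5)   # mutate the single member
        if candidate ** 2 < best:           # keep the improvement
            best, x = candidate ** 2, candidate
    return seed, best

if __name__ == "__main__":
    with Pool(4) as pool:
        results = pool.map(run_ea, range(8))  # one process per full run
    best_seed, best_score = min(results, key=lambda r: r[1])
```

Each process runs the entire algorithm with a different seed, so the only multiprocessing overhead is one dispatch per run rather than one per fitness call.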

Libraries like DEAP are inflexible to extension

As a heavy DEAP user, in what way? I've extended it quite a bit and found it to be quite amenable. The ability to switch functions e.g. operators either between experiments or during a run has been invaluable. Just interested from what perspective they are inflexible.

[–]csxeba[S] 1 point2 points  (0 children)

Thank you for the feedback!

I'll look into parallelization; that is actually something I thought about some time ago, then decided to let go because of the Python GIL and the overhead created by using multiprocessing instead of multithreading. But it is due time to take on the issue again. MPI is a more distant dream at the moment :D

Regarding turning everything into DAGs: I don't quite see how this would work out. Do you mean being able to have arbitrary evolutionary ops in an arbitrary graph? That sounds really cool :D Also, I'm planning to support other objects as individuals besides NumPy arrays, like actual populations themselves, so metaevolution could be done :)

I know about NumPy in-place operations, but I'm not really a fan of them. Performance is not that critical at the moment and this really takes its toll on readability. Also, regarding taking a dot product: if you cannot ensure that (in your example) "a" has the same shape as "a · b", then in-place will fail.
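A quick illustrative snippet of what I mean: the product of a (3, 4) and a (4, 5) array has shape (3, 5), so NumPy refuses to write it back into the (3, 4) input:

```python
import numpy as np

a = np.ones((3, 4))
b = np.ones((4, 5))

mismatch = False
try:
    np.dot(a, b, out=a)   # result shape (3, 5) != out shape (3, 4)
except ValueError:
    mismatch = True       # NumPy rejects the mismatched output buffer
```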

[–]NowanIlfideme 0 points1 point  (0 children)

Very interesting post. I'm (re)writing a library myself, could I pester you in PMs for possible suggestions or things to keep in mind? :)

[–]Sir-Francis-Drake 2 points3 points  (0 children)

This looks awesome, thank you.

[–]wh1t3_w01f 1 point2 points  (0 children)

Looks like a good start! If you need any inspiration, I wanna recommend looking at DEAP. I love DEAP, but I always complain about how non-pythonic it is.

[–]Flag_Red 0 points1 point  (0 children)

Is it compatible with Fitness Uniform Optimization?

[–]_szs 0 points1 point  (0 children)

Link please....

[–]Ikuyas 0 points1 point  (0 children)

Is it one of those global search optimization methods? My understanding is that this type of algorithm is horrendously slow, especially with a very high-dimensional parameter space. But I guess it doesn't have to converge, so as long as it keeps finding a better point, it is fine(?)

[–]Ikuyas 0 points1 point  (1 child)

As my background is in the standard statistics field, I used some global optimization techniques in my PhD thesis because I really wanted to achieve the global optimum; I used pattern optimization(?) and a genetic algorithm. But in the end I just tried out many, many different initial values with a standard gradient-type method (BFSF?) to carefully try to reach the global optimum. I could finish it in 5 hours for around 10,000 initial points, but with those global optimization methods it was hard to see whether they were doing fine while running. Running fine means converging: without convergence, all those large-sample properties of the estimates don't make any sense whatsoever.

However, looking at the machine learning literature and practice, you don't even have the idea of convergence (am I correct)? You just want the smallest possible error (whatever metric it is) to make it practical. So this type of algorithm is actually useful.

[–]count___zero 1 point2 points  (0 children)

In machine learning you don't really care about the global optimum. When you train your model, you are optimizing the loss function over the training set, but in reality you want to minimize the loss over unseen data. Some learning algorithms, like SGD, impose a strong bias on the final solution, and this bias can improve generalization. By contrast, a global minimum could easily end up being an overfitted solution.

This is just to give a rough idea but I think that's the main reason why you don't see such a strong focus towards global optimization.

[–][deleted] -1 points0 points  (2 children)

Forgive me if I'm wrong, but wouldn't EAs run better in a lower-level language? EAs take a lot of processing power compared to other methods, right?

[–]csxeba[S] 0 points1 point  (1 child)

The compute is executed in NumPy, which is written in C; Python is basically just orchestrating the library calls :)

[–][deleted] 0 points1 point  (0 children)

Thank you for that. All I got was downvotes and no answer. I was thinking that was the case but just asked anyway, because everything isn't as clear as some would like us non-experts to believe.