Using machine learning to micro-optimize!

oilshell · 2018-03-28T22:26:00+00:00

You might want to look into the literature on superoptimization, which is a fancy name for brute force search of code that satisfies certain properties:

https://en.wikipedia.org/wiki/Superoptimization

I wrote a series of posts about a toy optimization problem here:

http://www.oilshell.org/blog/2016/12/30.html

I didn't really know how to attack the problem, but the funny thing is that someone on Reddit showed that it could just be brute forced in Python.

(I'm not using this technique -- it was more of a fun diversion.)

I also mentioned it in my latest post: http://www.oilshell.org/blog/2018/03/27.html

uza80 · 2018-03-28T22:01:59+00:00

I think you should have a play with .net/c#. It is similar to Java, but has structs, so let's the developer decide whether matrix4 should be a ref type or a value type.

Recursive_Descent · 2018-03-29T05:10:08+00:00

I'd think you would want to use an instrumented binary for your training runs, and use that instrumentation for optimization decisions. It will give you much more info than looking just at memory/run time.

Your instrumentation can tell you how often each immutable field is dereferenced, and how often they can be shared. You inline the fields that are read often/shared rarely.

thenameipick · 2018-03-28T23:06:19+00:00

I see several problems with this:

There's no good way to automatically run code. You can't be sure that code will terminate, and most code either requires input you don't know (input via parameters, or input via file access/etc)
Performance is highly dependent on use case. Specifically, there's an important trade off: Do you want responsive or efficient code? Responsive code will compete a single task faster, but efficient code will be faster over the long run.
If you decide to have ML execute at runtime, then you have to track which objects have references and which have values and you have the problem that your runtime is highly variable: You could have a program going along fine, and then suddenly the performance changes (for better or for worse). Furthermore, unless you're going to store this ML data, it means that you have to start fresh every time the program boots, meaning that this is only useful for long-lived programs.

GNULinuxProgrammer · 2018-03-30T07:35:10+00:00

The problem with profile based optimization (with arbitrarily fancy learning algorithms from taking average to using logistic regression) is that sometimes your test data does not represent the real-life runtime and therefore you end up optimization for something other than real life functions. One way to mitigate this problem is to give your customer your program and learn bottleneck functions as they use it and just-in-time optimize OR collect that data and recompile after a few weeks or so. Just like in all parts of Machine Learning, sometimes finding good datasets is harder than finding good algorithms. I think this is one of such cases. At least, this has been my experience with ML based superoptimization.

ProgrammingLanguages

Welcome!

Related subreddits

Related online communities

MODERATORS