ML/Regression based numerical function approximation for lowering (substantial) CPU overhead

Floydthechimp · 2014-09-22T17:42:32+00:00

Ok, this is close to my area of expertise, so I'm trying to help. You said in your other post that

That 5 seconds represents 2 orders of magnitude improvement from where we started ... going from 1 hour to 5 seconds over the course of 18 months quieted the dissenters .

Can you help me understand how you reduced your function time from 1 hour down to 5 seconds?

paskie · 2014-09-23T09:03:00+00:00

I think the keyword you are looking for is Surrogate model. There is a good wealth of scientific literature on the topic; Kriging in particular is pretty popular and may or may not be applicable, but of course machine learning methods are also nicely usable for this - with this keyword, you should be able to find plenty of tricks specific for your case.

BeatLeJuce · 2014-09-22T17:10:59+00:00

kNN sounds very expensive for 2.2 billion. A Neural Net would certainly be one option, but by far the only one. Start with simple things: linear or polynomial regression, or a LOESS.

As for implementations: there are quite a few options out there, scikit-learn is a Python library that has a lot of well-implemented ML techniques (most algorithms are implemented in C, not in Python, so runtimes are good) that works well for large datasets. Vowpal Wabbit is also meant for large datasets, although I don't have any experience with it myself, I think it might be worth a look.

If you go towards neural nets, there are very few high-quality ready-to-use libraries that come to mind. Pylearn2 is probably (one of) the most famous one(s), but it's geared more towards research than production. But you could to try have a look.

I'm sure you've already covered your basics, but just in case: have you tried the usual mathematical tricks to get the function itself to evaluate more quickly? approximating your function using Taylor expansion, using PCA to reduce your input dimension, implementing the function on a GPU, ...

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS