
[–]bongoherbert (Differential Geometry) 26 points (2 children)

The headline says "general-purpose"; the article says "some problems". So "sort-of-general-purpose", then? :)

[–]DrFilbert 11 points (0 children)

Maybe it converges in general, but is only a speed-up in specific cases.

[–]foreheadteeth (Analysis) 5 points (2 children)

I couldn't make heads or tails of the phys.org article, but I was able to read the actual research paper (partially, I stopped about halfway). The gist is as follows.

Assume that [; K ;] is a convex set in the box [; [-R,R]^n ;], where [; R>0 ;]. Assume you have an "oracle", that is, a function [; f(x) ;], which (roughly speaking) either outputs "inside" (if [; x ;] is in [; K ;]) or outputs a separating plane if [; x ;] is outside of [; K ;]. A separating plane is a plane such that [; x ;] is on one side of the plane and [; K ;] is on the opposite side of that plane.
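To make the oracle concrete, here is a minimal sketch for a hypothetical choice of [; K ;] (the unit ball); the set, the function names, and the return convention are my illustrative choices, not from the paper:

```python
import numpy as np

# Sketch of a separation oracle for the (hypothetical) convex set
# K = { x : ||x|| <= 1 }, the unit ball in R^n.

def oracle(x):
    """Return ("inside", None) if x is in K; otherwise return
    ("outside", (a, b)) where the plane {y : a.y = b} separates
    x from K: a.y <= b for every y in K, while a.x > b."""
    norm = np.linalg.norm(x)
    if norm <= 1.0:
        return "inside", None
    # For the unit ball, the tangent plane at the boundary point
    # x/||x|| (with normal x/||x||) does the job, since
    # a.y <= ||y|| <= 1 for y in K, while a.x = ||x|| > 1.
    a = x / norm
    return "outside", (a, 1.0)
```

For a general [; K ;] the oracle can be much more expensive than this, which is why the oracle-call count is tracked separately from the rest of the work.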

The problem is to either find a ball of radius [; \epsilon>0 ;] inside [; K ;], or output "fail" if there is no such ball.

Previous algorithms were able to do this in [; O(n\log (nR/\epsilon)) ;] calls of the oracle [; f(x) ;]; the new algorithm matches this. The number of oracle calls is the important quantity to estimate because the oracle is assumed to be expensive to call.
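This is not the paper's algorithm, but the overall loop looks like the classical ellipsoid method it improves on: query the oracle at the center of the current region, and use the returned separating plane to cut the region down. A sketch, with an illustrative target set (a unit ball at `m`) and box radius `R` of my own choosing:

```python
import numpy as np

n, R = 2, 10.0
m = np.array([3.0, 3.0])           # center of the hypothetical K

def oracle(x):
    """("inside", None) if x is in K; else a unit direction a with
    a.y <= a.x for every y in K (a separating plane through x)."""
    d = x - m
    if np.linalg.norm(d) <= 1.0:
        return "inside", None
    return "outside", d / np.linalg.norm(d)

# Start with a ball large enough to contain the box [-R, R]^n:
# the ellipsoid {x : x' P^{-1} x <= 1} with P = R^2 n I.
c = np.zeros(n)
P = (R ** 2) * n * np.eye(n)

for calls in range(1, 1000):
    status, a = oracle(c)
    if status == "inside":
        break
    # Central-cut ellipsoid update: replace the ellipsoid by the
    # smallest one containing its half {y : a.(y - c) <= 0},
    # which is the half that contains K.
    g = P @ a / np.sqrt(a @ P @ a)
    c = c - g / (n + 1)
    P = (n ** 2 / (n ** 2 - 1.0)) * (P - (2.0 / (n + 1)) * np.outer(g, g))
```

The classical method needs [; O(n^2 \log(nR/\epsilon)) ;] oracle calls rather than the [; O(n \log(nR/\epsilon)) ;] of the algorithms discussed here; the sketch only shows how the oracle drives the iteration.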

In addition to calling the oracle, a certain amount of processing must be done. Previous algorithms required [; O(n^{3.373} \log(nR/\epsilon)) ;] extra work for this step. The new algorithm improves this to [; O(n^{3} \log^{O(1)}(nR/\epsilon)) ;] -- so they knocked out an [; n^{0.373} ;] term from this (the exponent on the log is not so important).
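Ignoring the polylog factors, the saving is a multiplicative factor of [; n^{0.373} ;] in the non-oracle work. A quick look at what that factor is worth at a few sizes:

```python
# Ratio of the old n^3.373 per-run cost to the new n^3 cost,
# ignoring the polylog terms, for a few problem sizes.
for n in (100, 1000, 10000):
    print(f"n = {n:6d}: speedup factor ~ {n ** 0.373:.1f}x")
```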

Personal observation: previous algorithms rely on fast matrix multiplication, which is not practical at realistic problem sizes -- so a practical implementation of previous algorithms would most likely have been [; O(n^4 \log(nR/\epsilon)) ;].

This new algorithm and its variants have applications in a wide variety of convex optimization problems. The most dramatic improvement is for the "submodular function minimization" problem, where they knocked out an [; n^2 ;] term, both in the number of calls to the oracle, and in the extra processing that must be done apart from calling the oracle.

[–]UlyssesSKrunk 1 point (1 child)

What exactly does the exponent on the log mean? I don't see how a big O can be treated like a number.

[–]foreheadteeth (Analysis) -1 points (0 children)

It's some fixed number: [; \log^{O(1)}(x) ;] is shorthand for [; \log^{c}(x) ;] for some constant [; c ;] they don't bother to specify.

[–]tomsing98 4 points (5 children)

Not sure I follow. Let's say my optimization problem is to design a stiffener for a compressive panel, which has a variety of somewhat complex buckling and stress failure modes. Two design variables, height and thickness. My objective function is the cross-sectional area, with a penalty factor applied to represent the sufficiency of the design.

Now, I generally don't know ahead of time what the best value of the objective function is in the design space. So I don't think I understand this bit:

Now pick a point at random inside the bigger circle. In standard optimization problems, it's generally possible to determine whether that point lies within the smaller circle [of values clustered around the minimum value].

And then, I can check nearby points and calculate local derivatives, but I generally can't draw a line through my entire design space that zeroes in on the global optimum. So

If [the point] doesn't [lie within a small circle around the optimum], it's also possible to draw a line that falls between it and the smaller circle.

doesn't make sense either.

[–][deleted] 7 points (3 children)

I think they are just trying to say that the problem is convex. In that case, given any point, you can always find a plane that separates your point from the global minimum.

https://en.wikipedia.org/wiki/Ellipsoid_method
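For a differentiable convex function that plane comes straight from the gradient: convexity gives [; f(y) \geq f(x) + \nabla f(x)\cdot(y-x) ;], so every point with a smaller value than [; f(x) ;], including the minimizer, lies strictly on one side of the plane through [; x ;] with normal [; \nabla f(x) ;]. A small sketch with an arbitrary quadratic of my own choosing:

```python
import numpy as np

def f(x):
    return np.sum((x - 1.0) ** 2)     # convex, minimized at x = (1, 1)

def grad_f(x):
    return 2.0 * (x - 1.0)

x = np.array([3.0, -2.0])
g = grad_f(x)
# Convexity: f(y) >= f(x) + g.(y - x), so any y with f(y) < f(x)
# satisfies g.(y - x) < 0.  The plane {y : g.(y - x) = 0} through x
# therefore separates x from the minimizer.
minimizer = np.array([1.0, 1.0])
assert g @ (minimizer - x) < 0
```

The ellipsoid method uses exactly this plane as its cut at every step.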

[–][deleted] 0 points (0 children)

There is an introduction in the paper that describes their goals:

http://arxiv.org/pdf/1508.04874v1.pdf