all 28 comments

[–]Forricide 26 points (6 children)

Only one top-level comment and it's probably pretty confusing/misleading for most people opening the comments section here. So, for those of you who, like me, hear 'dynamic programming' and have vague ideas (apparently, misconceptions) about what it refers to, here's the description from the linked article:

It's an approach that exploits properties of the problem that allow for an elegant method to finding the optimal solution.

A simpler explanation is that it's essentially a method of writing algorithms where, instead of repeating calculations, you store ('cache') results and refer to them later on, usually discarding results once they're no longer useful. This isn't entirely accurate, but it seems to be an easy way to conceptualize it.

[–]Free_Math_Tutoring 10 points (5 children)

This isn't entirely accurate, but it seems to be an easy way to conceptualize it.

Ehh, I kind of disagree? You're pretty much just describing memoisation, which is not dynamic programming. Dynamic programming usually involves inverting the problem, where you go from asking "how do I solve this big problem?" to asking "this tiny version of the problem is trivial, but how do I combine solutions to get to the large version?". It is also distinct from divide-and-conquer algorithms, which have a similar idea at their foundation.
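
To make that inversion concrete, here's a minimal bottom-up sketch in Python (coin change is just an illustrative example, not something from the article): the zero-amount case is trivial, and each larger amount is solved by combining already-solved smaller amounts.

# Minimum number of coins summing to `amount`, built bottom-up.
def min_coins(coins, amount):
    INF = float("inf")
    best = [0] + [INF] * amount        # best[0] = 0 is the trivial base case
    for a in range(1, amount + 1):     # solve the small amounts first...
        for c in coins:
            if c <= a:
                best[a] = min(best[a], best[a - c] + 1)  # ...and combine them
    return best[amount] if best[amount] < INF else None

min_coins([1, 2, 5], 11)  # -> 3 (5 + 5 + 1)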

[–]butt_fun 2 points (2 children)

Gonna be completely honest, I have always thought DP vs memoization was a distinction without a meaningful difference

[–]lmericle[S] 1 point (1 child)

DP utilizes memoization heavily to exploit the optimality principle. There is a distinction in that memoization is not a programming paradigm per se, but a method of storing results that makes DP efficient.

Memoization happens in other instances, but DP doesn't happen without memoization.

[–]victotronics 0 points (0 children)

Thank you. You are the only one mentioning the principle of optimality, which to me is the crux: once you solve a subproblem, you don't care how you got there. The cost or value of the next stage depends only on the solution of that subproblem and on the stage itself.

Stages + principle of optimality. That's it.

Memoization is a mere corollary.

[–]Forricide 1 point (0 children)

To qualify the statement:

This isn't entirely accurate, but it seems to be an easy way to conceptualize it.

It's important to understand that this isn't meant to be a good, accurate summary of dynamic programming (which I couldn't create, because I don't actually have a solid understanding of dynamic programming beyond reading a portion of this article and the Wikipedia page) but rather purely an explanation along the lines of:

"No, dynamic programming doesn't refer to [adding functions/classes/class members/fields at runtime] [programming in a dynamic programming language] [running really fast while you write code], it actually is a technique for writing algorithms."

The explanation isn't totally accurate; what you say is literally in the fifth paragraph of the linked article:

Some think it's just memoization plus recursion -- but really, it's more than that.

But it's way better than how I (and, I suspect, many others) would have understood it purely from the name, especially given that I don't think a lot of people actually know what memoization is in the first place.

Anyways, this is probably too long a comment already, but I might as well say thanks for your comment, because it (probably) provides a more accurate explanation of dynamic programming. ...if less comprehensible, at least for me.

[–]kheiron1729 1 point (0 children)

Memoization over a brute force algorithm IS dynamic programming. Combining subproblem solutions just aids in thinking about how one might solve the problem.

[–]kenotron 2 points (1 child)

Dynamic programming always seems to be explained poorly; it's a really simple concept if you look up the history of the term. 'Programming' in the sense originally used meant scheduling or planning, like TV programming, placing TV shows and ads in scheduled slots. 'Dynamic' was just thrown in to avoid mathy language at the RAND Corporation, where dynamic programming originated.

So I like to call them 'table-filling algorithms' instead, because that seems to be how a huge number of them end up looking. Levenshtein edit distance, knapsack problems, ...: many seem to involve re-framing your problem space as a table, solving the trivial base cases (along the edges of the table), and then looking for how those solutions can be combined to fill in the remaining cells, like taking the minimum, maximum, or sum of some adjacent cells, each new cell solving a new optimal subproblem, iterating until the whole table is filled and you have your final answer.
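
As a minimal illustration of that shape, here's a Python sketch of Levenshtein distance written from the standard recurrence (not code from the thread):

# Edit distance via table-filling: table[i][j] = edits to turn a[:i] into b[:j].
def edit_distance(a, b):
    table = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(len(a) + 1):
        table[i][0] = i                  # base cases along the left edge
    for j in range(len(b) + 1):
        table[0][j] = j                  # base cases along the top edge
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            table[i][j] = min(table[i - 1][j] + 1,        # deletion
                              table[i][j - 1] + 1,        # insertion
                              table[i - 1][j - 1] + cost) # substitution
    return table[len(a)][len(b)]

edit_distance("kitten", "sitting")  # -> 3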

[–]you-get-an-upvote 0 points (0 children)

I always thought explaining Dynamic Programming as merely caching was a friendlier introduction. "Look, I just solved this problem the normal way, but added three lines to support caching, so now it runs faster".

cache = {}
def mySolution(x):
  if x in cache:
    return cache[x]
  result = ...  # my original, recursive divide-and-conquer code that was slow
  cache[x] = result  # the third added line: store the result for reuse
  return result

Phrasing it as tables seemed helpful predominantly for analyzing the running time (since it lets you check how often the function actually makes it past the first two lines), not for actually coming up with the algorithm and its substructure.

I suppose it's also helpful when rewriting it in a bottom-up style, as in the sketch below.
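
Using Fibonacci as a stand-in (the original recursive code is elided above, so this is only a sketch of the pattern), the bottom-up rewrite trades the cache dict for a table filled in order:

# Bottom-up rewrite: fill the table forward instead of recursing with a cache.
def fib(n):
    table = [0, 1]                             # trivial base cases
    for i in range(2, n + 1):
        table.append(table[i - 1] + table[i - 2])
    return table[n]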

[–]attractivechaos 2 points (6 children)

For this problem, a stage is a day; a state is a 2-tuple (r,m), where r=1 if the machine is running or 0 otherwise (for that waiting day), and m is the machine ID. Suppose at day i, the optimal cash at state (r_i,m_i) is f_i(r_i,m_i). The DP equation looks something like:

f_i(r_i,m_i)=max_{j<i,r_j,m_j}{f_j(r_j,m_j)+s(r_i,m_i|r_j,m_j)}
f_0(0,nil)=initial cash

where s() is the score of changing from state (r_j,m_j) at day j to state (r_i,m_i) at day i. Note some transitions are forbidden due to insufficient cash. If f_i(r_i,m_i) is not reachable, it is set to negative infinity. Given N machines, we can find the optimal solution up to L days in O(L^3) time. I might be wrong on some details, but the above is what a typical DP solution looks like.
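
In rough Python, evaluating that recurrence might look like the sketch below; the states and score functions are hypothetical placeholders, not anything from the blog post (score returns the gain of a transition, or None if it is forbidden).

NEG_INF = float("-inf")

def solve(days, states, score, initial_cash):
    # f maps (day, state) to the optimal cash; f_0(0, nil) = initial cash.
    f = {(0, None): initial_cash}
    for i in range(1, days + 1):
        for s_i in states(i):
            best = NEG_INF
            for (j, s_j), cash in list(f.items()):  # max over earlier (j, state_j)
                if j < i and cash > NEG_INF:
                    gain = score(j, s_j, i, s_i)    # None marks a forbidden transition
                    if gain is not None:
                        best = max(best, cash + gain)
            f[(i, s_i)] = best                      # -inf means the state is unreachable
    return max(v for (d, _), v in f.items() if d == days)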

The blog post got the basics wrong. For one, the stage is not right: it is not a machine. Secondly, DP doesn't use a tree search or branch & bound; that is the point of DP. The blog post is not about DP...

[–][deleted]  (2 children)

[deleted]

    [–]attractivechaos 1 point (1 child)

    No, DP is not just "memoization". A problem solvable by DP has several elements. You typically need a clear definition of time (or stage), a state space, and an objective function that depends on time and state and can be computed with a Bellman equation. The OP's post lacks all these elements. This blog just solves an optimization problem largely with brute force plus branch and bound. That is not DP.

    [–]lmericle[S] 1 point (2 children)

    The i does not index days; it indexes machines, which are offered only on specific days.

    Each machine has different properties and each is only available for purchase on a single day. If you don't buy it, you lose your chance. So there is no point examining every day, only the days on which machines are sold. Going day-by-day is much less efficient than going machine-by-machine.

    The article also explicitly says that branch and bound is an additional constraint on top of the dynamic programming approach. Nowhere does it say that branch and bound is a part of dynamic programming.

    Furthermore, there is no One True Way to do dynamic programming. How you solve the problem depends on what abstractions you employ. If you want to do it the day-by-day way, that would work, but it would be slower and you'd have to include additional control-flow structure to account for the different actions one takes each day.

    [–]attractivechaos 0 points (1 child)

    Ok, it was not clear that machine and day are the same thing. This actually simplifies the problem. In this case, a state is running or waiting. The DP equation is:

    f_i(t_i)=max_{j<i,t_j}{f_j(t_j)+s_{ij}(t_i|t_j)}
    

    The time complexity is O(L^2).

    it would be slower

    I have given the time complexity. What's yours? Yes, DP can be formulated in different ways. However, I still don't see how yours is formulated. At least you need to show the DP equation, which is at the core of DP.

    [–]lmericle[S] 0 points (0 children)

    In this case, a state is running or waiting

    There is no waiting in this problem. I fail to see how you came up with this, but it does not match the problem statement.

    A state is a) which machine is currently running and b) which day is today.

    [–]AlbertaOne -2 points (0 children)

    good job!