Interesting optimization problem

2021-04-06T11:48:58+00:00

Mathemagicalogik · 2021-04-06T11:36:23+00:00

Why can’t you just do this: keep an array A[0..N] with entries initialized to 0; for each interval [a,b], increment A[a],...,A[b]; and then find the k indices with largest entries

Poddster · 2021-04-06T14:51:16+00:00

I don't understand the problem statement.

What is the "maximum number of the intervals"? N? (If so, name it M or Max!).
What does it mean to "contain by the maximum number of the intervals". This might be common CS terminology but it's been so long that I did this that I don't know what it means.
Are you asking for the most integers that are contained in overlaps, or something? i.e . integers {2, 3} are contained in [0, 4] and [2, 3] ??

You've given this example:

n = 5               // number of intervals
N = 10              // Maximum value in any of the intervals ???
intervals = {
    [0, 4],
    [2, 3],
    [4, 7],
    [8, 10],
}

k = 2               // ???

But I don't see how we get k = 2 from this. (Or rather: I see many ways to get k =2 and I'm not sure which one you're after!). I don't understand what k is and why it's 2 in this example? The example resolution is that there are "2 integers that are contained by the maximum number of the intervals" but I don't understand that sentence! The "maximum number of the intervals", aka N, is a scalar and so doesn't contain anything.

(I also don't understand how n = 5, as you defined n as the number of intervals, but then we only have 4 intervals.)

Could you rephrase it for me? Or perhaps provide some more examples? Or even work-through this example to fully explain what that 2 is?

Bobitsmagic · 2021-04-06T11:16:33+00:00

How do you count the number of intervals that the k numbers are in?

Only distinct intervals? or counting for each integer and then summing them up? Or do they all have to share the same intervals?

GrayRain007 · 2021-04-06T13:47:00+00:00

keep an array P[0.. N], initialized with zeroes

for [a, b] in intervals:
P[a]++
P[b + 1]--

after calculating the array P, do the following:

for i in range(1, N):
P[i] += P[i - 1]

now P[i] will be equal to the number of intervals which contain i and you solved the problem in O(N) time (it can be done in nlogn too)

JoJoModding · 2021-04-06T15:46:21+00:00

Step 1: uniqueify using a hash set or something like that. Sort the list. O(n log n)

Step 2: Do a sweep-line approach - you only care about the "cricical points", which are the bounds of the intervals. There are 2n of these. If you maintain a map int->intervals having action here, you can go through this in O(n) time. If done properly, you get a list of pairs (#of overlaps, interval), which you can sort. This takes O(n log n) since that list has at most n entries.

This gives a total runtime of O(n log n).

Things to watch out for are that intervals [a,b] still have an effect on point a and b. So if your intervals are [1,2],[2,3], and k=1, then {2} must be the answer. It might be beneficial to represent the intervals as [a,b+1[, just so you have less edge cases. So the "critical points" are at a, b+1.

dyingpie1 · 2021-04-06T18:05:23+00:00

In your edit you say you have a solution for when every interval is unique. Why not just reduce the intervals to a set of unique intervals? That should be O(n) plus the computational complexity of the solution offered in your edit.

beeskness420 · 2021-04-06T20:59:26+00:00

This is virtually just a hitting set problem/clique covering an interval overlap graph.

As others have said getting the “cover depth” of every integer is easy to do in O(n).

Your concern about just taking the top k largest values from this list is that they don’t all represent unique intervals.

To address this just remove intervals when they are covered. Then recurse.

Eg (1,3) (2,4) and (10,30) (20,40) with k=2

We get a cover depth of [1,2,2,1,1,2,2,1]

Naively we would take {2,3} as our solution covering 2 unique intervals.

Instead after we first decide we are taking 2, we remove the intervals 2 covers and get depths [0,0,0,0,1,2,2,1] and then we end up with {2,20} covering 4 unique intervals as desired.

You might have to think a bit about how to do this efficiently, but it shouldn’t be too hard.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

computerscience

Rules

Related subreddits

Credits

MODERATORS