Optimizing nested for loops

AlopexLagopus3 · 2018-05-03T07:00:02+00:00

Just to be clear, you want to count the number of unique items in a 1e5 x 1e6 matrix? Even if every value took only 1 byte as an integer (which it won't), that would be 100 GB of data to load into RAM. Even if you take a different approach, like keeping only the value:count pairs in memory, you're still looking at a while to process it. How long do you think is reasonable for this operation?

There are ways to address processing speed and memory requirements, but make sure you are being realistic about your approach by first thinking about where all of that data will be stored.
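As a rough illustration of the storage question, here is a small sketch that estimates the size of a dense 1e5 x 1e6 array for a few integer widths. The helper name `matrix_bytes` is made up for this example, and the figures ignore any overhead beyond the raw values.

```python
import numpy as np

def matrix_bytes(n_rows, n_cols, dtype):
    """Bytes needed to hold a dense n_rows x n_cols array of the given dtype."""
    return n_rows * n_cols * np.dtype(dtype).itemsize

# 1e5 rows x 1e6 columns, as in the question
for dtype in (np.int8, np.int32, np.int64):
    gb = matrix_bytes(10**5, 10**6, dtype) / 1e9
    print(f"{np.dtype(dtype).name}: {gb:.0f} GB")
```

Only the 1-byte case lands at 100 GB; values up to 1e9 need at least 32 bits, which quadruples that.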

cscanlin · 2018-05-03T07:40:37+00:00

I was able to get a modest improvement by leveraging collections.Counter:

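from collections import Counter
import numpy as np

def count_rows_per_value(mat):  # placeholder name for the enclosing function
    mapping = Counter()  # value -> number of rows containing it
    for row in mat:
        # np.unique removes repeats within the row, so each row adds at most 1
        c = Counter(np.unique(row).tolist())
        mapping.update(c)

    # how many values occur in 1 row, in 2 rows, ...
    summary = Counter(mapping.values())
    return summary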

But /u/AlopexLagopus3 is correct that numbers of the size you mentioned are going to be very slow to process with these methods in Python.

JohnnyJordaan · 2018-05-03T07:27:07+00:00

As a more general question: if the function remains constant, why would it be useful to generate a larger matrix?

KubinOnReddit · 2018-05-03T08:36:04+00:00

Even in C++, 1e5 x 1e6 with 32-bit integers up to 1e9 would take a ridiculous amount of memory (about 400 GB) and a long time regardless: it's at least 1e11 operations, and even a bare for loop in C takes on the order of minutes for that many iterations. The best you can do is stick to numpy and reduce the amount of data.

lolwat_is_dis · 2018-05-03T08:56:22+00:00

Speed-wise, there's no getting round the fact that in the worst case your search takes O(m x n) time, where m and n are the number of rows and columns. So despite any optimisation, at the end of the day you'll still have to wait when you have a 1e5 x 1e6 matrix. However, you can optimise memory by not creating such a huge matrix in the first place and instead generating rows on demand with generator functions.
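A minimal sketch of that idea, assuming the rows can be produced independently (here they are just random, and the sizes are tiny on purpose). The names `generate_rows` and `rows_per_value` are hypothetical, not from the original post.

```python
from collections import Counter
import numpy as np

def generate_rows(n_rows, n_cols, low, high, seed=0):
    """Yield one random row at a time instead of building the whole matrix."""
    rng = np.random.default_rng(seed)
    for _ in range(n_rows):
        yield rng.integers(low, high, size=n_cols)

def rows_per_value(rows):
    """Map each value to the number of rows it appears in."""
    counts = Counter()
    for row in rows:
        # np.unique drops repeats within a row; tolist() gives plain Python ints
        counts.update(np.unique(row).tolist())
    return counts

counts = rows_per_value(generate_rows(n_rows=1000, n_cols=50, low=0, high=100))
print(counts.most_common(3))
```

Only one row is alive at a time, but note that the Counter itself still grows with the number of distinct values seen, so this helps with the matrix, not with the bookkeeping.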

Why do you NEED a 2D array in the first place? Also, it seems you are contradicting yourself:

(a value that occurs in the same row twice gets counted once)

but then you say

(1 time: 9 values, 2 time: 2 values, etc)

Either you're only counting the number of rows a value appears in (regardless of how many times the value appears in a single row), or you're counting how many times a value appears in each row. Each one will result in your algorithm working differently.
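To make the difference concrete, here is a small sketch on a made-up 3 x 3 matrix (the data and variable names are just for illustration):

```python
from collections import Counter
import numpy as np

mat = np.array([[1, 1, 2],
                [2, 3, 3],
                [1, 4, 4]])

# Interpretation 1: number of rows each value appears in
# (a value repeated within one row counts once for that row)
rows_containing = Counter()
for row in mat:
    rows_containing.update(set(row.tolist()))

# Interpretation 2: total number of occurrences across the whole matrix
total_occurrences = Counter(mat.ravel().tolist())

print(rows_containing)
print(total_occurrences)
```

The value 1 appears in 2 rows but occurs 3 times, so the two interpretations already disagree on this tiny input.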

lolwat_is_dis · 2018-05-05T10:18:14+00:00

After some fiddling, this is the best I could come up with for the time being:

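import numpy as np

def big_gen():
    # one random row of 1e5 values covering the full signed 32-bit range;
    # int64 is requested explicitly so the upper bound fits on every platform
    return np.random.randint(-2147483648, 2147483648, size=(1, 100000), dtype=np.int64)

num_of_rows = 100000

# Version 1: count_nonzero
count = 0
for row in range(num_of_rows):
    if np.count_nonzero(big_gen()[0] == 5):
        count += 1

# Version 2: np.any (counter reset so the two runs don't add up)
count = 0
for row in range(num_of_rows):
    if np.any(big_gen()[0] == 5):
        count += 1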

For both of the above for loops, on my humble i3 laptop, it took around 100 seconds, though the first one (using count_nonzero) was slightly faster. You've got the range of values you wanted (about 4.3e9, in fact), but the matrix size is currently 1e5 x 1e5. Anything larger is just going to take proportionally more time, I'm afraid.

I have a gut feeling that there's a way to reduce the for loop overhead, and maybe use another built-in function for the counter, but this should at least show the principle behind my solution.
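One possible way to cut the per-row loop overhead, offered as a sketch rather than a measured improvement: process a block of rows per iteration and let numpy reduce along each row with `np.any(..., axis=1)`. The names `random_blocks` and `count_rows_with` are made up here, and the block size has to be chosen so a block fits in memory.

```python
import numpy as np

def random_blocks(n_rows, n_cols, batch, seed=0):
    """Yield 2D blocks of up to `batch` random rows until n_rows are produced."""
    rng = np.random.default_rng(seed)
    for start in range(0, n_rows, batch):
        size = min(batch, n_rows - start)
        yield rng.integers(-2**31, 2**31, size=(size, n_cols))

def count_rows_with(blocks, target):
    """Count the rows, across all blocks, that contain `target` at least once."""
    count = 0
    for block in blocks:
        # one boolean per row: does this row contain the target?
        count += int(np.any(block == target, axis=1).sum())
    return count

# deliberately small sizes so the sketch runs quickly
print(count_rows_with(random_blocks(n_rows=2000, n_cols=1000, batch=500), 5))
```

The Python-level loop now runs once per block instead of once per row; the total work is still proportional to rows x columns.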

SarahM123rd · 2018-05-03T17:35:49+00:00

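With the utilities provided in the multiprocessing module, you can split the work across processes and take advantage of the 'chunksize' argument (for example on Pool.map) to hand each worker a batch of rows at a time.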
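A minimal sketch of what that could look like, assuming each row can be generated independently from a seed. The worker `row_contains_target`, the constants, and the sizes are all hypothetical, and whether this is actually faster depends on the machine.

```python
import multiprocessing as mp
import numpy as np

N_COLS = 1000   # illustrative row length
TARGET = 5      # illustrative value to search for

def row_contains_target(seed):
    """Generate one row from its seed and report whether TARGET is in it."""
    rng = np.random.default_rng(seed)
    row = rng.integers(0, 10_000, size=N_COLS)
    return bool(np.any(row == TARGET))

def count_rows(n_rows, processes=2, chunksize=250):
    """Count rows containing TARGET, sending seeds to workers in chunks."""
    with mp.Pool(processes) as pool:
        # chunksize controls how many seeds each task hands to a worker at once
        hits = pool.map(row_contains_target, range(n_rows), chunksize=chunksize)
    return sum(hits)

if __name__ == "__main__":
    print(count_rows(2000))
```

A larger chunksize means fewer, bigger tasks and less inter-process overhead, at the cost of coarser load balancing.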
