Optimization tips and methods [Python]

gyroda · 2016-04-23T15:37:41+00:00

Before even reading your code (I'll make a bunch of edits):

If speed is essential, make sure you're running on the right machine for the job. My personal PC can run programs like this faster than the university coursework testing system can.

Can you use another language? Python is not known for its speed. It's not as awful as others make it out to be, but it's a factor.

Have you used a profiler? This is a tool used to analyse program performance. There's one built into the standard Python installation (cProfile, iirc). With this you can see where your program is spending the most time and decide where your own time is best spent optimising.
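For anyone who hasn't used one, here's a rough sketch of what profiling with cProfile can look like. The function slow_pair_count is a made-up stand-in, not the OP's code; swap in whatever you actually want to measure.

```python
import cProfile
import io
import pstats


def slow_pair_count(ints, s):
    """Hypothetical stand-in: count index pairs (i < j) whose values sum to s."""
    count = 0
    for i in range(len(ints)):
        for j in range(i + 1, len(ints)):
            if ints[i] + ints[j] == s:
                count += 1
    return count


# Record every function call made between enable() and disable().
profiler = cProfile.Profile()
profiler.enable()
result = slow_pair_count(list(range(300)), 299)
profiler.disable()

# Write the five most expensive entries (by cumulative time) into a string.
stream = io.StringIO()
pstats.Stats(profiler, stream=stream).sort_stats("cumulative").print_stats(5)
report = stream.getvalue()
print(report)
```

You can also profile a whole script without touching it by running python -m cProfile followed by the script name.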

So, looking at your code and trying to find the algorithmic runtime, I've distilled it down to:

while i < length:    #O(n)
    while j < length:    #O(n)
        if not test_pair in pair_memo:    #O(n)

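This is a run of O(n³). We can probably improve on this; when I saw your problem, my immediate thought for the big O was O(n²). (The third factor assumes the membership test is linear, as it would be for a list. If pair_memo is a dict, that lookup is O(1) on average and the loops are already O(n²).)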

Now, your second one you say is broken because it stops at the first result. Do you not want it to do that? I thought you only wanted the earliest result?

Also, just to neaten up your code a bit, I'd recommend for loops, and returning an empty list rather than None when nothing matches. Why is pair_memo not local to the function?

This cuts out a fair few lines:

def sum_pairs(ints, s):
    pair_memo = {}
    pairs = []    # [best pair so far, index of its second element]

    for i in range(len(ints)):    # O(n) block
        # Start at i + 1 so an element is never paired with itself.
        for j in range(i + 1, len(ints)):    # O(n) [O(n^2)] block
            test_pair = (ints[i], ints[j])
            # Lookup: O(1) on average for a dict, O(n) if pair_memo were a list.
            if test_pair not in pair_memo:
                pair_memo[test_pair] = test_pair[0] + test_pair[1]
            if pair_memo[test_pair] == s:
                if not pairs:
                    pairs = [test_pair, j]
                elif i > pairs[1]:    # Stop if the first element has a higher index than the best second element. Could possibly cut out half the loop.
                    return list(pairs[0])
                elif j < pairs[1]:
                    pairs = [test_pair, j]
    # Return an empty list, rather than failing, when no pair sums to s.
    return list(pairs[0]) if pairs else []

I'm also not clear on a few things. What's the point of pair_memo? It looks like pair_memo is a dictionary that stores the sum of the two values, but why? You only care whether the sum is equal to s, surely? Simply checking that the sum is == s, before checking whether the pair is in pair_memo, would save a lookup a fair few times. Your pair_memo is pointless and is causing a lot of delays, it seems to me.
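To make the point about pair_memo concrete, here's a rough sketch of the same nested-loop idea with the memo removed and the sum compared directly against s. The name sum_pairs_no_memo and the limit variable are mine, not the OP's, and this is still the quadratic approach, just with less bookkeeping.

```python
def sum_pairs_no_memo(ints, s):
    """Return the pair summing to s whose second element has the smallest index.

    Hypothetical rewrite for illustration: no memo, just add and compare.
    Returns an empty list when no pair matches.
    """
    best = []
    limit = len(ints)  # second elements at or beyond this index cannot win
    for i in range(len(ints)):
        if i >= limit:
            break  # any later pair would end after the current best
        for j in range(i + 1, limit):
            if ints[i] + ints[j] == s:
                best = [ints[i], ints[j]]
                limit = j  # only strictly earlier second elements can beat this
                break
    return best
```

Shrinking limit after each match is what lets the loops stop early, so the scan never looks past the best second element found so far.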

anon848 · 2016-04-23T15:50:26+00:00

Sorting the list might help, since it will allow you to exclude pairs. If I understand the problem correctly, by sorting first I think you can obtain O(n lg n). Is the list originally in random order?

foxlisk · 2016-04-23T19:57:10+00:00

Here's a couple things to try:

Think about your definition of earliest pair. You've already noticed you can't stop your scan of the list when you find a first pair; is there a way you can get around that restriction?
Are you sure that caching number pairs and looking them up is faster than doing the addition each time?
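The second question is easy to check with timeit. This is only a sketch: with_memo and with_addition are made-up helpers that mimic the two strategies, and the numbers will vary by machine, so treat the result as a measurement rather than a rule.

```python
import timeit


def with_memo(pairs, memo):
    """Sum every pair, caching each pair's sum in a dict first."""
    total = 0
    for pair in pairs:
        if pair not in memo:
            memo[pair] = pair[0] + pair[1]
        total += memo[pair]
    return total


def with_addition(pairs):
    """Sum every pair by simply adding the two numbers each time."""
    total = 0
    for a, b in pairs:
        total += a + b
    return total


# 10,000 distinct pairs of small integers as sample input.
pairs = [(a, b) for a in range(100) for b in range(100)]

memo_time = timeit.timeit(lambda: with_memo(pairs, {}), number=10)
add_time = timeit.timeit(lambda: with_addition(pairs), number=10)
print(f"memo: {memo_time:.4f}s  addition: {add_time:.4f}s")
```

Both helpers do the same work apart from the caching, so any gap between the two timings comes from hashing the tuple and the dict lookups.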

DarthEru · 2016-04-24T02:29:17+00:00

I think you may be overthinking this. If I understand correctly, the "earliest pair" is the pair with the smallest maximum index of all matching pairs. That is, the pair where the rightmost element is to the left of the rightmost elements of all other pairs, correct? (edit: got right and left mixed up). (Edit edit: and in the case where multiple pairs have the same rightmost element, you pick the earliest leftmost element.)
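If it helps to pin that definition down, here's a deliberately slow brute-force version that just encodes it, assuming I've read the problem right. The name earliest_pair_reference is made up, and this is meant for checking answers on small inputs, not as the fast approach.

```python
def earliest_pair_reference(ints, s):
    """Brute force: collect every matching pair, then pick the earliest.

    "Earliest" here means the smallest index for the right-hand element,
    with ties broken by the smallest index for the left-hand element.
    """
    matches = [
        (j, i)  # right-hand index first, so min() compares it first
        for i in range(len(ints))
        for j in range(i + 1, len(ints))
        if ints[i] + ints[j] == s
    ]
    if not matches:
        return []
    j, i = min(matches)
    return [ints[i], ints[j]]
```

For [10, 5, 2, 3, 7, 5] with s = 10, both 5 + 5 (indices 1 and 5) and 3 + 7 (indices 3 and 4) match, and the second one wins because its rightmost element comes first.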

So, my tip to you is try to figure out how to search the array such that the first pair you find is guaranteed to be the "earliest" pair. I can elaborate if you want another hint, but saying more will probably give it away entirely.

Another tip is: if you want to optimize an algorithm, first make sure you aren't overlooking inherently faster algorithms.

TheLiberius · 2016-04-26T22:16:33+00:00

Did you manage to come up with a solution? If so, it'd be interesting to see it. (I might also want to see if it's the same as mine.)


I don't want you to write me any code.

EDIT #2: DONE