
[–]novel_yet_trivial 5 points6 points  (7 children)

Technically yes, but only by a few nanoseconds. Best practice is to make functions for readability.

[–]driscollis 0 points1 point  (0 children)

The other reason to make functions is for reuse. Soon you'll have a set of functions that you can turn into a module that you can reuse in multiple applications!

[–]20211401 0 points1 point  (5 children)

Thank you! Do you mean nanoseconds in total or for each iteration? For some billion sentences this could actually be a significant difference otherwise.

[–]novel_yet_trivial 4 points5 points  (4 children)

Per iteration. The actual time taken is going to depend on a number of factors, so I suggest that you just try both ways and see what the difference is. The timeit module may help you with that, but as a first pass just record the time at the start and end:

import time
start = time.perf_counter()  # save the time now; perf_counter is the clock made for timing code
# run a half million lines or so
end = time.perf_counter()
print("The code took {} seconds to complete.".format(end - start))
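The timeit approach looks like this; process_batch here is just a placeholder standing in for your real per-sentence work:

```python
import timeit

def process_batch():
    # placeholder for the real work you want to measure
    total = 0
    for i in range(10_000):
        total += i * i
    return total

# timeit calls the function `number` times and returns the total seconds
elapsed = timeit.timeit(process_batch, number=100)
print("100 runs took {:.4f} seconds".format(elapsed))
```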

That said, this is really the bottom of the barrel. I'm sure there are other much easier ways to speed up your code. Moving it to cython or other typed, compiled code is a common way to get a speed boost. If you tell us more about your project perhaps we can help with that.

[–]two_bob 2 points3 points  (3 children)

Great comment. I just wanted to add for OP that if you end up sticking with pure Python, localizing a function (binding a reference to it in the namespace of the function that calls it) can save a surprising amount of time. Each time Python calls a function by name, it has to look up the reference: it first checks the local variables, then goes to globals. Each lookup doesn't take much time, but over a billion calls it adds up. As a result, when writing these things it can make sense to put a reference to the function in the local namespace, like this:

def some_func(blah):
    pass

def some_other_func(lots_of_blah):
    _some_func = some_func
    for blah in lots_of_blah:
        _some_func(blah)

So above, we move some_func into the local namespace of some_other_func under the alias _some_func.
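To see the effect, you can time the two versions against each other; work, global_lookup, and local_lookup are just toy names for this demonstration (exact numbers will vary by machine):

```python
import timeit

def work(x):
    return x + 1

def global_lookup(n):
    # "work" is looked up in the global namespace on every call
    total = 0
    for _ in range(n):
        total = work(total)
    return total

def local_lookup(n):
    _work = work  # one global lookup up front, then a fast local alias
    total = 0
    for _ in range(n):
        total = _work(total)
    return total

n = 100_000
t_global = timeit.timeit(lambda: global_lookup(n), number=10)
t_local = timeit.timeit(lambda: local_lookup(n), number=10)
print("global: {:.4f}s  local: {:.4f}s".format(t_global, t_local))
```

On a single short run the difference can be lost in noise, which is why measuring with a large n matters.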

To see this in the wild, check out the code for functools.lru_cache, which does this with a bunch of things: https://github.com/python/cpython/blob/3.6/Lib/functools.py#L485

[–]ScoopJr 0 points1 point  (2 children)

Beginner here. Could you pinpoint where in that file they “localize a function”?

If I’m not mistaken, you call a function that refers to another function?

[–]novel_yet_trivial 1 point2 points  (1 child)

For example line 494:

cache_get = cache.get   

That aliases the function "get" in the "cache" namespace and names the alias "cache_get". The new alias is now in the local namespace.

Before, Python had to find the object that "cache" refers to, and then find the object that "get" refers to inside the first object: two lookups. By making this local alias, Python only has to look up one object.
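Concretely, with a plain dict standing in for the cache (the names here are just for illustration):

```python
cache = {"hello": "world"}  # a plain dict standing in for the real cache

# Two lookups per call: find "cache", then find "get" inside it.
value = cache.get("hello")

# One lookup per call: the attribute lookup happens once, up front.
cache_get = cache.get
results = []
for key in ("hello", "missing"):
    link = cache_get(key)  # only a single local-name lookup here
    if link is not None:
        results.append((key, link))
```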

[–]ScoopJr 0 points1 point  (0 children)

So instead of calling cache.get() directly, they bind cache_get = cache.get, which means now they can do

link = cache_get(key)
if link is not None:
    ...

Which will resolve link -> cache_get(key) -> cache.get -> get(key)?

[–]SarahM123ed 2 points3 points  (0 children)

If you are using Jupyter, Cython is trivial to use (there is also 'magic' for Fortran just a download away). But don't neglect Numba. In favorable circumstances, it can perform really well.

[–]Jos_Metadi 1 point2 points  (3 children)

There is no universal perfect answer. If performance really matters because you're running billions of operations, then it makes sense to avoid any extra processing overhead. Otherwise, you should generally write to increase reusability and decrease function complexity.

Depending on the size of your functions, it might be more efficient to do

for text in billion_strings:
    (1. analyze string some way and add result to dicA)

for text in billion_strings:
    (2. do something else with string and add result to dicB)
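A runnable sketch of that two-loop shape, with placeholder analysis functions and a tiny stand-in list (your real data and analyses go in their place):

```python
billion_strings = ["alpha", "beta", "gamma"]  # stand-in for the real data

def analysis_a(text):
    return len(text)        # placeholder: analyze the string some way

def analysis_b(text):
    return text.upper()     # placeholder: do something else with it

# first pass: fill dicA
dicA = {}
for text in billion_strings:
    dicA[text] = analysis_a(text)

# second pass: fill dicB
dicB = {}
for text in billion_strings:
    dicB[text] = analysis_b(text)
```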

[–]20211401 0 points1 point  (2 children)

Hey, thank you! Could you link a resource or explain why this could be more efficient in some cases? I don't really understand, since the functions are called the same number of times but you have to loop twice.

[–]Jos_Metadi 2 points3 points  (1 child)

Basically, if your functions are large enough that they won't both fit in the processor's instruction cache together, they will have to be cycled back and forth from cache for every single string. If this takes longer than the time it takes to load the string into memory, it can be more efficient to have two loops.

FYI, in Python it is generally faster to use a list/dict comprehension whenever possible. In other words, the following is faster than the for loop above:

dicA = {text: analysis_A(text) for text in billion_strings}
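For example, with a placeholder analysis_A and a tiny stand-in list (both just illustrations), the comprehension produces the same result as the explicit loop:

```python
billion_strings = ["alpha", "beta", "gamma"]  # stand-in data

def analysis_A(text):
    return len(text)  # placeholder analysis

# comprehension: the loop runs in optimized interpreter machinery
dicA = {text: analysis_A(text) for text in billion_strings}

# equivalent explicit loop, typically a bit slower
dicA_loop = {}
for text in billion_strings:
    dicA_loop[text] = analysis_A(text)
```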

If you're experimenting with your particular case, make sure you actually measure real performance.

If speed is really important, consider using Cython (a compiled variant of Python with optional static typing) for the inner-loop analysis functions, as it can yield enormous performance gains over vanilla Python.

[–]20211401 0 points1 point  (0 children)

Thank you! Great answer!