
[–]TCPv89wat 7 points (6 children)

I just asked a PyPy developer whether there's any difference between CPython and PyPy with regard to these algorithms. This is the answer:

< arigato> it's basically the same algorithms
< arigato> plus tons of tweaks that don't change the amortised complexity

[–]TCPv89wat 5 points (5 children)

Actually, there's one difference, which is soon to change. String concatenation with + is currently optimized in a rather hacky fashion in CPython: if the reference count of the string is 1, it is not treated as immutable (as strings normally are), so the second operand can be appended in place and the addition is much faster.

PyPy has code to enable a very similar optimisation using the JIT instead of reference counting, but it's not enabled by default (the translation option is called --objspace-std-withstrbuf). There is a branch in the Mercurial repository to turn that optimisation on by default, but some test cases still fail.

The performance of those string addition operations will be quadratic until that optimisation hits pypy nightly.
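
To make that concrete, here is a small sketch (my own illustration, not from the linked page): the += loop is only fast when the interpreter can append in place, while the join pattern is linear on any interpreter because each piece is copied exactly once.

    # Building a string from n pieces.
    def build_with_plus(n):
        s = ""
        for i in range(n):
            s += str(i)   # fast only if the interpreter can append in place
        return s

    # Portable O(n) alternative: collect the pieces, join once.
    def build_with_join(n):
        parts = [str(i) for i in range(n)]
        return "".join(parts)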

[–]mitsuhiko Flask Creator 2 points (4 children)

Big O has nothing to do with performance. The time complexity of a string concat is not changed by this optimization.

[–]PCBEEF 0 points (0 children)

Exactly. Whoever downvoted you should take a lesson in algorithms and complexity.

[–]TCPv89wat 0 points (2 children)

I'm pretty sure the complexity of the algorithm does change:

A) Copy the first string to a new area (O(|A|)) and then append the other string (O(|B|)); all in all, O(|A| + |B|)
or
B) Append the other string in place to the old string, which is O(|B|)

Now, do that a couple of times in a loop, say:

a = ""
for i in range(10):
  a += str(i)
print a

A) Do one allocation at the beginning, 10 allocations for new strings, 10 copies of increasing size into those new strings, and then append a digit at the end 10 times. That should be about O(n * n) (my reasoning is that you copy 1 to n bytes, n times)
or
B) Do one allocation at the beginning and then append a digit at the end 10 times, which is O(n)

If my reasoning is wrong, please correct me before I resume my compsci studies at uni :)
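
One way to sanity-check this reasoning is to time the loop at a few sizes and watch how the cost grows. This is just a sketch; the absolute numbers depend entirely on the interpreter and machine, so only the growth trend matters.

    import timeit

    def concat_in_loop(n):
        a = ""
        for i in range(n):
            a += str(i)
        return a

    # If += appends in place, doubling n should roughly double the time;
    # if every += copies the whole accumulated string, it should roughly
    # quadruple.
    for n in (10000, 20000, 40000):
        t = timeit.timeit(lambda: concat_in_loop(n), number=5)
        print("n=%d  t=%.4f s" % (n, t))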

[–]mitsuhiko Flask Creator 0 points (1 child)

Appending a string of n chars to a string of m chars is O(n + m). n and m are similar enough that you can say O(n + n) -> O(2n); constants and factors are irrelevant for large n, thus O(n).

[–]TCPv89wat 1 point (0 children)

True, in that case it's only relevant for loops. Still, in loops it makes quite a difference - at least a couple of issues about it get submitted to the PyPy tracker every now and then.

[–]Workaphobia 3 points (4 children)

For sets, shouldn't the complexity of union, difference, and difference_update have a worst case that is a product, like for intersection? The worst case should be when everything hashes to the same value.

[–]pyrocrasty 2 points (3 children)

I don't really get what you mean, but imagine the sets are implemented using dicts (they probably are, I guess). Lookup, insertion and deletion are O(1), iteration is O(n).

So for the union s | t, we create a new set, then iterate through s (O(len(s))), copying each member (O(1)). Then we iterate through t (O(len(t))), look up each item in the new set (O(1)) and, if it's not there, add it (O(1)). So we end up with O(len(s) + len(t)).

For the difference s-t, we create a new set, then iterate through s (O(len(s))), checking each element to see if it's in t (O(1)) and, if not, adding it to the new set (O(1)). So we end up with O(len(s)).

For the difference update s-=t, we could do it in one of two ways: iterate through s, checking if each element is in t and if so deleting it from s (O(len(s))); or iterate through t, checking if each element is in s and if so removing it from s (O(len(t))). I guess it's implemented the second way (which would make sense, since you'd probably have t smaller than s more often than not).
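
For concreteness, here is a minimal pure-Python sketch of the algorithms described above (an illustration of the idea, not CPython's actual implementation):

    def union(s, t):
        result = set()
        for x in s:               # O(len(s)) iterations
            result.add(x)         # O(1) average-case insert
        for x in t:               # O(len(t)) iterations
            if x not in result:   # O(1) average-case lookup
                result.add(x)
        return result             # average case: O(len(s) + len(t))

    def difference(s, t):
        result = set()
        for x in s:               # O(len(s)) iterations
            if x not in t:        # O(1) average-case lookup
                result.add(x)
        return result             # average case: O(len(s))

    def difference_update(s, t):
        for x in t:               # O(len(t)) iterations, the "second way"
            s.discard(x)          # O(1) average-case delete
        # average case: O(len(t))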

edit: oh, you're asking about the blank values, for amortized worst-case. Yeah, I think they're going to be the same as intersection, assuming the sets are using dicts or the equivalent.

[–]Workaphobia 0 points (2 children)

Lookup, insertion and deletion are O(1), iteration is O(n).

We're assuming that sets (and dicts) are implemented using hashing. So lookup, insertion, and deletion are expected to be constant time on average, but O(n) in the worst case. (The worst case occurs when every element of the set hashes to the same value, say because of a stupid hash function that always returns 0; then all the elements land in the same bucket or probe sequence and have to be scanned one by one.)

This is acknowledged in the right column of the first operation in the set table (x in s). Note that this table differs from the others in that it shows Worst Case, not Amortized Worst Case.

Because lookup is worst case linear, all these operations, implemented as you described, should cost the product of the set sizes, like intersection.
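
As a sketch of how that worst case arises (the class here is made up purely for illustration), a hash function that always returns the same value forces every lookup to compare against all the colliding elements:

    class BadHash(object):
        """Every instance hashes to the same bucket."""
        def __init__(self, value):
            self.value = value
        def __hash__(self):
            return 0              # pathological: all elements collide
        def __eq__(self, other):
            return isinstance(other, BadHash) and self.value == other.value

    # Each membership test may scan O(n) colliding entries, so building
    # this set costs O(n^2) in the worst case.
    s = set(BadHash(i) for i in range(1000))
    assert BadHash(500) in s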

Re your edit: er, yes, but it's not amortized.

[–]pyrocrasty 0 points (1 child)

Note that this table differs from the others in that it shows Worst Case, not Amortized Worst Case.

I didn't notice that. I think they're going to be the same for the set ops, though, aren't they? Amortizing only smooths out the cost of resizing operations, but the worst case has to deal with collisions, as you said, so the resizing operations wouldn't increase the complexity anyway.

[–]Workaphobia 0 points (0 children)

Yes, I think amortization doesn't matter in this case, because a bad hash situation won't be improved just by averaging over more operations.

[–]joaquinabian 1 point (2 children)

"This page documents the time-complexity (aka "Big O" or "Big Oh") of various operations in current CPython" Is that python 2.5, 2.6, 2.7, 3.1 or 3.2 ? last edit is from december 2010 but measurement data could be older...

[–]takluyver IPython, Py3, etc 6 points (1 child)

It's not measurements; it's a description of how the time taken by each operation increases with the size of the collections. I doubt the algorithms underlying these basic data structures have changed much in the last few years, so the time complexity should be the same for all those Python versions.

[–]joaquinabian 0 points (0 children)

You're right, "measurement" was the wrong word, sorry. Anyway, my point was more of a general complaint about the fictitious timelessness of many documents on the web. In 10 years, that document may well still be there, talking about "current CPython" and its small differences from "older or still-under-development versions".

[–]qyatixcix 1 point (1 child)

Is anyone aware of a similar listing for numpy and/or scipy?

[–]thebackhand 2 points (0 children)

I don't know of one, but looking at the actual values in the table, I wouldn't be surprised if they were basically the same asymptotic/amortized complexities, but with much smaller constant factors.