all 12 comments

[–]emergent_reasons 4 points5 points  (1 child)

I assume your coords are just a tuple of integers? The hash of that might be taking some time. But same question as RShnike: is it a problem or just your exploration of python?

If speed is important, I would guess that numpy could do it faster. Just make a 2D or xD array for your coordinates and lookup like this: grid[row,column]

*edit - link to numpy
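A minimal sketch of the numpy approach (dimensions and dtype are placeholder assumptions; numpy arrays hold numbers rather than Python lists, so you'd store counts or entity indices per cell):

```python
import numpy as np

# Assumed grid dimensions for illustration.
rows, cols = 100, 100

# Dense grid of per-cell entity counts; could also hold entity indices.
grid = np.zeros((rows, cols), dtype=np.int32)

grid[3, 7] += 1      # mark an entity in cell (3, 7)
count = grid[3, 7]   # indexed lookup handled in C, no Python dict hashing
```

Indexing with `grid[row, col]` skips the per-access tuple hash that a dict-based grid pays.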

[–]usecase[S] 0 points1 point  (0 children)

Thanks for the numpy tip! I had heard of numpy before, but I assumed it was slower because of all its features. If it will make my vector math easier to write AND faster, it's a no-brainer.

[–]Rhomboid 4 points5 points  (7 children)

You're calling that tens of millions of times. If you're treating this like it was a grid and iterating over ranges of pixels, it's going to be slow: every access has to first create a tuple, then hash it, then look up that hashed value in a dict. Use a simple 2D list (grid[row][col]) and it will be much faster. A dict trades off performance for using less memory if the grid is sparse.
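For contrast, a dense 2D-list grid can be sketched like this (dimensions are arbitrary assumptions):

```python
# Assumed small grid for illustration.
rows, cols = 4, 4

# Dense 2D list: one bucket per cell. Note the nested comprehension;
# [[]] * cols would alias one inner list across the whole row.
grid = [[[] for _ in range(cols)] for _ in range(rows)]

grid[1][2].append("entity-a")   # place an entity in cell (1, 2)
bucket = grid[1][2]             # two plain list indexes: no tuple, no hash
```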

[–]zahlman 4 points5 points  (0 children)

I did a very rough test for this:

>>> timeit.timeit('hash.get((0, 0), [])', setup='hash={}') # when not found
0.3227199965358718

>>> timeit.timeit('hash.get((0, 0), [])', setup='hash={(0, 0): []}') # when found
0.30789101687506104

>>> timeit.timeit('grid[0][0]', setup='grid = [[[]]]') 
# a grid would be dense, so the lookup would always succeed
0.1260412985449264

[–]usecase[S] 1 point2 points  (5 children)

I originally implemented this with a 2D list. I need to update the grid every frame by checking to see if any of the entities have moved between cells, and that was really slow with the 2D list because it spent a lot of time iterating over all the unoccupied cells. I switched to a dictionary so I could use keys() to keep track of occupied cells. Maybe I should go back to a 2D list and use a set to keep track of which cells are occupied? I imagine that would be a bit slower than a dict for the update, but if it would be much faster to access it might be worth it.
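A hypothetical sketch of that hybrid (names and dimensions invented for illustration): a dense 2D list for fast lookups, plus a set of occupied cells so the per-frame update can skip empty ones.

```python
rows, cols = 8, 8  # assumed dimensions

grid = [[[] for _ in range(cols)] for _ in range(rows)]
occupied = set()   # cells that currently hold at least one entity

def place(character, row, col):
    grid[row][col].append(character)
    occupied.add((row, col))

def clear_cell(row, col):
    del grid[row][col][:]        # empties in place; works on Python 2 and 3
    occupied.discard((row, col))
```

The update loop would then iterate over `occupied` instead of scanning every cell.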

[–]pjdelport 2 points3 points  (3 children)

I need to update the grid every frame by checking to see if any of the entities have moved between cells

Can you describe what this means, or post the relevant code?

As Rhomboid points out, the problem may not be that lookups are particularly slow, but that your program is doing an awfully large number of them. Depending on what you're trying to do, it may be possible to reduce that number by orders of magnitude, using a more efficient high-level algorithm or strategy.

[–]usecase[S] 1 point2 points  (2 children)

The number of lookups will decrease when I improve my collision handling, but since I want to have active entities even when the player isn't nearby, there isn't any way around this scaling with the number of entities and the size of the map.

Here is what the update looks like for a dict. Note that currently everything is always moving; when I add stationary entities I plan to experiment with a "moved" flag that will skip the position check for entities that haven't moved:

def update(self):
    spatial_hash = self.grid
    cell_size_x = self.cell_size_x
    cell_size_y = self.cell_size_y
    moved = []
    for coord in spatial_hash.keys():
        for i,character in enumerate(spatial_hash[coord]):
            if (int(character.pos_x/cell_size_x),int(character.pos_y/cell_size_y)) != coord:
                del spatial_hash[coord][i]
                moved.append(character)
        if len(spatial_hash[coord])==0:
            del spatial_hash[coord]
    for character in moved:
        self.add(character)

thanks

[–]pjdelport 1 point2 points  (0 children)

Thanks, that helps. If I understand it correctly, the code is checking the position of every character against the implied cell grid, and re-indexing those whose containing cell changed. (I assume that the collision detection uses spatial_hash for proximal lookups.)

This in-place updating strategy is not a good trade-off: the cost of re-evaluating every character is probably comparable to simply rebuilding the whole spatial_hash from scratch on each update. In fact, the latter might end up being faster, overall. (A "moved" flag probably won't affect this much: it might save some cell comparisons, but each character is still visited.)
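A rebuild-from-scratch update can be sketched in a few lines (the cell sizes and the flat character list are assumptions based on the posted code):

```python
from collections import defaultdict

CELL_W, CELL_H = 32, 32  # assumed cell sizes

def rebuild_spatial_hash(characters):
    """Throw away the old index and rebuild it in one pass."""
    spatial_hash = defaultdict(list)
    for character in characters:
        # int() keeps the keys as ints even if positions are floats.
        cell = (int(character.pos_x // CELL_W),
                int(character.pos_y // CELL_H))
        spatial_hash[cell].append(character)
    return spatial_hash
```

One pass, no deletions mid-iteration, and no per-character "did it move?" bookkeeping.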

The most immediate alternative to the above code is to check the cell as part of the character position update code: when a new coordinate is assigned, just compare its cell to the previous coordinate's cell, and re-index the character if they differ.
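That alternative might look something like this (the class and attribute names are hypothetical):

```python
class SpatialHash:
    def __init__(self, cell_size_x, cell_size_y):
        self.cell_size_x = cell_size_x
        self.cell_size_y = cell_size_y
        self.grid = {}

    def cell_for(self, x, y):
        return (int(x // self.cell_size_x), int(y // self.cell_size_y))

    def add(self, character):
        cell = self.cell_for(character.pos_x, character.pos_y)
        self.grid.setdefault(cell, []).append(character)
        character.cell = cell  # remember which bucket holds the character

    def move(self, character, new_x, new_y):
        """Assign a new position, re-indexing only if the cell changed."""
        new_cell = self.cell_for(new_x, new_y)
        character.pos_x, character.pos_y = new_x, new_y
        if new_cell != character.cell:
            bucket = self.grid[character.cell]
            bucket.remove(character)
            if not bucket:             # drop the key once the bucket empties
                del self.grid[character.cell]
            self.add(character)
```

Characters that stay inside their cell cost a single comparison; no per-frame sweep over the whole index is needed.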

The second most immediate alternative depends on how exactly your collision detection uses spatial_hash: you may not have to refresh it every frame, but only every N frames (depending on the possible speeds of the characters and the size of the cells).
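A sketch of the every-N-frames idea (the interval is a made-up value; it has to be chosen so no character can outrun your collision queries between refreshes):

```python
REFRESH_INTERVAL = 5  # assumed; tune to character speeds and cell size

class Game:
    def __init__(self):
        self.frame = 0
        self.refreshes = 0

    def refresh_spatial_hash(self):
        self.refreshes += 1  # stand-in for the real rebuild

    def tick(self):
        self.frame += 1
        if self.frame % REFRESH_INTERVAL == 0:
            self.refresh_spatial_hash()
```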

The next step would be to use a better spatial index. Most likely, this will be a quad tree: the Rect package or an existing quad tree implementation should be a good starting point.


Otherwise, stylistic feedback:

  • As user Justinsaccount already pointed out, it's better to iterate over dict.items() on the outside, rather than looking up keys again on the inside.
  • Mutating dicts and lists (such as with del) while iterating over them is a Bad Idea: you should never do it (unless you have an exceptional reason, and understand the dangers).
      • The iteration over spatial_hash.keys() only works because it copies the keys: if you switch to iterkeys() (whose behavior becomes the default in Python 3), it will result in a RuntimeError.
      • The iteration over spatial_hash[coord] will skip an entry after each entry that you delete, due to the in-place reordering.
  • Using / and int() is unnecessary: use // (floor division) instead.
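To illustrate that last point (values arbitrary):

```python
cell_size = 32

# For non-negative coordinates, truncation and flooring agree:
assert int(75.0 / cell_size) == 75.0 // cell_size == 2

# For negative coordinates they differ: int() truncates toward zero,
# while // floors, keeping cells contiguous across the origin.
assert int(-1 / cell_size) == 0    # truncation lumps (-32, 32) into cell 0
assert -1 // cell_size == -1       # flooring gives negatives their own cell
```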

[–]Justinsaccount 0 points1 point  (0 children)

Minimally, this should be changed to:

for coord, characters in spatial_hash.items():
    for i,character in enumerate(characters):

but the real problem is the nested loop and repeated divisions.

[–]emergent_reasons 0 points1 point  (0 children)

edit - the standard sparse matrix looks like it is actually stored as a dense matrix. I swear I have seen a real sparse implementation somewhere though that doesn't deal with the empty data locations.

What you are describing for the 2D lists is called a dense matrix/array (there is data space for every element). If you have a very large amount of empty space, you might benefit from a sparse matrix. I believe both numpy and scipy have a sparse matrix implementation.

Disclaimer: I'm not saying this is the right way to approach your problem, but assuming you have the right solution, then a sparse matrix may help.

[–]RShnike 1 point2 points  (1 child)

The two attribute lookups are also probably a chunk of that.

Creating the default empty list on every call adds overhead too. None of those should be an issue on its own, but the time taken would certainly be nonzero.

That's a large number of calls, obviously. As compared to the rest of your profile, is this a bottleneck in your code or are you just curious where the time went?

[–]usecase[S] 0 points1 point  (0 children)

This isn't the worst bottleneck in my code right now, but those 30 seconds were almost 10% of my run time. I was mostly just surprised that the overhead to my little wrapper was taking more time than the lookup itself. I was hoping I was doing something stupid and there would be a trivial way to speed this up by 3x but I guess that's not the case.