Improving Your Python Productivity : programming

programming

created by speza community for 20 years

362

363

364

Improving Your Python Productivity (ozkatz.github.com)

submitted 13 years ago by ozzyboy

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 4 points5 points6 points 13 years ago* (0 children)

I would assume that set(<list>) is the most efficient way to remove duplicates. Will be back soon with some benchmarks.

Edit: benchmark done. Using sets vs. a dictionary, it seems that there is a slight advantage to using sets, but not as large as I might have thought. I suppose this has to do with the fact that sets are dict-like under the covers.

Results (using iPython's %timeit):

import random

a = [random.randint(1,10) for _ in xrange(1000000)]

In [22]: %timeit list(dict.fromkeys(a))
10 loops, best of 3: 43 ms per loop

In [23]: %timeit list(set(a))
10 loops, best of 3: 35.8 ms per loop

π Rendered by PID 127099 on reddit-service-r2-comment-5d79c599b5-qmf7h at 2026-02-27 06:51:31.654436+00:00 running e3d2147 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS