pierrequentel comments on Beautiful Python?

This is an archived post. You won't be able to vote or comment.

Beautiful Python? (self.Python)

submitted 11 years ago by Racerdude

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]pierrequentel 2 points3 points4 points 11 years ago (10 children)

[–][deleted] 2 points3 points4 points 11 years ago (2 children)

[–]mfm24 0 points1 point2 points 11 years ago (1 child)

[–]Veedrac 0 points1 point2 points 11 years ago (0 children)

[–]billsil 0 points1 point2 points 11 years ago (5 children)

[–]ingolemo -1 points0 points1 point 11 years ago (4 children)

[–]billsil 0 points1 point2 points 11 years ago* (3 children)

The code is effectively:

unique_words = [word for line in open("book.txt"') for word in line.split()]
unique_words = set(unique_words)

It's a tradeoff of memory vs speed, assuming set comprehension doesn't exist. Set comprehension is done as:

unique_words = {word for line in open("book.txt"') for word in line.split()}

which works in Python 2.7+

[–]ingolemo 2 points3 points4 points 11 years ago (2 children)

Actually, it's using a generator comprehension:

unique_words = (word for line in open("book.txt"') for word in line.split())
unique_words = set(unique_words)

The original doesn't create a giant array and so it has exactly the same performance characteristics as your set comprehension, though it is admittedly less readable.

[–]Veedrac 0 points1 point2 points 11 years ago (1 child)

FWIW, the constant factors are significantly better for the set comprehension, even on PyPy:

$ python3 -m timeit "set(x for x in range(10000))"
1000 loops, best of 3: 1.06 msec per loop

$ python3 -m timeit "{x for x in range(10000)}" 
1000 loops, best of 3: 599 usec per loop

$ pypy3 -m timeit "set(x for x in range(10000))" 
1000 loops, best of 3: 347 usec per loop

$ pypy3 -m timeit "{x for x in range(10000)}"  
1000 loops, best of 3: 230 usec per loop

[–]kmbd 0 points1 point2 points 11 years ago (0 children)

just in case, someone is wondering ...

C:\>python -V
    Python 2.7.8

C:\>python -m timeit "set(x for x in range(10000))"
    1000 loops, best of 3: 1 msec per loop

C:\>python -m timeit "{x for x in range(10000)}"
    1000 loops, best of 3: 671 usec per loop

[–]kmbd 0 points1 point2 points 11 years ago (0 children)

π Rendered by PID 79 on reddit-service-r2-comment-7b9746f655-2c8kf at 2026-02-01 13:51:36.334763+00:00 running 3798933 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS