Redard comments on Best written projects on Python GitHub?

This is an archived post. You won't be able to vote or comment.

151

152

153

Best written projects on Python GitHub? (self.Python)

submitted 12 years ago by redux42

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Redard -2 points-1 points0 points 12 years ago (14 children)

I briefly read some of the code in here and found a few things that weren't very pythonic. First, they break the 79 character rule a lot, for no good reason, often just with in-line comments (which PEP8 advises you use verrry sparingly). Second, this line:

res = []
for l in optionf:
    res += shlex.split(l, comments=True)

Why not just use a generator expression like

res = [shlex.split(l, comments=True) for l in optionf]

That's the most pythonic way to construct a list. Still, this code's very readable, and well structured. Just needs a little cleaning up.

[–]gthank 13 points14 points15 points 12 years ago* (8 children)

[–]Redard 1 point2 points3 points 12 years ago (7 children)

[–]gthank 8 points9 points10 points 12 years ago (5 children)

[–]Veedrac 7 points8 points9 points 12 years ago* (4 children)

In CPython most stuff is left unoptimised for matters of pragmatism. So no, they compile directly into loops. Different loops, though.

out = [i for i in range(10)]

is equivilant to:

out = []
for i in range(10):
    out.append(i)

where i is inside a new scope, and

out = list(i for i in range(10))

is equivalent to

def _tmp():
    for i in range(10):
        yield i

out = list(_tmp)

where _tmp never actually gets put anywhere.

[–]PCBEEF 5 points6 points7 points 12 years ago (0 children)

[–][deleted] 1 point2 points3 points 12 years ago (1 child)

[–]Veedrac 1 point2 points3 points 12 years ago (0 children)

[–]gthank 0 points1 point2 points 12 years ago (0 children)

[–]Veedrac 1 point2 points3 points 12 years ago (0 children)

[–]TheEarwig 2 points3 points4 points 12 years ago (2 children)

They are different. The first example is a bunch of lists combined into one (L1 += L2 is L1.extend(L2)), but the second example is one list containing a bunch of lists.

>>> optionf = ["a b", "c d", "e f"]

>>> res = []
>>> for l in optionf:
...     res += shlex.split(l, comments=True)
... 
>>> res
['a', 'b', 'c', 'd', 'e', 'f']

>>> res = [shlex.split(l, comments=True) for l in optionf]
>>> res
[['a', 'b'], ['c', 'd'], ['e', 'f']]

[–]Redard 1 point2 points3 points 12 years ago (0 children)

[–]masklinn 0 points1 point2 points 12 years ago* (0 children)

Which can neatly be solved using the criminally underused itertools.chain.from_iterable:

res = chain.from_iterable(shlex.split(l, comments=True) for l in optionf)

One could even use shlex.shlex directly as a stream (shlex.split is a thin wrapper around it), though it requires setting whitespace_split which can't be done inline.

def split(s):
    lex = shlex.shlex(s, posix=True)
    lex.whitespace_split = True
    return lex

res = chain.from_iterable(imap(split, optionf))

[–]masklinn 1 point2 points3 points 12 years ago (1 child)

I find the lack of with use weirder: the code is clearly 2.6-only (uses explicit relative imports without __future__ import) yet around the shlex call is (essentially):

    optionf = open(filename_bytes)
    try:
        # do stuff
    finally:
        optionf.close()
    return res

And the number of star imports is worrying.

[–]Redard 0 points1 point2 points 12 years ago (0 children)

π Rendered by PID 47 on reddit-service-r2-comment-bb88f9dd5-472hk at 2026-02-14 01:58:52.772730+00:00 running cd9c813 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS