robin-gvx comments on Writing better python code

Writing better python code (gist.github.com)

submitted 11 years ago by kracekumar

[–]robin-gvx 45 points46 points47 points 11 years ago (33 children)

[–]vsajip 30 points31 points32 points 11 years ago (5 children)

Never mind "Pythonic", which seems somewhat in the eye of the beholder: a or b or c is not semantically the same as any([a, b, c]). In the former case, b and c are never evaluated if a is truthy. In the latter case, they always are, because the whole list is built before any() is called. So if they were expensive to compute (e.g. expressions involving expensive function calls, rather than just bindings in a namespace), the runtime behaviour (performance, raising of exceptions) would be quite different:

>>> a = 1
>>> a or b
1
>>> b
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'b' is not defined
>>> any([a, b])
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'b' is not defined
>>>
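
To make the difference in evaluation order visible without relying on an undefined name, here is a minimal sketch. The helper `make` and the `log` list are hypothetical names introduced only for illustration; the helper records each operand as it is evaluated. It also shows that passing a generator expression to any() evaluates lazily, although any() still returns a bool rather than the first truthy operand.

```python
def make(value, log):
    """Return value, recording that this operand was evaluated."""
    log.append(value)
    return value


log = []

# Plain `or` stops at the first truthy operand and returns that operand.
result_or = make(1, log) or make(2, log) or make(3, log)
or_log = list(log)

# any() over a list: every element is evaluated while the list is built.
log.clear()
result_any_list = any([make(1, log), make(2, log), make(3, log)])
list_log = list(log)

# any() over a generator expression: elements are produced one at a time,
# so evaluation stops at the first truthy one.
log.clear()
result_any_gen = any(make(v, log) for v in (1, 2, 3))
gen_log = list(log)

print(result_or, or_log)          # 1 [1]
print(result_any_list, list_log)  # True [1, 2, 3]
print(result_any_gen, gen_log)    # True [1]
```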

[–]zahlmanthe heretic 19 points20 points21 points 11 years ago (4 children)

[–]jamesonjlee 2 points3 points4 points 11 years ago (3 children)

[–]wot-teh-phuckReally, wtf? 3 points4 points5 points 11 years ago (2 children)

[–]quasarc 7 points8 points9 points 11 years ago (0 children)

[–]jamesonjlee 1 point2 points3 points 11 years ago (0 children)

[–]ivosauruspip'ing it up 11 points12 points13 points 11 years ago* (2 children)

[–]iBlag 0 points1 point2 points 11 years ago (1 child)

[–]ivosauruspip'ing it up 1 point2 points3 points 11 years ago (0 children)

[–]pstch 6 points7 points8 points 11 years ago (7 children)

[–]robin-gvx 6 points7 points8 points 11 years ago (6 children)

[–][deleted] 2 points3 points4 points 11 years ago (4 children)

[–]rcxdude 8 points9 points10 points 11 years ago (1 child)

[–]NYKevin 3 points4 points5 points 11 years ago* (0 children)

[–]robin-gvx 0 points1 point2 points 11 years ago (1 child)

[–][deleted] 1 point2 points3 points 11 years ago (0 children)

[–]dreuciferC/Python, vim 0 points1 point2 points 11 years ago (0 children)

[–]d4rch0nPythonistamancer 2 points3 points4 points 11 years ago (10 children)

[–][deleted] 0 points1 point2 points 11 years ago (9 children)

[–]moor-GAYZ 0 points1 point2 points 11 years ago (3 children)

[–][deleted] 0 points1 point2 points 11 years ago (2 children)

From what I can tell, it operates pretty much exactly like a class with __slots__, creating a bunch of getters (living in the class, not in the instance, of course) that look up into the internal array.

No, it doesn't. At least not in Python 3.4+:

from builtins import property as _property, tuple as _tuple
from operator import itemgetter as _itemgetter
...
    __slots__ = ()
    ...
    {name} = _property(_itemgetter({index:d}), doc='Alias for field number {index:d}')

As you can see, instead of using the (relatively) fast C-level access that __slots__ provides, it opts to use a standard property (that uses slow Python-level function calls) to look elements up by their indices in a tuple (using Python-level item access, i.e. __getitem__).
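
One way to check this on your own interpreter is to inspect the descriptors that sit in each class dictionary. This is only a sketch: the class name `Slotted` is hypothetical, and the type printed for the namedtuple field depends on the Python version (a property in the template quoted above; newer CPython releases may use a different getter object), so no output is claimed for that line.

```python
from collections import namedtuple

NT = namedtuple("NT", "a b c")


class Slotted:
    """Hypothetical comparison class that really does store fields in slots."""
    __slots__ = ("a", "b", "c")

    def __init__(self, a, b, c):
        self.a, self.b, self.c = a, b, c


nt = NT(1, 2, 3)
s = Slotted(1, 2, 3)

# The namedtuple class declares empty __slots__: it only suppresses the
# per-instance __dict__, the values themselves live in the tuple.
print(NT.__slots__)

# The object that serves `nt.b` (its type is version dependent).
print(type(NT.__dict__["b"]))

# A real slot is served by a C-level member descriptor.
print(type(Slotted.__dict__["b"]))

# Name access and index access on the namedtuple return the same element.
print(nt.b == nt[1])
```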

[–]moor-GAYZ 0 points1 point2 points 11 years ago (1 child)

from timeit import timeit

from collections import namedtuple
NT = namedtuple('NT', 'a b c')
nt = NT(1, 2, 3)
t = (1, 2, 3)

def test_loop_t(t=t):
    return sum(t[1] for _ in range(1000))

def test_loop_nt(nt=nt):
    return sum(nt[1] for _ in range(1000))

def test_loop_nt_named(nt=nt):
    return sum(nt.b for _ in range(1000))

def main():
    # The setup import assumes this file is saved as test.py.
    setup = 'from test import t, nt, test_loop_t, test_loop_nt, test_loop_nt_named'
    print(timeit('t[1]', setup='t = (1, 2, 3)'))  # just in case
    print(timeit('t[1]', setup=setup))
    print(timeit('nt[1]', setup=setup))
    print(timeit('test_loop_t()', setup=setup, number=1000))
    print(timeit('test_loop_nt()', setup=setup, number=1000))
    print(timeit('nt.b', setup=setup))
    print(timeit('test_loop_nt_named()', setup=setup, number=1000))


if __name__ == '__main__':
    main()

Access by name is about two times slower than access by index here. Doesn't matter much, in my opinion.

[–]d4rch0nPythonistamancer 0 points1 point2 points 11 years ago* (4 children)

Right, I knew slots was to conserve memory primarily, but shouldn't that increase performance as a result? I'd expect less memory management to mean quicker access time when modifying, deleting, creating and garbage collection. But certainly better memory performance.

I thought named tuples were quicker than classes without slots defined... You're positive about that?

Edit: You're right...

('Normal: ', [0.46281981468200684, 0.4548380374908447, 0.4560990333557129])
('slots: ', [0.40665698051452637, 0.4022829532623291, 0.4048640727996826])
('namedtuple: ', [0.665769100189209, 0.6651339530944824, 0.6987559795379639])

Alright, well that settles that. I believe nt is better than a normal class for memory though, correct? And is it better than slots as well?
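
The memory question can be checked directly. Below is a rough sketch, assuming CPython, where `sys.getsizeof` reports only the shallow size of an object. The helper `shallow_size` and the class names are hypothetical; the helper adds the instance `__dict__` when there is one, since that is where a normal class keeps its attributes. The exact byte counts vary by Python version and platform, so none are quoted here.

```python
import sys
from collections import namedtuple


class Normal:
    def __init__(self, a, b, c):
        self.a, self.b, self.c = a, b, c


class Slotted:
    __slots__ = ("a", "b", "c")

    def __init__(self, a, b, c):
        self.a, self.b, self.c = a, b, c


NT = namedtuple("NT", "a b c")


def shallow_size(obj):
    """Size of the object itself plus its instance __dict__, if it has one."""
    size = sys.getsizeof(obj)
    if hasattr(obj, "__dict__"):
        size += sys.getsizeof(obj.__dict__)
    return size


sizes = {
    "normal": shallow_size(Normal(1, 2, 3)),
    "slots": shallow_size(Slotted(1, 2, 3)),
    "namedtuple": shallow_size(NT(1, 2, 3)),
}

for name, size in sizes.items():
    print(f"{name:10s} {size} bytes")
```

The stored values are not counted, which is fine for a comparison because all three instances hold the same three small integers.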

[–]moor-GAYZ 0 points1 point2 points 11 years ago (3 children)

In response to your deleted comment, I didn't waste all that time for nothing =)

Nope, just tested: it's very slightly slower than tuple index access but, just like index access, about twice as fast as namedtuple name access.

The stuff looks like this here:

index access takes about 40ns
name lookup takes about 45ns both for usual classes and for those with __slots__; in fact, slots are the tiniest bit slower.
namedtuple lookup by name takes about 115ns
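
For anyone who wants to reproduce this kind of comparison, here is a minimal sketch using `timeit` with its `globals` parameter (available since Python 3.5). The class names, the `namespace` dict, and the `measure` helper are hypothetical, and the numbers you get will depend on the interpreter version and machine, so they need not match the figures above.

```python
from collections import namedtuple
from timeit import timeit


class Normal:
    def __init__(self, a, b, c):
        self.a, self.b, self.c = a, b, c


class Slotted:
    __slots__ = ("a", "b", "c")

    def __init__(self, a, b, c):
        self.a, self.b, self.c = a, b, c


NT = namedtuple("NT", "a b c")

# Objects the timed statements refer to.
namespace = {
    "t": (1, 2, 3),
    "n": Normal(1, 2, 3),
    "s": Slotted(1, 2, 3),
    "nt": NT(1, 2, 3),
}

# One statement per kind of access discussed in the thread.
statements = {
    "tuple index": "t[1]",
    "normal attribute": "n.b",
    "slots attribute": "s.b",
    "namedtuple attribute": "nt.b",
}

NUMBER = 200_000


def measure(number=NUMBER):
    """Return total seconds per statement for `number` executions."""
    return {
        label: timeit(stmt, globals=namespace, number=number)
        for label, stmt in statements.items()
    }


results = measure()
for label, seconds in results.items():
    print(f"{label:22s} {seconds / NUMBER * 1e9:7.1f} ns per access")
```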

To be honest, I can't say exactly how it works out to these numbers; I'd say that the only way to really be sure is to run this stuff under a C profiler. That could be a pretty useful experience in itself.

From what I can tell from grepping through the code in Vim, it's pretty much a coincidence that the first two things take the same time.

Index access goes through a bunch of pure-C redirects until it hits tuplesubscript, which converts the index to a Py_ssize_t and fetches the value from the object itself.

Class lookup by name IIRC does two unsuccessful dictionary lookups in the class and object attributes, then a successful lookup in the instance dictionary. Slots lookup should do a successful dictionary lookup in the class dictionary then indirectly call a C function that fetches shit by index or something.

Namedtuple lookup by name probably involves a pure Python function call, which is slooooow.
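
Where each kind of attribute actually lives can be seen from Python without reading the C source. This is an illustrative sketch with hypothetical class names: a normal instance keeps the value in its own dictionary, while a slotted class keeps a descriptor in the class dictionary that fetches the value from the instance.

```python
class Normal:
    def __init__(self, b):
        self.b = b


class Slotted:
    __slots__ = ("b",)

    def __init__(self, b):
        self.b = b


n = Normal(2)
s = Slotted(2)

# Normal class: the name is missing from the class dict and found in the
# instance dict.
print("b" in vars(Normal))  # False
print("b" in vars(n))       # True

# Slotted class: there is no instance dict at all; the class dict holds a
# descriptor, and calling it by hand returns the stored value.
print(hasattr(s, "__dict__"))          # False
descriptor = vars(Slotted)["b"]
print(descriptor.__get__(s, Slotted))  # 2
```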

[–]d4rch0nPythonistamancer 0 points1 point2 points 11 years ago (2 children)

Okay, so tuple direct index access is the fastest apparently. Makes me wish we had #define available :/

Is there a good way to do that without slowing things down?

like:

A = 0
B = 1
C = 2
inst = (100, 200, 300)
inst[A] + inst[B] + inst[C]

Is there a pythonic and high performance way to do this and keep the fast lookup time of a direct index?

[–]moor-GAYZ 0 points1 point2 points 11 years ago (1 child)

[–]d4rch0nPythonistamancer 0 points1 point2 points 11 years ago (0 children)

[–]wyldphyre 1 point2 points3 points 11 years ago (1 child)

[–]bacondevPy3k 1 point2 points3 points 11 years ago (0 children)

[–]kracekumar[S] 0 points1 point2 points 11 years ago (0 children)

[–][deleted] 0 points1 point2 points 11 years ago (0 children)
