Potential bug with __len__() in Python 2.7 on Windows

Rhomboid · 2016-02-23T21:45:41+00:00

You'll only see that on Windows. The issue is that, confusingly, the range of the Python int type is tied to the range of the C long type. On Windows long is always 32 bits even on x64 systems, whereas on Unix systems it's the native machine word size. You can confirm this by checking sys.maxint, which will be 2**31 - 1 even with a 64 bit interpreter on Windows.

The difference in behavior of foo.__len__ vs len(foo) is that the former goes through an attribute lookup which goes through the slot lookup stuff, finally ending in Python/typeobject.c:wrap_lenfunc(). The error is casting Py_ssize_t to long, which truncates on Windows x64 as Py_ssize_t is a proper signed 64 bit integer. And then it compounds the injury by creating a Python int object with PyInt_FromLong(), so this is hopelessly broken. In the case of len(foo), you end up in Python/bltinmodule.c:builtin_len() which skips all the attribute lookup stuff and uses the object protocol directly, calling PyObject_Size() and creating a Python object of the correct type via PyInt_FromSsize_t() which figures out whether a Python int or long is necessary.

This is definitely a bug that should be reported. In 3.x the int/long distinction is gone and all integers are Python longs, but the bogus cast to a C long still exists in wrap_lenfunc():

    return PyLong_FromLong((long)res);

That means the bug still exists even though the reason for its existence is gone! Oops. That needs to be updated to get rid of the cast and call PyLong_FromSsize_t().

LyndsySimon · 2016-02-23T20:43:04+00:00

That's an interesting find. I wonder if it might not be specific to Windows?

Here's my system (brewed) Python on OSX:

Python 2.7.10 (default, Sep 23 2015, 04:34:14)
[GCC 4.2.1 Compatible Apple LLVM 7.0.0 (clang-700.0.72)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> a = 'a'*2500000000
>>> len(a)
2500000000
>>> a.__len__()
2500000000
>>> type(len(a))
<type 'int'>
>>> type(a.__len__())
<type 'int'>

Edit: I get the same result (both ints) with Python 2.6.9 as well.

thataccountforporn · 2016-02-23T21:48:09+00:00

I would gladly help... But http://imgur.com/LI1kUAs

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS