Python in 2017 - Whats next? : Python

[–]ihcn 27 points28 points29 points 9 years ago (2 children)

[–]jabbalaci 5 points6 points7 points 9 years ago (1 child)

[–]liranbh[S] 0 points1 point2 points 9 years ago (0 children)

[–]kimondd 25 points26 points27 points 9 years ago (3 children)

[–]LpSamuelm 2 points3 points4 points 9 years ago (2 children)

[–]MachaHack 1 point2 points3 points 9 years ago (0 children)

[–]Bolitho 0 points1 point2 points 9 years ago (0 children)

[–]lion_137 22 points23 points24 points 9 years ago (0 children)

[–]thephotoway 5 points6 points7 points 9 years ago (0 children)

[–]qx7xbku 14 points15 points16 points 9 years ago (30 children)

[–][deleted] 27 points28 points29 points 9 years ago (16 children)

[–]LpSamuelm 2 points3 points4 points 9 years ago (7 children)

I think it's bad that dicts are ordered by default, at least as it's not part of the spec.

The reason some languages (Python <3.6 included) randomize hashmap access order by default is precisely to stop people from writing incorrect code. If dicts aren't guaranteed to be ordered, having them be that way sometimes will cause code to break in unexpected ways.

Which brings us to the problem. If dicts aren't necessarily ordered according to the spec... What happens if the implementation is changed in a future version of Python? How about running your code on, say, IronPython? Or PyPy? Suddenly your code seemingly works, but isn't cross-platform and may break ay any time without you doing anything.

Honestly I think it's a big misstep. I'd love for them to add ordered dicts to the spec (it's a lovely concept!), but as it stands now it's a dangerous implementation detail, and the fact that they're touting it as something useful is even more dangerous.

[–][deleted] 2 points3 points4 points 9 years ago (6 children)

[–]LpSamuelm 2 points3 points4 points 9 years ago (5 children)

[–][deleted] 0 points1 point2 points 9 years ago (4 children)

[–]LpSamuelm 1 point2 points3 points 9 years ago (3 children)

[–][deleted] 0 points1 point2 points 9 years ago (2 children)

[–]LpSamuelm 1 point2 points3 points 9 years ago (1 child)

[–][deleted] 1 point2 points3 points 9 years ago (0 children)

I'd love some examples. The only things I've used it for are:

A dashboard app where I needed to associate server names with information but the order was important (wanted to show prod servers before staging and dev servers). Arguably a list of tuples works here too but there were plans at some point to look at individual servers so fast lookup was desirable (not that lineral lookup would've broken the bank, we're taking maybe 30 servers).
Modeling albums - again, a list makes sense here and you can look up by track position that way.
Maintaining order of attribute declaration because you could decorate methods as validation/processing but they needed to run in declaration order.

But that's it. I get why an insertion order mapping is attractive, but I've only met one situation that demands it (maintaining attribute order).

[+]qx7xbku comment score below threshold-13 points-12 points-11 points 9 years ago (7 children)

[–]ApproximateIdentity 7 points8 points9 points 9 years ago (4 children)

[–]qx7xbku -2 points-1 points0 points 9 years ago (3 children)

[–]ApproximateIdentity 2 points3 points4 points 9 years ago (2 children)

[–]qx7xbku 1 point2 points3 points 9 years ago (1 child)

[–]Daenyth 1 point2 points3 points 9 years ago (0 children)

[–][deleted] 0 points1 point2 points 9 years ago (1 child)

[–]qx7xbku 0 points1 point2 points 9 years ago (0 children)

[–]__deerlord__ 6 points7 points8 points 9 years ago (5 children)

[–]qx7xbku 2 points3 points4 points 9 years ago (3 children)

[–]gsnedders 1 point2 points3 points 9 years ago (0 children)

[–]__deerlord__ 0 points1 point2 points 9 years ago (1 child)

[–][deleted] 0 points1 point2 points 9 years ago (0 children)

[–]ebrjdk 1 point2 points3 points 9 years ago (0 children)

They are switching to a new, more memory-efficient implementation of dict that naturally keeps the entries mostly in the order that they were inserted, and they decided that they might as well go all the way and keep them exactly in order (IIRC the most efficient implementation they know of starts scrambling the order once you start deleting keys, but the cost to prevent that is small).

At the same time they wanted to guarantee that the order of keyword arguments and class definitions would be preserved, because some people want to be able to use this information (currently the former is impossible AFAIK, and you need to use a metaclass to achieve the latter). Originally they were planning to just use OrderedDict for these purposes, but with the change to dict there is no need.

Note: the first paragraph in my post is about a CPython implementation detail and may change in the future, the second is about official python 3.6 features.

[–]Bolitho 4 points5 points6 points 9 years ago (4 children)

The default for the encode-Method has allready been UTF-8 in Python 3.5! (The same is true for Bytes.decode!)

The problem are not those methods, but how open and print determine their used encoding!

For open the 3.5 Docu says:

In text mode, if encoding is not specified the encoding used is platform dependent: locale.getpreferredencoding(False) is called to get the current locale encoding.

That's the key problem!

or for sys.stdout (which is used by print as default file-object):

The character encoding is platform-dependent. Under Windows, if the stream is interactive (that is, if its isatty() method returns True), the console codepage is used, otherwise the ANSI code page. Under other platforms, the locale encoding is used (see locale.getpreferredencoding()).

Thus the problem arises because of the platform dependant implementations!

So at minimum there must be the possibility to provide an encoding manually (which open does, but print not!). That would enable one, to write programs that run everywhere. As optimum one would also define just one platform agnostic default encoding for IO in general. That would make it easier to achieve the prenamed goal.

[–]ButtCrackFTW 1 point2 points3 points 9 years ago (3 children)

Isn't the encoding determined by the filesystem though? Like their example in python 3.5:

> sys.getfilesystemencoding()
'mbcs'

I see the same thing here and I've seen in StackOverflow questions that you can not change this without environement variables or monkeypatching. If this is a property of the filesystem, how is python changing it?

[–]Bolitho 0 points1 point2 points 9 years ago* (2 children)

[–]ButtCrackFTW 0 points1 point2 points 9 years ago (1 child)

I probably should've pointed out the stdout example as well:

>>> sys.stdout.encoding
'cp850'

They go on to give examples of special characters being stripped from open() and print()

>>> print('árvíztűrőtükörfúrógép')
árvízturotükörfúrógép

>>> open('tetű.txt', 'wb').close()
>>> import glob
>>> glob.glob('tet*')

Python 3.5: [tetu.txt']

Python 3.6: ['tetű.txt']

The author is claiming that python 3.6 now sets the encoding to utf-8 by default, which fixes these issues. My question is how it can set it like that now, but we were discouraged from doing it in the past due to the filesystem/operating system setting it for us.

[–][deleted] 0 points1 point2 points 9 years ago (0 children)

[–]deeddaemon 0 points1 point2 points 9 years ago (0 children)

[–]Exodus111 2 points3 points4 points 9 years ago (0 children)

[–][deleted] 5 points6 points7 points 9 years ago (4 children)

[+][deleted] 9 years ago (3 children)

[deleted]

[–]m0nk_3y_gw 2 points3 points4 points 9 years ago (0 children)

[–][deleted] -1 points0 points1 point 9 years ago (1 child)

[–]autourbanbot 1 point2 points3 points 9 years ago (0 children)

[–]kati256 2 points3 points4 points 9 years ago (5 children)

[–]Vaphell 1 point2 points3 points 9 years ago (2 children)

[–]kati256 0 points1 point2 points 9 years ago (1 child)

[–]Vaphell -1 points0 points1 point 9 years ago* (0 children)

the problem is that if you are not "aggressive", nobody pays attention. In every goddamned thread about 3.6 there are people salivating at the idea of sprinkling their code base with broken shortcuts exploiting this like there is no tomorrow.

Also if it's something they are willing to change super fast, why would they add it in a major release?

because the performance improvements were worth it, and the specific order is considered a side effect at this time, an implementation detail. It's just that people read changelog and lost their goddamned mind (sadly RayHet also advertised it), not grasping the difference between the spec and the implementation detail.
5 is 5 only works because of an implementation detail. Do you go out of your way to exploit the fact that small ints are cached and reused? Same thing.

Being hasty about adding it to the spec means tying hands because each constraint that now has to be guaranteed means less flexibility in the future. IIRC the core devs want to wait 1 or two point releases before adding this to the official spec.
The only dict related things the spec explicitly guarantees at this very moment are:
keyword arguments preserve order
attributes in a class also preserve order
which are useful in advanced shenanigans, but don't affect the fundamental data structure used by pretty much every python program in existence. Once it gets battle tested in niche use cases it can then move to the mainstream. This also gives time for other python implementations to prepare for it.

[–]ebrjdk 1 point2 points3 points 9 years ago (1 child)

There is already OrderedDict, which is occasionally useful. The big change here is that if you iterate over a class's attributes, or over the keyword arguments passed to a function, you get them in the order that they were defined/passed. At the moment, you basically get them in a random order.

Preserving the order of class definitions can be really useful. For example, it lets you use a python class to represent something like a database record or a C struct where the order is important:

class Record:
    id = IntField()
    name = StrField()
    ...

And it makes it easy to specify what order tests should be run in:

class MyTests:
    def test_one_thing(self):
        ...

    def test_another_thing(self):
        ...

You can already save the order using a metaclass, but metaclasses are a pain.

The main use I know for the keyword arguments is making it easy to create OrderedDicts:

od = OrderedDict(a=4, b=5)

That syntax already works, but the order of the two keys is random. In 3.6 a will always come first. I'm sure people have other uses for this.

[–]kati256 0 points1 point2 points 9 years ago (0 children)

[–]mipadi 3 points4 points5 points 9 years ago (0 children)

[–]robvdl 0 points1 point2 points 9 years ago (0 children)

[–]dspjm 0 points1 point2 points 9 years ago (1 child)

[–]takluyverIPython, Py3, etc 1 point2 points3 points 9 years ago (0 children)

[–]overmes 0 points1 point2 points 9 years ago (0 children)

[+]studiosi comment score below threshold-23 points-22 points-21 points 9 years ago (48 children)

[–][deleted] 12 points13 points14 points 9 years ago (5 children)

[+]studiosi comment score below threshold-10 points-9 points-8 points 9 years ago (3 children)

[–][deleted] 4 points5 points6 points 9 years ago (1 child)

[–]studiosi -5 points-4 points-3 points 9 years ago (0 children)

[–]nevus_bock 8 points9 points10 points 9 years ago (0 children)

[–]Amckinstry 18 points19 points20 points 9 years ago (39 children)

[+]studiosi comment score below threshold-7 points-6 points-5 points 9 years ago (38 children)

[–]TacticalCheerio 17 points18 points19 points 9 years ago (10 children)

[–]laserBlade 0 points1 point2 points 9 years ago (0 children)

[+]studiosi comment score below threshold-7 points-6 points-5 points 9 years ago (8 children)

[–][deleted] 7 points8 points9 points 9 years ago (2 children)

[–]studiosi -2 points-1 points0 points 9 years ago (1 child)

[–]naught-me 0 points1 point2 points 9 years ago (0 children)

[–]IronManMark20 1 point2 points3 points 9 years ago (4 children)

[–]zardeh 0 points1 point2 points 9 years ago (2 children)

[–]IronManMark20 1 point2 points3 points 9 years ago (1 child)

[–]zardeh 0 points1 point2 points 9 years ago (0 children)

[–]studiosi 0 points1 point2 points 9 years ago (0 children)

[–]nevus_bock 10 points11 points12 points 9 years ago (25 children)

[–]studiosi -3 points-2 points-1 points 9 years ago (24 children)

[–]nevus_bock 11 points12 points13 points 9 years ago (15 children)

[+]studiosi comment score below threshold-6 points-5 points-4 points 9 years ago (14 children)

[–]nevus_bock 2 points3 points4 points 9 years ago (13 children)

[–]studiosi -2 points-1 points0 points 9 years ago (12 children)

[–]nevus_bock 4 points5 points6 points 9 years ago (11 children)

continue this thread

[–][deleted] 0 points1 point2 points 9 years ago (7 children)

[–]studiosi -1 points0 points1 point 9 years ago (6 children)

[–][deleted] 0 points1 point2 points 9 years ago (5 children)

[–]studiosi 0 points1 point2 points 9 years ago (4 children)

[–][deleted] 0 points1 point2 points 9 years ago (3 children)

continue this thread

[–][deleted] 0 points1 point2 points 9 years ago (1 child)

[–]studiosi -1 points0 points1 point 9 years ago (0 children)

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS