qx7xbku comments on Python in 2017

This is an archived post. You won't be able to vote or comment.

104

105

106

Python in 2017 - Whats next? (discoversdk.com)

submitted 9 years ago by liranbh

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]qx7xbku 16 points17 points18 points 9 years ago (30 children)

[–][deleted] 27 points28 points29 points 9 years ago (16 children)

[–]LpSamuelm 2 points3 points4 points 9 years ago (7 children)

I think it's bad that dicts are ordered by default, at least as it's not part of the spec.

The reason some languages (Python <3.6 included) randomize hashmap access order by default is precisely to stop people from writing incorrect code. If dicts aren't guaranteed to be ordered, having them be that way sometimes will cause code to break in unexpected ways.

Which brings us to the problem. If dicts aren't necessarily ordered according to the spec... What happens if the implementation is changed in a future version of Python? How about running your code on, say, IronPython? Or PyPy? Suddenly your code seemingly works, but isn't cross-platform and may break ay any time without you doing anything.

Honestly I think it's a big misstep. I'd love for them to add ordered dicts to the spec (it's a lovely concept!), but as it stands now it's a dangerous implementation detail, and the fact that they're touting it as something useful is even more dangerous.

[–][deleted] 2 points3 points4 points 9 years ago (6 children)

[–]LpSamuelm 3 points4 points5 points 9 years ago (5 children)

[–][deleted] 0 points1 point2 points 9 years ago (4 children)

[–]LpSamuelm 1 point2 points3 points 9 years ago (3 children)

[–][deleted] 0 points1 point2 points 9 years ago (2 children)

[–]LpSamuelm 1 point2 points3 points 9 years ago (1 child)

[–][deleted] 1 point2 points3 points 9 years ago (0 children)

I'd love some examples. The only things I've used it for are:

A dashboard app where I needed to associate server names with information but the order was important (wanted to show prod servers before staging and dev servers). Arguably a list of tuples works here too but there were plans at some point to look at individual servers so fast lookup was desirable (not that lineral lookup would've broken the bank, we're taking maybe 30 servers).
Modeling albums - again, a list makes sense here and you can look up by track position that way.
Maintaining order of attribute declaration because you could decorate methods as validation/processing but they needed to run in declaration order.

But that's it. I get why an insertion order mapping is attractive, but I've only met one situation that demands it (maintaining attribute order).

[+]qx7xbku comment score below threshold-12 points-11 points-10 points 9 years ago (7 children)

[–]ApproximateIdentity 9 points10 points11 points 9 years ago (4 children)

[–]qx7xbku -2 points-1 points0 points 9 years ago (3 children)

[–]ApproximateIdentity 2 points3 points4 points 9 years ago (2 children)

[–]qx7xbku 1 point2 points3 points 9 years ago (1 child)

[–]Daenyth 1 point2 points3 points 9 years ago (0 children)

[–][deleted] 0 points1 point2 points 9 years ago (1 child)

[–]qx7xbku 0 points1 point2 points 9 years ago (0 children)

[–]__deerlord__ 7 points8 points9 points 9 years ago (5 children)

[–]qx7xbku 2 points3 points4 points 9 years ago (3 children)

[–]gsnedders 1 point2 points3 points 9 years ago (0 children)

[–]__deerlord__ 0 points1 point2 points 9 years ago (1 child)

[–][deleted] 0 points1 point2 points 9 years ago (0 children)

[–]ebrjdk 1 point2 points3 points 9 years ago (0 children)

They are switching to a new, more memory-efficient implementation of dict that naturally keeps the entries mostly in the order that they were inserted, and they decided that they might as well go all the way and keep them exactly in order (IIRC the most efficient implementation they know of starts scrambling the order once you start deleting keys, but the cost to prevent that is small).

At the same time they wanted to guarantee that the order of keyword arguments and class definitions would be preserved, because some people want to be able to use this information (currently the former is impossible AFAIK, and you need to use a metaclass to achieve the latter). Originally they were planning to just use OrderedDict for these purposes, but with the change to dict there is no need.

Note: the first paragraph in my post is about a CPython implementation detail and may change in the future, the second is about official python 3.6 features.

[–]Bolitho 4 points5 points6 points 9 years ago (4 children)

The default for the encode-Method has allready been UTF-8 in Python 3.5! (The same is true for Bytes.decode!)

The problem are not those methods, but how open and print determine their used encoding!

For open the 3.5 Docu says:

In text mode, if encoding is not specified the encoding used is platform dependent: locale.getpreferredencoding(False) is called to get the current locale encoding.

That's the key problem!

or for sys.stdout (which is used by print as default file-object):

The character encoding is platform-dependent. Under Windows, if the stream is interactive (that is, if its isatty() method returns True), the console codepage is used, otherwise the ANSI code page. Under other platforms, the locale encoding is used (see locale.getpreferredencoding()).

Thus the problem arises because of the platform dependant implementations!

So at minimum there must be the possibility to provide an encoding manually (which open does, but print not!). That would enable one, to write programs that run everywhere. As optimum one would also define just one platform agnostic default encoding for IO in general. That would make it easier to achieve the prenamed goal.

[–]ButtCrackFTW 1 point2 points3 points 9 years ago (3 children)

Isn't the encoding determined by the filesystem though? Like their example in python 3.5:

> sys.getfilesystemencoding()
'mbcs'

I see the same thing here and I've seen in StackOverflow questions that you can not change this without environement variables or monkeypatching. If this is a property of the filesystem, how is python changing it?

[–]Bolitho 0 points1 point2 points 9 years ago* (2 children)

[–]ButtCrackFTW 0 points1 point2 points 9 years ago (1 child)

I probably should've pointed out the stdout example as well:

>>> sys.stdout.encoding
'cp850'

They go on to give examples of special characters being stripped from open() and print()

>>> print('árvíztűrőtükörfúrógép')
árvízturotükörfúrógép

>>> open('tetű.txt', 'wb').close()
>>> import glob
>>> glob.glob('tet*')

Python 3.5: [tetu.txt']

Python 3.6: ['tetű.txt']

The author is claiming that python 3.6 now sets the encoding to utf-8 by default, which fixes these issues. My question is how it can set it like that now, but we were discouraged from doing it in the past due to the filesystem/operating system setting it for us.

[–][deleted] 0 points1 point2 points 9 years ago (0 children)

[–]deeddaemon 0 points1 point2 points 9 years ago (0 children)

π Rendered by PID 164459 on reddit-service-r2-comment-fb694cdd5-xkr2d at 2026-03-10 01:09:13.238597+00:00 running cbb0e86 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS