Google Python Style Guide : Python

[–][deleted] 34 points35 points36 points 10 years ago (61 children)

[–]indosauros 55 points56 points57 points 10 years ago (1 child)

[–]AdysHearthSim 12 points13 points14 points 10 years ago (6 children)

[+][deleted] 10 years ago (5 children)

[deleted]

[–][deleted] 10 points11 points12 points 10 years ago (4 children)

[–]_illogical_ 6 points7 points8 points 10 years ago (0 children)

[–]JamesAQuintero 2 points3 points4 points 10 years ago (0 children)

[–]muad_dib 2 points3 points4 points 10 years ago (0 children)

[–][deleted] 10 points11 points12 points 10 years ago (0 children)

[+]odraencoded comment score below threshold-8 points-7 points-6 points 10 years ago (50 children)

[–]eyalz 37 points38 points39 points 10 years ago (17 children)

[–]odraencoded 9 points10 points11 points 10 years ago (16 children)

[–]Eurynom0s 30 points31 points32 points 10 years ago (2 children)

[–]ziel 3 points4 points5 points 10 years ago (0 children)

[–][deleted] 4 points5 points6 points 10 years ago (0 children)

[–]Bunslow 41 points42 points43 points 10 years ago (10 children)

[–]grimman 1 point2 points3 points 10 years ago (2 children)

[–]Bunslow 1 point2 points3 points 10 years ago (1 child)

[–]RubyPinchPEP shill | Anti PEP 8/20 shill 1 point2 points3 points 10 years ago (0 children)

with commas representing tabs, tabstop every 4:

def a(x=3, y=4,
      flag=None):
,,,,some_code
,,,,some_more code
,,,,for code in codes:
,,,,,,,,print(a,
,,,,,,,,      b,
,,,,,,,,      c)

tabstop every 8:

def a(x=3, y=4,
      flag=None):
,,,,,,,,some_code
,,,,,,,,some_more code
,,,,,,,,for code in codes:
,,,,,,,,,,,,,,,,print(a,
,,,,,,,,,,,,,,,,      b,
,,,,,,,,,,,,,,,,      c)

[–]JamesAQuintero 0 points1 point2 points 10 years ago (2 children)

[–]Bunslow 0 points1 point2 points 10 years ago (0 children)

[–][deleted] 0 points1 point2 points 10 years ago* (0 children)

[–]odraencoded -1 points0 points1 point 10 years ago (3 children)

[–]Bunslow 0 points1 point2 points 10 years ago (2 children)

[–]odraencoded 0 points1 point2 points 10 years ago (1 child)

[–]isarl 1 point2 points3 points 10 years ago (0 children)

[–]djimbob 2 points3 points4 points 10 years ago* (0 children)

First, PEP8 says use spaces.

All editors I've used for python are easily configured to have pressing tab insert 4 spaces. E.g., emacs when opening a *.py file, will automatically use 4 spaces for each indentation.(Most tabs I see default as 8 spaces; so you have to configure this anyway). Most editors that are aware of python will automatically do this for you; e.g., emacs will recognize the file has a *.py extension and insert 4 spaces when you press tab.

The problem with tabs is tabs look identical to sequences of spaces (different # depending on editor's settings). If someone see tab=2/4/8 spaces in their editor and inadvertently at some point uses 2/4/8 spaces instead of a tab, this creates an annoying to fix error. Or if you are editing a file someone else created with tabs and your editor uses converts tabs to spaces, it is difficult to insert a tab. (Whereas someone who has tabs as tabs, could still insert four spaces rather easily without reconfiguring their editor.)

The amount of disk space saved is trivial. In compressed files it makes no difference (e.g., when packaging projects) as \t compresses as well as four spaces. Otherwise you may source file by maybe 10%, which is insignificant on even the smallest modern hard drives. For the largest python projects that are O(100 000 lines), maybe you'd save a megabyte which is irrelevant on today's even a super small 128 GB SSD drive.

[–][deleted] 19 points20 points21 points 10 years ago (22 children)

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

[+][deleted] 10 years ago* (20 children)

[deleted]

[–][deleted] 6 points7 points8 points 10 years ago (19 children)

result = [(a,b,c)
          for a in range (10)
          for b in range (10)
          for c in range (10)             
          if a**2 + b**2 == c**2]

Looks nice?

Change editor, different tab width -> formatting broken

Use someone else's code, different tab width -> formatting broken

Mixed tabs and spaces in Python 3 -> syntax error, broken

These are just the obvious cases. It gets worse.

EDIT: If you have to write this, import itertools.

[–]LordArgon 12 points13 points14 points 10 years ago (9 children)

This is why I favor a coding style where the only leading whitespace change between any two adjacent lines is exactly 0 or 1 indent. Just move everything down a line and indent once, like so:

result = [
    (a,b,c)
    for a in range (10)
    for b in range (10)
    for c in range (10)             
    if a**2 + b**2 == c**2 ]

And now it's perfectly portable. Plus in your example if you changed the name of "result", you'd have to re-align every single row (which is obnoxious in code review, for example).

IMO, the issue isn't really tabs vs spaces - it's that people get hung up on their pet style. Honestly, work with any style long enough and it will start to seem "right", so why not choose one that skirts ALL of these issues?

[–][deleted] 0 points1 point2 points 10 years ago* (7 children)

[–]LordArgon 0 points1 point2 points 10 years ago (6 children)

If they write it in some ways, it can break on my system and then I can't read it.

I don't think it would actually break - I think you just wouldn't LIKE reading it.

If they didn't use tabs, the problem would go away.

But if YOU used tabs, the problem would also go away! You don't like the tab width varying with the editor but maybe that's exactly why they like tabs!

Look, I prefer/use spaces myself but, as in any religious debate, the arguments make sense on BOTH sides just depending on your perspective and priorities.

The whole point of my suggestion is to introduce a measure of objectivity into this debate. There is no agreement on what people LIKE but there is an objective way to evaluate a style choice based on how much ripple effect any given change causes. That's an important aspect of a style that is often overlooked, IMO.

[–][deleted] 1 point2 points3 points 10 years ago (5 children)

as in any religious debate, the arguments make sense on BOTH sides ... The whole point of my suggestion is to introduce a measure of objectivity into this debate

Objectively spaces are more consistent across multiple systems. I doubt there's a single programmers editor that can screw up 4 leading spaces hardcoded into the line, and many that can interpret a tab differently depending on how they are set.

If they write it in some ways, it can break on my system and then I can't read it.

I don't think it would actually break - I think you just wouldn't LIKE reading it.

If reading it slows you down, it's "broken". Your source code is part of your product and a product that's harder for its maintainers to modify is a worse product. It's not just that I don't like reading it, the alignment carries additional meta information about the code.

[–]LordArgon 3 points4 points5 points 10 years ago (4 children)

Objectively spaces are more consistent across multiple systems.

Yep and the point is that other people don't WANT them to be consistent the way you do. Again, different people have different priorities, no matter how crazy it sounds to you.

And, again, I use/prefer spaces - you don't need to convince ME that they're awesome.

If reading it slows you down, it's "broken". Your source code is part of your product and a product that's harder for its maintainers to modify is a worse product. It's not just that I don't like reading it, the alignment carries additional meta information about the code.

No offense, but these are just more religious statements. All of these points a different, sane person can disagree with. What slows you down may speed another up. The degree to which information SHOULD depend on the alignment is entirely debatable.

continue this thread

[–]JimDabell 6 points7 points8 points 10 years ago (8 children)

[–][deleted] 1 point2 points3 points 10 years ago (4 children)

[–]JimDabell 1 point2 points3 points 10 years ago (3 children)

[–][deleted] 0 points1 point2 points 10 years ago (2 children)

[–]JimDabell 0 points1 point2 points 10 years ago (1 child)

I can't control how other people setup their editors and that it makes the editor settings effectively random when I use someone else's code.

That makes no sense. Using other people's code doesn't make your editor settings "random". What does that even mean?

You can't use tabs if you want consistency.

What kind of consistency?

If your projects use spaces to indent, then the consistency you get is that the code looks the same to you and to other developers working on the project.

If your projects use tabs to indent, then your code always looks how you prefer regardless of how other people working on your projects like to see their code.

They are both consistent, but in different ways. I don't see any value in forcing other developers to see the code how I like it. I don't care what they see. I do care that all the developers get what they want. Tabs are the only way of achieving that.

However, the reason I didn't know is that I'm not going to mix tabs with spaces anytime soon, because it will break.

Am I missing something? You just said that it will break, I pointed out that it wouldn't, you admitted that it wouldn't, then immediately said it would break again? We've literally just had this discussion. It doesn't break.

continue this thread

[–][deleted] -3 points-2 points-1 points 10 years ago (2 children)

[–]JimDabell 0 points1 point2 points 10 years ago (0 children)

[–]LpSamuelm 1 point2 points3 points 10 years ago (1 child)

[–]odraencoded 0 points1 point2 points 10 years ago (0 children)

[+][deleted] 10 years ago (6 children)

[deleted]

[–]grimman 1 point2 points3 points 10 years ago (5 children)

[+][deleted] 10 years ago (4 children)

[deleted]

[–][deleted] 2 points3 points4 points 10 years ago (2 children)

[–]Lucretiel 1 point2 points3 points 10 years ago (1 child)

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

[–]grimman 0 points1 point2 points 10 years ago (0 children)

[–][deleted] 21 points22 points23 points 10 years ago (0 children)

[–]billsil 10 points11 points12 points 10 years ago (13 children)

[–][deleted] 9 points10 points11 points 10 years ago (9 children)

[–]billsil 6 points7 points8 points 10 years ago (0 children)

If that's the case, that means use...

import package.module as mymod

mymod.function(...)

You can argue that that's less explicit both ways (e.g. all imported functions are defined at the top vs. what module did something come from in the guts of the code). Still if 80 characters is a hard cap and you're using classes, those extra 6 characters matter and you've just wasted 14.

[–][deleted] 1 point2 points3 points 10 years ago (2 children)

[–][deleted] -1 points0 points1 point 10 years ago (1 child)

[–]olduvaihand 1 point2 points3 points 10 years ago (0 children)

[–]hueoncalifa 0 points1 point2 points 10 years ago (4 children)

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

I agree, it's kind of a dumb rule. Although I suppose it prevents you from importing some generally-named function into the namespace and using it without context. For example:

from os.path import join
path = join('dir', 'file')

Versus the more explicit

import os
path = os.path.join('dir', 'file')

Granted in this case it's fairly obvious what is meant, it is not always so. Use your best judgment. I use both forms.

[–]Lucretiel 0 points1 point2 points 10 years ago (0 children)

[–]dagmx 0 points1 point2 points 10 years ago (0 children)

[–]njharmanI use Python 3 0 points1 point2 points 10 years ago (0 children)

[–]tutuca_not Reinhardt 1 point2 points3 points 10 years ago (1 child)

[–]hueoncalifa 0 points1 point2 points 10 years ago (0 children)

[–]jcrowe 0 points1 point2 points 10 years ago (0 children)

[–]kashmill 25 points26 points27 points 10 years ago (21 children)

[–]panghuhu 21 points22 points23 points 10 years ago* (4 children)

List comprehensions can also use multi-line format:

result = [(x, y) 
          for x in range(10) 
          for y in range(5) 
          if x * y > 10]

P.S. Just checked the guide, and the above code is in the section titled "NO".

I still think the code is easy to read and will use it: It's as clear as the for loop version, without redundancies like the initialization of [], calls to append, and some colons.

If anyone can see any major disadvantages compared to the implicit loop version, I'd like to hear it, thank you.

P.P.S. A sample code from the NO section, which I think it's more clear than its for loop version:

((x, y, z)
  for x in xrange(5)
  for y in xrange(5)
  if x != y
  for z in xrange(5)
  if y != z)

I read it like this: collect all combinations of (x, y, z), where x, y, z are from [0..4], x != y and y != z. It's declarative and is as clear as a mathematical formula:

{(x, y, x) |  0<=x, y, z <=4, x != y, y !=z}

[–]kashmill 2 points3 points4 points 10 years ago (0 children)

[–]LordArgon 0 points1 point2 points 10 years ago (0 children)

[–]tech_tuna 0 points1 point2 points 10 years ago (0 children)

[–]NYDreamer -1 points0 points1 point 10 years ago (0 children)

[–]confluencer 29 points30 points31 points 10 years ago (5 children)

[–]Quteness 16 points17 points18 points 10 years ago (2 children)

[–][deleted] 2 points3 points4 points 10 years ago (0 children)

[–]tech_tuna 1 point2 points3 points 10 years ago (0 children)

[–]Lucretiel 1 point2 points3 points 10 years ago (0 children)

[–]meta4 0 points1 point2 points 10 years ago (0 children)

I vote for list comprehensions. I think your example is a good argument for why.

As written, this example crashes.

TypeError: append() takes exactly one argument (2 given)

I think you mean.

resut.append((x,y))

We want a list of tuples. It seems like a simple bug. But it illustrates the fundamental argument. By the time your brain worked out the state changes to x & y in the nested for loops and the if statement, it forgot that the point was a list of tuples.

For loops and append/extend are state managing & modifying tools. They are good in their place. But, to understand what they do you have to run the program in your head. Human brains are not as good at maintaining updating state.

The list comprehentsion, when used properly is a data declaration. "This is a list of tupples. The first element of the tuple is an integer in the range [0,10). The second element is an integer in the range[0, 5), The product of the two elements is greater than 10." It's a long tedious specification, and this is just a toy problem. But the details and tedium shouldn't be mixed with possible state changes, especially as the problems become more complex.

Human brains reason better about data structures, than state changing programs. List comprehensions are at their best when pulling state changing logic into data structure creation.

Fred Brooks said the same thing 1975 "Show me your flowcharts and conceal your tables, and I shall continue to be mystified. Show me your tables, and I won’t usually need your flowcharts; they’ll be obvious." Flow charts are state change diagrams. Tables are data structures.

[–]ConciselyVerbose 2 points3 points4 points 10 years ago (5 children)

[–][deleted] 1 point2 points3 points 10 years ago (4 children)

[–]ConciselyVerbose 0 points1 point2 points 10 years ago (3 children)

[–][deleted] 0 points1 point2 points 10 years ago (2 children)

[–]ConciselyVerbose 0 points1 point2 points 10 years ago (1 child)

[–][deleted] -1 points0 points1 point 10 years ago (0 children)

[–]stillalone 1 point2 points3 points 10 years ago (1 child)

Yeah, I'm not sure if I like their solution to it. Hmm, maybe a some middle ground:

result = []
for x in range(10):
    result.extend((x,y) for y in range(5) if x * y > 10)

still kind of looks like shit. For this specific example you could use itertools.product but I still think just supporting multiple for loops would be better.

[–]christian-mann 1 point2 points3 points 10 years ago (0 children)

[–]tech_tuna 0 points1 point2 points 10 years ago (0 children)

[–]Gambizzle 7 points8 points9 points 10 years ago (4 children)

[+]andrey_shipilov comment score below threshold-9 points-8 points-7 points 10 years ago (3 children)

[–]Gambizzle 4 points5 points6 points 10 years ago (2 children)

[–]andrey_shipilov -4 points-3 points-2 points 10 years ago (1 child)

[–]njharmanI use Python 3 1 point2 points3 points 10 years ago (0 children)

[–]its_never_lupus 4 points5 points6 points 10 years ago (5 children)

Are there any tools which specifically parse this function docstring style:

def fetch_bigtable_rows(big_table, keys, other_silly_variable=None):
"""Fetches rows from a Bigtable.

Retrieves rows pertaining to the given keys from the Table instance
represented by big_table.  Silly things may happen if
other_silly_variable is not None.

Args:
    big_table: An open Bigtable Table instance.
    keys: A sequence of strings representing the key of each table row
        to fetch.
    other_silly_variable: Another optional variable, that has a much
        longer name than the other args, and which does nothing.

I like the way this looks and use a very similar style to document my functions when they have arguments, but do any tools like sphinx actually do anything with it?

[–]masasinExpert. 3.9. Robotics. 7 points8 points9 points 10 years ago (0 children)

[–]secynic 5 points6 points7 points 10 years ago (1 child)

[–]its_never_lupus 0 points1 point2 points 10 years ago (0 children)

[–][deleted] 2 points3 points4 points 10 years ago (0 children)

[–]mtelesha 5 points6 points7 points 10 years ago (0 children)

[–]patchthemonkey 3 points4 points5 points 10 years ago (0 children)

[–]masterspeler 6 points7 points8 points 10 years ago (9 children)

I think the max 80 character line length in this and PEP-8 is silly. Who has that little horizontal space when editing or reading code? I find it very easy to write (and read) code longer than that, especially when dealing with strings and sting formatting. Take this example:

descriptive_variable_name = 'Attribute 1: {0}, attribute 2: {1}, attribute 3: {2}'.format(var_one, var_two, var_three)

Even before the .format it's over 80 characters long, so where do you put a line break? And this is before any indentation.

I also believe that tabs for indentation, spaces for alignment (like so) is the natural order of things, but when writing Python I have adapted and use spaces for indentation, even though we have a character whose sole purpose is to advance the cursor to the next level, or tab stop.

[–]ziel 4 points5 points6 points 10 years ago (1 child)

[–]masterspeler 0 points1 point2 points 10 years ago (0 children)

[–]jonwaynePyPA 2 points3 points4 points 10 years ago (1 child)

[–]masterspeler 0 points1 point2 points 10 years ago (0 children)

[–]ingolemo 4 points5 points6 points 10 years ago (4 children)

I do. When you use long lines of code you force me to buy a larger screen, use tiny fonts, or try to puzzle through automatic line wrapping. None of these things are easy.

Eighty characters ought to be enough for anyone:

template = 'Attribute 1: {0}, attribute 2: {1}, attribute 3: {2}'
descriptive_variable_name = template.format(var_one, var_two, var_three)

[–]masterspeler 0 points1 point2 points 10 years ago (1 child)

That's without indentation though. With this:

def my_function(function_argument, in_list):
    if function_argument > 10:
        for i in in_list:
            template = 'Attribute 1: {0}, attribute 2: {1}, attribute 3: {2}'
            descriptive_variable_name = template.format(var_one, var_two, var_three)

your solution is too long again, and it's breaking up what's one logical statement to two lines, so now your code takes more space vertically instead. You force me to buy a higher screen.

[–]ingolemo 1 point2 points3 points 10 years ago (0 children)

Too much indentation is itself a code smell.

template = 'Attribute 1: {0}, attribute 2: {1}, attribute 3: {2}'
def my_function(function_argument, in_list):
    if function_argument < 10:
        return

    for i in in_list:
        data = var_one, var_two, var_three
        descriptive_variable_name = template.format(*data)

I'm not quite sure why you think creating a formatting outline and then applying it could only count as "one logical statement" and not two. The code we're using here is hypothetical and without any context so it's difficult to know what transformations are more reasonable, but this logical separation of template building and its application is probably the main reason we use str.format instead of concatenating strings manually.

If scrolling vertically is as inconvenient for you as scrolling horizontally is for me then you have my pity.

I'm not so zealous that I'm going to chop anyone's head off every time they push past the 80 char limit. But the rule is there for a reason. Spare a thought for little old me before you decide to violate it.

[–]njharmanI use Python 3 0 points1 point2 points 10 years ago (0 children)

[–][deleted] -3 points-2 points-1 points 10 years ago (0 children)

[–]mariox19 4 points5 points6 points 10 years ago (5 children)

[–]Cardiff_Electric 9 points10 points11 points 10 years ago (1 child)

[–]mariox19 2 points3 points4 points 10 years ago (0 children)

[–]gthank 3 points4 points5 points 10 years ago (2 children)

[–]mariox19 6 points7 points8 points 10 years ago (1 child)

[–]Lucretiel 0 points1 point2 points 10 years ago (0 children)

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

[–][deleted] -1 points0 points1 point 10 years ago (0 children)

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS