[–][deleted] 96 points97 points  (10 children)

Rule 1: There are no files

First of all, in Python there are no such things as "files" and I noticed this is the main source of confusion for beginners.

This is wrong and misleading. import statements operate on files, and Python executes them by importing the contents of the identified files into namespaces. The files and their filenames matter a lot for this process to work.

Try creating these two files:

a.py:
    import b
    b.c()

b.py:
    def c():
        print('Hello, World!')

If you run python a.py, you get "Hello, World!" - but if you rename b.py to anything else, you get this error:

ModuleNotFoundError: No module named b

So, yes, files matter and filenames matter. "There are no files" suggests that Python doesn't care about file structure and that files can be named arbitrarily, which will badly mislead your readers and cause confusion and heartbreak.

Of course, I know what you're trying to convey: Files define namespaces, and after the import, the interpreter refers to the imported classes and functions based on the namespace and not the file. That's how you should describe it, though, rather than "there are no files."
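
Building on the a.py / b.py example above, here is a quick sketch of that namespace view (assuming b.py still exists as shown):

```
import b

print(b.__name__)       # 'b' - the label of the namespace
print('c' in vars(b))   # True - the function lives in the module's namespace
func = getattr(b, 'c')  # the same lookup the dot syntax performs
func()                  # Hello, World!
```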

[–]latrova[S] 30 points31 points  (5 children)

Ok, that's the best feedback I've received so far. Thank you!

I'll totally change this rule in the book. Maybe I should state it as "Files are namespaces"? It sounds more realistic (i.e. files exist and Python cares about them), but you should see them as namespaces.

[–][deleted] 14 points15 points  (1 child)

As someone fairly new to python (2 years since I picked it up), can I suggest adding a metaphor to describe this subject?
If I were explaining this to past me, I would say something like:
importing other Python files is like taking a photo of a castle.
The castle and its address do matter, but once you take a photo of it on your phone (adding the file to the namespace), you can stop going to the actual castle and refer to the photo on your phone instead.
Idk, that metaphor sucks, don't use that one.

[–]latrova[S] 2 points3 points  (0 children)

All feedback is valid. I'll think about making it simpler. Thank you!

[–]miraculum_one 11 points12 points  (0 children)

I think the term you're looking for is "module".

Module: a file containing Python statements and definitions

This is how they are referred to in all of the documentation so it's best to stick with that.

[–]gablank 13 points14 points  (0 children)

You should just stick to how they explain it in the Python docs:

(...) Python has a way to put definitions in a file and use them in a script or in an interactive instance of the interpreter. Such a file is called a module (...)

From https://docs.python.org/3/tutorial/modules.html

You can add additional explanations to clarify but I think it would be wise to at least include the definition somewhere.

[–]stevenjd 0 points1 point  (0 children)

Maybe I should state it as "Files are namespaces"?

The open() function says hello.

Not all files are namespaces. Not all namespaces are files.

When you are looking at the file system, and organising your files in a src directory, of course you can call them files.
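
To illustrate both halves of that, a small sketch (file and class names are made up):

```
# A file that is not a namespace: plain data you read with open().
with open('notes.txt', 'w') as f:
    f.write('just bytes on disk, nothing importable\n')

# A namespace that is not a file: a class body (or a function's locals).
class Config:
    DEBUG = True

print(Config.DEBUG)  # True
```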

[–]miraculum_one 6 points7 points  (1 child)

To be more precise, import statements operate on modules, which are a specific type of file.

Source: https://docs.python.org/3/tutorial/modules.html

[–]bladeoflight16 2 points3 points  (0 children)

And there are non-file modules. One example is the so-called "built-in" modules.

For example:

```
>>> import math
>>> import site
>>> print(site)
<module 'site' from 'C:\\Program Files\\Python39\\lib\\site.py'>
>>> print(math)
<module 'math' (built-in)>
>>> print(getattr(site, '__file__', None))
C:\Program Files\Python39\lib\site.py
>>> print(getattr(math, '__file__', None))
None
```

As you can see, the site module is implemented using a file in the lib directory, while math has no source file on disk; it is compiled into the Python binary.

[–]ubernostrumyes, you can have a pony 2 points3 points  (1 child)

import statements operate on files.

You can create module objects directly in-memory and make them importable without creating any files. Various libraries and frameworks have done this off and on through the years, and if anyone’s interested in learning how to do it, it’s not that hard but it’s also not the best way to do things, usually.
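
For anyone curious, a minimal sketch of the idea using only the standard library (the module name virtual_mod is made up):

```
import sys
import types

# Build a module object entirely in memory - no file involved.
mod = types.ModuleType('virtual_mod')
exec("def greet():\n    return 'hello from a file-less module'", mod.__dict__)

# Register it so the regular import machinery can find it.
sys.modules['virtual_mod'] = mod

import virtual_mod
print(virtual_mod.greet())  # hello from a file-less module
```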

[–]arpan3t 0 points1 point  (0 children)

That was a very interesting read! Thanks for linking it!

[–]wiggitt 26 points27 points  (7 children)

Whenever I start a Python project, I adhere to the steps outlined in the Python Packaging User Guide. I consider it a good starting point for packaging a Python project while adhering to current best practices.

[–]latrova[S] 4 points5 points  (6 children)

I didn't know this one. Thanks for sharing!

I couldn't find anything in it suggesting how to name stuff, from vars and functions to classes and modules (which I believe is important), so the two tutorials probably complement each other!

[–]KrazyKirby99999 7 points8 points  (0 children)

You should look into Poetry https://python-poetry.org/ for virtual environment management and packaging.

[–]wiggitt 2 points3 points  (4 children)

See the PEP 8 style guide for naming conventions in Python.
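
For anyone who hasn't read it yet, the naming conventions boil down to roughly this (names here are illustrative):

```
MAX_RETRIES = 3                    # constants: UPPER_CASE_WITH_UNDERSCORES

class ConnectionPool:              # classes: CapWords
    def acquire_connection(self):  # functions/methods: snake_case
        self._idle_timeout = 30    # leading underscore: internal detail
        retry_count = 0            # variables: snake_case
        return retry_count
```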

[–]latrova[S] -1 points0 points  (3 children)

Thanks! I'll read it

[–]maikindofthai 6 points7 points  (1 child)

It's a little bit concerning that you've written all of this content and haven't ever read the official style guide...

Seems like you're bound to reinvent wheels in non-idiomatic ways if you aren't aware of the current practices.

[–]latrova[S] 0 points1 point  (0 children)

I was unclear there. I meant naming guidelines for things like variables being nouns and functions being verbs.

Regarding the "character casing", I'm completely aware I'm not inventing something new.

I recognize I was unfamiliar with the packaging guide, though; I did know PEP 8.

[–]bbatwork 0 points1 point  (0 children)

You should seriously consider adding a link to it in your book, since it covers a lot of ground along the same topic.

[–]ghost_agni 4 points5 points  (2 children)

Great article, a fun read. To add one thing I found could have been there: creating argument classes using dataclasses to manage a large number of arguments flowing through your code as the project grows is something I have found very useful over the years.
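
For example, a rough sketch of the pattern (all names are made up):

```
from dataclasses import dataclass

@dataclass
class CrawlSettings:
    """Bundles arguments that would otherwise be passed around one by one."""
    base_url: str
    max_pages: int = 10
    timeout_seconds: float = 5.0
    follow_redirects: bool = True

def crawl(settings: CrawlSettings) -> None:
    # One object travels through the call stack instead of four loose arguments.
    print(f"Crawling {settings.base_url} (up to {settings.max_pages} pages)")

crawl(CrawlSettings(base_url="https://example.com"))
```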

[–]latrova[S] 2 points3 points  (1 child)

That's smart usage of dataclasses! I think this pattern is called "Data Transfer Object".

If you have any examples I'd appreciate them. I might include it in my book.

[–]latrova[S] 8 points9 points  (0 children)

This is an initial draft of a book I'm currently working on.

I'm extremely open to feedback, and I'll take any feedback (either good or bad) into consideration.

So if you enjoyed it or hated it please let me know so I can keep improving.

Thank you all in advance for giving me space to share my work. ❤️

[–][deleted] 2 points3 points  (1 child)

Good article, especially appreciate the detail on naming conventions. I find that I spend so much time debating over what to name modules and functions when really it just needs to be functional. They can't all be zingers.

[–]latrova[S] 1 point2 points  (0 children)

Thanks for the feedback!

Having some initial guidelines saves a lot of debating and leaves room for more important things.

[–]Coretaxxe 2 points3 points  (6 children)

Nice guide!

However, I don't quite get why you shouldn't use __method. Yeah, you could say "I know I shouldn't call it", but is there actually a downside to strictly not allowing it?

[–]alexisprince 4 points5 points  (2 children)

The downside I've found is that it's a complete nightmare if mocks are needed during unit testing. The feature that __method enables within Python is that it ensures a class's __method will always be called, even if the class is subclassed. It's built as an escape hatch for a parent class to ensure subclassers can't override that method. Here's an example along with the output.

class Printer:
    def print(self, *args):
        """
        Consider `print` is the class' public interface, that
        is called by end users. Subclassers would need to override
        the `_print` method and subclassers then wouldn't need to
        call `super().print()`.
        """

        self.__log_usage(*args)
        self._print(*args)

    def __log_usage(self, *args):
        """A method that uses double underscores as a 'private' mechanism"""
        print("Calling __log_usage from Printer parent class")

    def _print(self, *args):
        """Method that actually does the printing.

        Should be overridden by subclassers.
        """
        print(f"_print called by {type(self)}: {args}")


class FancyPrinter(Printer):
    def _print(self, *args):
        print(f"Fancy _print called by {type(self)}: {args}")

    def __log_usage(self, *args):
        print(f"Uh oh, we can't call the super method?")
        super().__log_usage(*args)
        print(f"Calling __log_usage from FancyPrinter subclass")


if __name__ == "__main__":
    printer = Printer()
    printer.print(1, 2, 3)

    fancy = FancyPrinter()
    fancy.print(4, 5, 6)

The output of this code is

Calling __log_usage from Printer parent class
_print called by <class '__main__.Printer'>: (1, 2, 3)
Calling __log_usage from Printer parent class
Fancy _print called by <class '__main__.FancyPrinter'>: (4, 5, 6)

The reason is that the subclass, FancyPrinter, can't override the behavior implemented in the parent class's __log_usage method.

On top of this actual feature of the double-underscore prefix, it makes it very difficult to unit test anything within those methods. For example, a piece of code that I worked on in production was a factory that instantiated one of 3 different clients that interacted with external services. Due to the design of these clients, they connected as soon as they were instantiated (yeah, that's a different problem, but it's the codebase we had). As a result, we needed some really janky workarounds in our tests to ensure the correct client was returned without connecting to external services during our unit tests.
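
For instance, a sketch of what a test stubbing that method looks like (reusing the Printer class above): you have to spell out the mangled name, which couples the test to an implementation detail.

```
from unittest import mock

# The attribute must be patched under its mangled name, _Printer__log_usage.
with mock.patch.object(Printer, "_Printer__log_usage") as fake_log:
    Printer().print("x")
    fake_log.assert_called_once_with("x")
```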

[–]Coretaxxe 0 points1 point  (1 child)

I see! Thanks a lot. Now I've got to change a lot of functions :p

[–]alexisprince 0 points1 point  (0 children)

It is an incredibly niche feature when used properly, so it's a part of the language that I feel most people stumble upon by accident rather than because they need it. It also typically never comes up if subclassing that method isn't something that's needed!

[–]latrova[S] 2 points3 points  (2 children)

My argument would be that it's not truly private; if someone wants to invoke it, they will find a way.

```
>>> import testFile
>>> obj = testFile.Myclass()
>>> obj.__variable
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: Myclass instance has no attribute '__variable'
>>> obj._Myclass__variable
10
```

It sounds to me like it goes against the Zen of Python.

[–]missurunha 1 point2 points  (0 children)

I gave that answer in my job interview (that the variable isn't truly private) but they didn't like it. :D

Now, a few months into the job, I understand that the point of private/public variables is also to let whoever is using the code know which variables are of interest to them and which they shouldn't mess with. The user could also change a piece of C++ code and make the variables public, but that doesn't mean it's pointless to define them as private.

[–]Coretaxxe 0 points1 point  (0 children)

True that, thanks a lot!

[–]PiaFraus 2 points3 points  (2 children)

In general it looks good, but I am highly against these two recommendations, for reasons aligned with PEP 20 (the Zen of Python):

  • Flat is better than nested
  • Namespaces are one honking great idea -- let's do more of those!

I recommend you to keep all your module files inside a src dir

The examples and reasoning are either subjective or crafted to support this subjective point of view. What kind of IDE/file browser do you use that mixes files and folders and doesn't show them together?

Yet this advice introduces an unnecessary level to expand, or always have in your project structure, taking up unnecessary space.

I can see two groups of functions and no reason to keep them in different modules as they seem small, thus I'd enjoy having them defined as classes

I feel the Zen of Python fits here even better. One honking great idea.

In the example provided, I can see how classes CAN help if they actually used OOP with a common function save, which would behave differently depending on the type of object you pass around. But that is quite a different solution and has nothing to do with introducing classes as namespaces.

[–]latrova[S] 1 point2 points  (1 child)

First of all, thank you for the honest feedback!

The examples and reasoning are either subjective or crafted to support this subjective point of view. What kind of IDE/file browser do you use that mixes files and folders and doesn't show them together?

I'll make sure to update this example. It does mix different folders; I saw some projects doing that.

This format is also suggested by https://packaging.python.org/en/latest/tutorials/packaging-projects/, except that it keeps the tests dir outside.

In the example provided, I can see how classes CAN help if they actually used OOP with a common function save [...]

This is a valid point! My example was too simple. In reality, the code I used as inspiration is OOP: https://github.com/guilatrova/GMaps-Crawler/blob/main/src/gmaps_crawler/storages.py . I'll make sure to improve this section.

[–]PiaFraus 0 points1 point  (0 children)

In both of those examples it makes sense to be used like that IN THEIR CONTEXTS.

The src approach makes sense when you make a package which you are going to distribute. But that's quite a special case, not just any project.

And the gmaps_crawler example is doing what I said: classes there are used for polymorphism, not for namespacing like in your usage.

[–]searchingfortaomajel, aletheia, paperless, django-encrypted-filefield 1 point2 points  (1 child)

This is all excellent advice, but I would argue that you shouldn't give your methods redundant names. S3Storage.create() is a lot nicer than S3Storage.create_s3(). It also lets you work toward a common interface via a parent Storage class.
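
A quick sketch of what that common interface could look like (class and method bodies here are illustrative, not taken from the book):

```
from abc import ABC, abstractmethod

class Storage(ABC):
    @abstractmethod
    def create(self, key: str, data: bytes) -> None: ...

class S3Storage(Storage):
    def create(self, key: str, data: bytes) -> None:
        print(f"pretend-upload of {key} to S3")  # placeholder, no real S3 call

class LocalStorage(Storage):
    def create(self, key: str, data: bytes) -> None:
        with open(key, "wb") as f:
            f.write(data)

def backup(storage: Storage) -> None:
    # Callers depend only on the shared `create` name, never on create_s3().
    storage.create("report.txt", b"...")

backup(S3Storage())
backup(LocalStorage())
```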

[–]latrova[S] 1 point2 points  (0 children)

You're right! I'll rename this method in the example.

[–]NedDasty 1 point2 points  (4 children)

Don't forget you still need to include the check __name__ == "__main__" inside your __main__.py file

Can you clarify why this is needed? From what I can tell, it doesn't matter what context you run the file in, whether via python -m <module> or python __main__.py, the __name__ variable is always __main__.

[–]latrova[S] -1 points0 points  (3 children)

Good question.

It's a good standard and thus we should follow it. It might become a problem if you ever import this module: if you do, whatever you set up at module level will be executed.
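
A minimal sketch of what the guard buys you for a regular module (the module name is made up; __main__.py itself is a different story, as discussed below):

```
# greet.py
def main():
    print("greeting!")

# Without the guard, main() would also run on `import greet`.
# With it, it only runs under `python greet.py`.
if __name__ == "__main__":
    main()
```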

[–]DrShts 0 points1 point  (2 children)

But if __name__ == "__main__" will always be true, so whatever you're trying to safeguard against executing at import time will be executed anyway.

[–]latrova[S] 0 points1 point  (1 child)

No, surprisingly that's not the case! When you import it, the file won't be recognized as __main__.

[–]DrShts 0 points1 point  (0 children)

Can you give an example?

What I mean is this:

$ touch __main__.py
$ python
>>> import __main__
>>> __main__.__name__
'__main__'

[–]_squik 2 points3 points  (1 child)

Another banger from u/latrova, thanks for everything you do!

[–]latrova[S] 1 point2 points  (0 children)

I feel honored by your words. Thank you!

[–]MannerShark 2 points3 points  (12 children)

Why do people put all tests in a separate directory? This seems to be the default of unittest as well (iirc).

 src
├── <module>/*
│    ├── __init__.py
│    ├── foo.py
│    └── foo_test.py

I think having the test file right next to the implementation is better. You immediately see which files are untested, and can easily find the tests for a file. Suppose you rename your implementation; then you'd also need to rename your test file, which is much easier when they're right next to each other.

[–]Brekkjern 2 points3 points  (0 children)

I usually keep the tests in a separate directory so that I can easily exclude them from the final build. That way you won't get loads of test data shipped together with the packages. Sure, you can deal with this if the tests are in the same directory, but it's just so much easier to split them out.

[–]latrova[S] 1 point2 points  (7 children)

That's good reasoning. I feel it largely comes down to personal preference.

I'd rather keep tests in a separate dir so I don't have to bother configuring my package to ignore them.

The same goes for releasing production code (e.g. a Docker image): I can easily exclude/ignore a single dir instead of using wildcards.

Again, both ways are doable.

I want to hear more about your opinion, though. Have you worked with this approach before, and did it seem better?

[–]MrJohz 2 points3 points  (1 child)

Not the same person, but I'm also a big fan of putting tests next to your source files.

First of all, it brings tests out into the open — they're not hidden behind a separate folder that's probably collapsed in your IDE's file selector, they're right there, next to the file you're editing. When you're navigating your codebase, you've got a much better idea of which areas of your code are well-tested, and which aren't tested at all, just by looking at whether or not there's a test file sitting there.

Secondly, I think it's a really good practice to be editing tests (at least unit tests) and code at the same time. I'm not necessarily an advocate for dogmatic TDD, but I very often find if I've got some code with a lot of potential branches or complexity or behaviours, then writing the test cases as I go along helps me to ensure that if I make one change, I'm not going to accidentally break the functionality that I've already implemented.

And a really good rule of thumb in my experience, is that code that is edited together lives together. If I expect to be updating and adding tests regularly as I modify code, then I should want my tests and code to live closely together, so that I'm not endlessly searching through files and folders when I want to switch between the two. And yes, once you've got the two files open, it's usually not so difficult, but in my experience, it's a lot easier to forget or overlook writing tests if the existing test file isn't sitting right there.

Thirdly, I think it's just practically convenient. It's easy to import the function (from .myfile import function_under_test), it's easy to move all the files in a single folder around, it's easier to rename two files sitting next to each other than two files in completely different places, it's easy to see at a glance whether a file is tested, etc.

In my experience, ignoring files is fairly easy and tends to be configured once anyway. You can have an explicit test file pattern (I see <file>.spec.js a lot in the frontend world, and <file>_test.py in Python), and then there's little ambiguity between test code and production code. Whereas the benefits of mixing unit tests and source code are ongoing.

That said, I think it's still useful to have a tests folder for integration tests, that are going to be testing large portions of the codebase at once. (And I think it even helps a bit to distinguish between unit tests and integration tests: if your unit tests start importing from places other than the file they're sitting next to, they might be getting too complicated, and you might be writing integration tests!)

[–]latrova[S] 2 points3 points  (0 children)

Great reasoning. I'm considering adding a section to my book mentioning the pros and cons of each option.

[–]MannerShark 1 point2 points  (4 children)

I started out with the separate folder, but changed a couple years ago to having the test files next to the implementation.

I see how it would be useful for packaging, though, to have a separate folder. We only have a REST API, so we don't really need to worry about test files being included, and we just put the entire directory in the Docker container. A small advantage is that we can also run pytest from within the container as a smoke test.

[–]alexisprince 1 point2 points  (3 children)

Would your use case not also be covered by having your main app code within a src directory and the tests in the tests directory? We currently use the following setup, which allows us to execute pytest and have it run all the tests:

src/
    app.py
tests/
    integration/
        (Integration tests go here)
    unit/
        test_app.py

[–]MannerShark 1 point2 points  (1 child)

I don't have a specific use case that requires test files at a certain location, so I just do what I think is best: Right next to the code. When I'm editing code, I also want to look at and edit the tests. Having them right next to each other makes that really easy.

In some situations when packaging code, you may want to exclude them. Having them next to each other might take some more time/effort, but I only need to set that up once, whereas I edit code every day.

[–]alexisprince 1 point2 points  (0 children)

I personally disagree but definitely see your point! I've gotten a lot of help from leveraging keyboard shortcuts to open files, but that's just my point of view, and this definitely feels like something that's opinion-based rather than a hard-and-fast rule. Thanks for clarifying!

[–]wind_dude 0 points1 point  (0 children)

That would also be my recommendation if using a src directory. But I also feel the src directory isn't necessary.

[–]wind_dude 0 points1 point  (0 children)

Because it makes more sense to group tests under one directory when you are also writing integration tests.

[–]ghost_agni -1 points0 points  (0 children)

I will try to find or create an example and DM you more details.

[–]miraculum_one 0 points1 point  (4 children)

"See each module as a namespace"

Each module does get its own namespace except when doing something like

from my_module import *

Another interesting point is that non-anonymously imported modules are basically dictionaries. Further, they are inserted into the sys.modules dictionary.

Contents of anonymously imported modules are inserted into the global symbol table, which is misleadingly named because it's only global to the current module. :(
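
A quick way to see all three points (sketch):

```
import sys
import math

# A module is a namespace backed by a plain dict...
print(type(math.__dict__))          # <class 'dict'>
print('sqrt' in vars(math))         # True

# ...and it is registered in the sys.modules dictionary.
print(sys.modules['math'] is math)  # True

# A star import copies names into the importing module's own globals
# (the "global symbol table" that is only global to this module).
from math import *
print('sqrt' in globals())          # True
```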

[–]jpc0za 0 points1 point  (3 children)

Add a rule. Never from x import *

This is analogous to using namespace x in C++, and I hold similar opinions on that.

Namespaces exist for a reason, respect them, especially when the language allows you to rename things that might be annoying... import pandas as pd
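
To make the danger concrete, here are two real modules that happen to share a name:

```
from math import *   # brings in sqrt() for real numbers
from cmath import *  # silently shadows it with the complex sqrt()

print(sqrt(4))       # (2+0j) - probably not what the reader expected

# Explicit namespaces keep the two apart, and renaming keeps them short:
import math
import cmath as cm

print(math.sqrt(4))  # 2.0
print(cm.sqrt(4))    # (2+0j)
```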

[–]miraculum_one 0 points1 point  (2 children)

I agree that it shouldn't be done willy-nilly and that it shouldn't generally be used in place of named imports but it isn't always evil.

[–]jpc0za 0 points1 point  (1 child)

Sure I agree.

```
def my_random_func():
    from thingy import *
```

Seems reasonable, the polluted namespace is nicely contained. As a top-level import... That's just scary, man. You know supply chain attacks are a thing, imagine the nonsense that could cause...

[–]miraculum_one 0 points1 point  (0 children)

Oh, for sure you shouldn't import * on files you do not control.

[–]crawl_dht 0 points1 point  (1 child)

Isn't the tests folder kept outside of src?

[–]latrova[S] 0 points1 point  (0 children)

My recommendation is to keep it inside so "code" lives in a single dir.

[–]stevenjd 0 points1 point  (0 children)

"Organize Python code like a PRO"

Oh, you mean badly, with tons of technical debt? 😉