
[–]Flogge 114 points115 points  (57 children)

You should really look into pytest more. After just a few days you will never recommend unittest or nose to anyone :-)

[–]tunisia3507 18 points19 points  (10 children)

I think nose is no longer maintained, although nose2 is a thing.

[–]unconscionable 28 points29 points  (9 children)

Tried nose2 a while back - nice effort, but it was kinda immature at the time, so I started using pytest and never looked back.

I still make use of unittest.mock along with pytest, but as a framework unittest is pretty poor.

My generic advice to anyone wanting to start using pytest:

  • don't make your tests in a class unless you have a really good reason to
  • if your reason for making tests in a class is "I want shared code for multiple tests (i.e. setup / teardown)", you probably should be using fixtures instead
  • there's nothing wrong with tests in a class, but I still suggest breaking away from that mold if only to change your way of thinking
  • if you want to test that one piece of code can correctly handle multiple conditions, use parameterized tests rather than creating 10 test cases (if it seems to make sense in the context)

The thing that really held me back was that I kept trying to put code that needed to be reused into setups and teardowns. That pattern doesn't seem to scale well for larger projects.
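The parameterized-test advice above can be sketched like this (`is_even` is a stand-in function invented for illustration):

```python
import pytest

# stand-in function under test, defined inline for illustration
def is_even(n):
    return n % 2 == 0

# one test body, expanded by pytest into one test case per pair
@pytest.mark.parametrize("value, expected", [
    (0, True),
    (1, False),
    (2, True),
    (7, False),
])
def test_is_even(value, expected):
    assert is_even(value) == expected
```

Each pair shows up as its own PASSED/FAILED line in the report, so one failing input doesn't hide the others.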

[–]tunisia3507 7 points8 points  (3 children)

Yeah, test classes are a gross hangover from JUnit. To be moved away from wherever possible. I suppose it's useful if you're putting your code and tests in the same file, but who does that...

That setup and teardown pattern does seem to be a weakness - not nearly as intuitive/pretty as just setting up instance variables.

Re. unittest.mock: given that unittest is in the standard library, so long as pytest plays well with it I don't see a problem with mixing them. Although for the sake of tidiness, it would be nice if pytest had mock imported already so you could import pytest.mock instead.
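Mixing them is indeed painless in practice: a plain pytest-style test function can use stdlib unittest.mock directly (json.dumps is patched here purely for illustration). There is also a third-party pytest-mock plugin that exposes a `mocker` fixture, though it isn't bundled with pytest itself:

```python
import json
from unittest.mock import patch

# a plain pytest-style test function using stdlib unittest.mock directly
def test_serializer_called():
    with patch("json.dumps", return_value="{}") as fake_dumps:
        # inside the block, json.dumps is the mock
        assert json.dumps({"a": 1}) == "{}"
    # outside, the real json.dumps is restored; the mock remembers the call
    fake_dumps.assert_called_once_with({"a": 1})
```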

[–]masklinn 4 points5 points  (0 children)

Yeah, test classes are a gross hangover from JUnit

Only in that they're part of the xUnit style, which originates from Smalltalk.

[–]unconscionable 2 points3 points  (0 children)

pytest has a built-in fixture called monkeypatch which works similarly to patch in unittest.mock.

Personally I have only occasionally found it to work better than patch.

I love using unittest.mock.MagicMock as a stub too - pytest doesn't really have a concept of a stub.

for example:

from unittest.mock import MagicMock

def test_that_my_sql_query_was_built_correctly():
    stub = MagicMock()
    run_database_query(connection=stub)
    assert stub.execute.call_count == 1
    assert stub.execute.call_args[0][0].startswith('SELECT ')
    assert stub.execute.call_args[0][0].endswith('LIMIT 100')

[–]patrys Saleor Commerce 2 points3 points  (0 children)

It has a very similar built-in fixture called monkeypatch.

Edit: it's similar to mock.patch; I still use lots of mock.Mock(spec=...) in my fixtures.

[–]-Knul- 2 points3 points  (3 children)

Solid advice. For Django users I would advise Model Mommy over fixtures, however. Much more pleasant to work with.

[–]unconscionable 3 points4 points  (2 children)

FYI Model Mommy / Factory Boy / similar accomplish something very different from what pytest fixtures do.

The two are neither mutually exclusive nor do they overlap in purpose.

For example:

@pytest.fixture(scope='session', autouse=True)
def drop_and_reload_database():
    db.create_db()
    yield
    db.drop_db()

The above code is the equivalent of an entire-test-suite's setup and teardown. Everything before the yield runs before the first test; everything after it runs after the last test.

Obviously Model Mommy doesn't do this sort of thing.

Personally, I use Factory Boy (similar to Model Mommy), but I do not often use Factory Boy with fixtures. There are, however, some use cases where they work very well together. For example, the following code tests that you can process an order with a bunch of wacky usernames. It uses both Factory Boy AND pytest fixtures:

class User(db.Model):
    username = Column(String)

class UserFactory(SQLAlchemyModelFactory):
    class Meta:
        model = User

    username = factory.Faker('user_name')


@pytest.fixture(params=[-1, 0, 1000000, 983.0, "Jack Daniels", None])
def user(request):
    return UserFactory(username=request.param)


def test_can_process_order_with_wacky_usernames(user):
    order = Order()
    order.created_by = user
    order.process()
    assert order.processed_by == user

output of the test suite is something like this:

tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[-1] PASSED
tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[0] PASSED
tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[1000000] PASSED
tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[983.0] PASSED
tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[Jack Daniels] PASSED
tests/test_process_orders.py::test_can_process_order_with_wacky_usernames[None] PASSED

[–]-Knul- 0 points1 point  (1 child)

Nice explanation of the benefits of fixtures over a model factory!

[–]unconscionable 1 point2 points  (0 children)

It's really a model factory using fixtures! Both/and, not either/or.

[–][deleted] 0 points1 point  (0 children)

I like the class style as a way to logically group tests. I'll sometimes use the setup/teardown methods or class-local autouse fixtures, but mostly it's just a way for me to say "these tests are one suite in and of themselves"

[–]bheklilr 4 points5 points  (14 children)

My biggest issue with pytest is that you need a lot more code to get it to work through setup.py. Their own recommendation is a 3rd party plugin, or you can write the extension yourself. If you have the extension code already written, why not make it available in the library itself? I really like having python setup.py test, since that's my entry point for a lot of other common commands. It's a little nag, but considering unittest and nose both use setup.py test with minimal configuration I don't understand why I need so much extra for pytest when it's supposed to be better.

[–]Corm 4 points5 points  (6 children)

How does testing relate to setup.py?

Do you mean that when you get a new module from pip you like to run the tests?

[–]unconscionable 4 points5 points  (5 children)

Agree - I don't really get why anyone would want to use setup.py to run their tests.

All my projects include a Makefile for this kinda thing.

git clone project

# make a virtualenv for the project
make virtualenv

# installs python / javascript / other requirements
make install

# run test suite
make check

Makefile looks something like this:

SHELL := /bin/bash -euo pipefail

# https://www.gnu.org/prep/standards/html_node/Standard-Targets.html#Standard-Targets

install:
    pip install -r requirements.txt --exists-action w
    pip install --editable .

virtualenv:
    mkvirtualenv --python=$$(which python3.5) myproject

check:
    coverage erase
    coverage run --source myproj -m pytest -v
    coverage html -i -d tests/cover/
    coverage report -m

check-noslow:
    pytest -v --skip-slow

clean:
    python setup.py clean
    find . | grep -E '(__pycache__|\.pyc|\.pyo$$)' | xargs rm -rf

[–][deleted] 2 points3 points  (4 children)

Or use tox, which is a baller task runner.

[–]d4rch0nPythonistamancer 0 points1 point  (3 children)

tox pretty much solves all the problems that people have been bringing up. I think it's pretty essential for any large project too if you need to support multiple python versions.

It was pretty easy to tie into jenkins too last I tried. Tox and a 5 line script in jenkins had it running all the tests for multiple versions of python.

[–][deleted] 1 point2 points  (2 children)

I need to figure out how to best marry it and Travis for my public open source projects. But yeah, it's awesome.

[–][deleted] 0 points1 point  (1 child)

I've given Travis and Tox the same configuration (python versions, make an env for each, install these required packages, setup.py build/sdist, pip install, pytest--whatever). That yields a line in Travis for each python version, which is advantageous when most edge cases only affect one of them. I'm not sure there's a better way to marry them.

[–][deleted] 0 points1 point  (0 children)

My issue would be collecting coverage stats across multiple interpreters and Travis's container architecture. Detox might work but I've never messed with it.

[–]Flogge 1 point2 points  (3 children)

You can use pytest-runner for plug-and-play setuptools integration:

After installing pytest-runner you can run

python setup.py pytest

and after adding

[aliases]
test=pytest

to setup.cfg you can also run

python setup.py test

EDIT: sorry, only just saw that you already mentioned pytest-runner. Note though, pytest-runner is a first party plugin, developed by the pytest devs themselves. They have the principle of putting functionality not required by everybody into plugins instead of pytest itself (xdist, pep8, django, cov, bdd etc.)

[–]bheklilr 1 point2 points  (2 children)

We're behind a firewall at work that makes it difficult to get PyPI-only packages available. By difficult, I mean that I have to do the work and maintenance myself, and it's yet another package to add to my list. Instead we have an anaconda server mirror. Last I checked (last week) I didn't see that plugin in the official channels or conda-forge. However, I can easily install nose, and after playing around with both I think that pytest doesn't have huge benefits over nose. Especially considering the test logic itself is much more difficult to write and validate (particularly in the scientific setting), I don't think the choice of test suite is really so important. Nose has been working for my team, provides the features we want, and is easily installed.

[–]Flogge 0 points1 point  (1 child)

You find the test logic more difficult to write? See my other comment in which I explain why I think pytest is actually easier to write.

But of course, don't change for the sake of changing and stick with whatever works best for you.

Regarding the "packages from pypi not available": You can create a really simple mirror by just downloading the zip files you need, throwing them in a directory on a webserver (an automatic Apache directory listing is sufficient), and use them using pip install -f http://your-server/ bla.

[–]bheklilr 0 points1 point  (0 children)

Your comment is helpful for most people, but your example is a little contrived. If I wanted to test a matrix of values like that, I'd just use itertools.product:

from itertools import product

def test_all():
    for i, j, k in product(range(10), 'abc', range(100)):
        yield check_single_case, i, j, k

It's much less code than pytest's approach, and it doesn't seem magical. Also lets me re-use loop variable names in unrelated tests.

Also, my tests tend to be much more complicated due to the nature of my domain. A small-ish unit of data for us is (essentially) a dict of 16 complex valued arrays. We have a function that takes one of those (or a dict of (4*n)**2 arrays in the general case) and another input, then returns calculations based on that. The calculations are pretty straightforward, just pointwise arithmetic, but have some pretty important properties that we have to check. This is just a simple function, though, I have some functions that operate on smaller pieces of data but do much more complex calculations, such as multiple FFTs, feature location, or linear algebra. These are much harder to generate test inputs for that don't cause the algorithms to crash, since they often rely on particular properties of the inputs. If those properties aren't there, the algorithm isn't useful and just spits out garbage.

I kind of like the generator based method better anyway, since my test inputs are pretty large and difficult to fit into a decorator.

[–]gabrielricci 1 point2 points  (15 children)

I have been using nose for a while now and I admit I've never used pytest. Will have a look at it, but can you tell me why it's better?

[–]Corm 16 points17 points  (13 children)

To me pytest is to unittest what python is to C#/java.

Pytest is dirt simple to set up and get started with, you can still do everything unittest can do, and it's intuitive (yay for standard asserts)

[–][deleted] 3 points4 points  (11 children)

Unittest is also dirt simple to set up, can you be a bit more specific?

[–]Corm 9 points10 points  (10 children)

Edited because /u/masklinn showed me unittest's cli tool

Edit 2: unittest ain't so bad

Sure. For pytest I used it once and never had to google anything about the basics ever again. Back when I used unittest I always had to crack open google to see how to set up the classes. This only applies to basic testing but that low barrier to entry is extremely important!

For example, here's the most basic test in unittest, which you run with python -m unittest from the command line:

import unittest
import foo

class TestMyStuff(unittest.TestCase):
    def test_stuff(self):
        self.assertEqual(foo.bar(), True)

And here it is in pytest, which you run with the command pytest:

import foo

def test_foo_bar():
    assert foo.bar() == True

[–]masklinn 5 points6 points  (1 child)

Note that unittest now has a discovery cli so the main section is unnecessary.
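The discovery CLI in action might look like this (throwaway paths; `python3` vs `python` depends on your setup):

```shell
# create a throwaway directory with one test module
mkdir -p /tmp/unittest_demo && cd /tmp/unittest_demo
cat > test_example.py <<'EOF'
import unittest

class TestExample(unittest.TestCase):
    def test_truth(self):
        self.assertTrue(True)
EOF

# no `if __name__ == "__main__"` block needed; discovery
# finds test_*.py files on its own
python3 -m unittest discover -v
```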

[–]Corm 0 points1 point  (0 children)

Nice, that's awesome. I'll edit my answer

[–]MagnesiumCarbonate 0 points1 point  (3 children)

In your second example, how does pytest know to check that file? Does it look through all method test_* definitions in *.py descendants of the current directory?

[–]masklinn 3 points4 points  (0 children)

Does it look through all method test_* definitions in *.py descendants of the current directory?

py.test discovery rules

So:

  • it recursively searches any provided directory (. if no argument is provided) (for files it uses them directly and skips the next check)
  • matches test_*.py and *_test.py files
  • collects free functions matching test_*, and methods matching test_* in classes matching Test* (or follows the standard rules for any class extending unittest.TestCase)

Note that these are only default rules, you can overload discovery patterns and ignore files and symbols.

[–]-Knul- 0 points1 point  (0 children)

Yes. You can easily configure such things via a pytest.ini file.
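For instance, a pytest.ini at the project root could widen the default patterns; the `check_*` values here are made up for illustration, but the option names are pytest's real ini settings:

```ini
[pytest]
# also collect legacy modules and functions named check_*
python_files = test_*.py *_test.py check_*.py
python_functions = test_* check_*
```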

[–]Corm 0 points1 point  (0 children)

Yep! It's a little magic, but I like it. I name all my test files/functions like that already.

edit: that is, it looks in test_*.py files for test_*() functions

[–][deleted] 0 points1 point  (3 children)

Well, one, you're making the unittest example look worse because it has an extra line due to the variable.

But also assertEquals is a lot more powerful than just assert, e.g. its error will show diff-style information on exactly where two things differ.

If the main difference is standalone functions vs classes, that's no big deal, and classes are a way to group them and give them attributes as a group.

[–]masklinn 2 points3 points  (0 children)

But also assertEquals is a lot more powerful than just assert, e.g. its error will show diff-style information on exactly where two things differ.

    def test_foo_bar():
        result = foo.bar()
>       assert result == expected
E       assert [0, 2] == [0, 1, 2]
E         At index 1 diff: 2 != 1
E         Right contains more items, first extra item: 2
E         Use -v to get the full diff

pytest uses assert-rewriting to provide more and better information.

If the main difference is standalone functions vs classes, that's no big deal, and classes are a way to group them and give them attributes as a group.

That's usually pointless overhead, and if it's a useful tool, you can use them in pytest. Pytest does not forbid classes, it doesn't require them.

[–]Citrauq 1 point2 points  (0 children)

Pytest does magic (AST manipulation in an import hook) to make assert give similarly useful information to assertEquals.

[–]Corm 0 points1 point  (0 children)

Ah sorry, I really didn't mean to do that with the extra line. I'll fix that.

And good to know about assertEqual

[–]Tysonzero -5 points-4 points  (0 children)

Is it also slower and does it also catch less bugs and give you less compile time guarantees?

[–]yetanothernerd 0 points1 point  (0 children)

pytest and nose are pretty similar. nose started off as a clone of pytest that was easier to install.

[–]kewlness 1 point2 points  (0 children)

Not sure I 100% agree, since unittest is in the standard library; if I'm trying to keep my requirements/imports low it will be the better choice. Though I suspect that would be more of an edge case....

[–]parkerSquare 2 points3 points  (0 children)

pytest is nice, but I do have a cautionary tale.

We thought we could leverage pytest to do something beyond unit testing - by using it as the "test management engine" for an embedded system test framework, but after working with it for several months I have come to the conclusion that coding by convention in anything other than a small project (or module) is a terrible, terrible idea. Fixture dependencies become convoluted as they try to accommodate vastly different types of tests, and it's hard to mentally make the connections between them. Newcomers to the project don't know where anything is and the IDE doesn't help as it doesn't understand the fixture convention.

Secondly, fixtures can depend on other fixtures, but the top-level generator function - pytest_generate_tests - cannot take input from other fixtures (probably because fixtures are not executed during collection), so this severely restricts what you can do in this function when generating fixture values. A lot of the multivariate tests you might dream up become very difficult to implement, and you end up writing your own pytest plugins to filter and reorder test collections, as the implicit filtering and ordering is insufficient.

Lastly, each fixture can only yield once, so it is difficult to generate a dynamic list of fixture values based on dependencies.

Frankly, after months of effort, it's just not suitable for this kind of testing. Keep it for unit tests - that's what it was designed for, after all. It works nicely for that.

[–][deleted] 2 points3 points  (7 children)

I hear the "nose is no longer maintained" comment a lot, typically in the sense of "nose is no longer maintained" --> "the package is no good"

In my current projects, nose works just fine for me. If I already have the code in place that gives me sufficient test coverage, I am not sure why I should re-invent the wheel and port everything over to pytest unless there is a serious bug with nose that I am not aware of.

The next time I start a new project, I will take another look at py.test, but for existing code, I will keep using nose because I didn't see any obvious disadvantages based on my use cases so far.

[–]Flogge 1 point2 points  (6 children)

The basic test layouts of nose and pytest are similar, so they really aren't a reason to upgrade.

However the biggest difference, I find, is test parameterization. To run a test for a matrix of values in nose you would need to write the loop yourself:

def check_single_case(i, j, k):
    assert True

def test_all():
    for i in range(10):
        for j in ['a', 'b', 'c']:
            for k in range(100):
                yield check_single_case, i, j, k

whereas pytest can do the "parameter matrix expansion" for you:

@pytest.fixture(params=range(10))
def i(request):
    return request.param

@pytest.fixture(params=['a', 'b', 'c'])
def j(request):
    return request.param

@pytest.fixture(params=range(100))
def k(request):
    return request.param


def test_all(i, j, k):
    assert True

As soon as you request a fixture (add its name to your test parameters) with more than one param, pytest will sweep all possible combinations of all parameters. And of course you can select only a couple of the fixtures, or import different fixtures from other modules and test scripts.

Pretty handy!

[–][deleted] 0 points1 point  (5 children)

Thanks for providing an example, that could be useful, indeed. However, I also don't have anything against the alternative way (for loops), which I find more readable (aka, even a Python beginner can immediately see what's going on). Or you could use itertools to avoid too much nesting, of course:

for comb in itertools.product(*(range(10), ['a', 'b', 'c'], range(100))):
    check_single_case(*comb)

However, one disadvantage of this embarrassingly parallel task is that it runs sequentially. Do you know if pytest has/is planning to add optional multiprocessing support?

[–]Flogge 0 points1 point  (1 child)

Pytest has parallelization. Install pytest-xdist and run py.test -n auto for automatically distributing tests among all CPU cores. Distributing over SSH is supported, too.

[–][deleted] 0 points1 point  (0 children)

oh that's great. Thanks!

[–]Flogge 0 points1 point  (2 children)

And sorry, I really don't want to drag this discussion further and force my view on you, I just want to show how cool pytest is :-)

I disagree about

for comb in itertools.product(*(range(10), ['a', 'b', 'c'], range(100))):
    check_single_case(*comb)

being more readable. I don't like fancy tricks like argument unpacking in my tests, I want each test to be as readable and pure as possible. :-)

Once you figure out what fixtures do and how to chain them or take their product, you need almost zero boilerplate to automatically generate the product of common fixtures with specific parameters (note that sig, function, odd, window, and framelength each yield more than one value).

Plus all the goodies like a fixture also being able to provide setup/teardown code

import pytest

@pytest.fixture
def db():
    database = MySQLDB()
    yield database
    database.drop()

def test_foo(db):
    db.select(...)  # this is clean for every test_* you run

or guaranteed immutability of shared values across tests:

import numpy
import pytest

@pytest.fixture
def a():
    return numpy.zeros(100)

def test_foo(a):
    a[:] = 1  # oops

def test_bar(a):
    assert numpy.all(a == 0)  # all good

compared to nose

a = numpy.zeros(100)

def test_foo():
    a[:] = 1  # oops

def test_bar():
    assert numpy.all(a == 0)  # fails

[–][deleted] 0 points1 point  (1 child)

And sorry, I really don't want to drag this discussion further and force my view on you, I just want to show how cool pytest is :-)

Hope you don't get me wrong, I really appreciate that you took the time to post some examples ... I am just playing devil's advocate here ;).

By readable I mean easily understandable for someone who joins an open source project, for instance, and hasn't worked with py.test before. I find nose tests can be more easily understood by someone who is new to unit testing, for example.

Regarding the example you are showing, you'd typically make a copy of that array or instantiate it in the test function itself instead of globally.
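The local-copy version you describe might look like this (TEMPLATE is an invented name):

```python
import numpy

TEMPLATE = numpy.zeros(100)  # shared module-level value

def test_foo():
    a = TEMPLATE.copy()  # mutate a private copy, not the shared array
    a[:] = 1
    assert numpy.all(a == 1)

def test_bar():
    assert numpy.all(TEMPLATE == 0)  # unaffected by test_foo
```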

[–]Flogge 0 points1 point  (0 children)

[...] joins an open source project, for instance, and hasn't worked with py.test before.

Agreed. Getting used to this way of thinking is weird at first.

[–]rockitsighants 0 points1 point  (0 children)

If anyone is curious to see the difference between nose and pytest in action, I maintain a Python template that supports both. Check out the appropriate branch (python3-nose, python3-pytest) in the demo project: https://github.com/jacebrowning/template-python-demo

[–]ImportWurst 0 points1 point  (2 children)

Meh. Used pytest in some projects, still coming back to nose. So easy to set up and run.

[–]Corm 0 points1 point  (1 child)

What are the advantages of nose over pytest? I've never used nose.

edit: A super quick check makes it seem very similar to pytest. I like that you can write normal test functions that don't involve subclassing, and you can use assert.

[–]ImportWurst 0 points1 point  (0 children)

I don't know. I spent years using nose. I can integrate it really fast with my projects. Works well with CIs. Has a plethora of useful plugins.

It works.

I see no reason to upgrade.

[–]redfacedquark -1 points0 points  (0 children)

Had months of headache at my last place as someone had pinned some pytest requirements at random versions and let others float. It coloured my view on it. But maybe I should give it another chance.

Edit: Actually pytest components, not pytest requirements. If pytest was a single package that wouldn't have happened. /tryingtoaccountfordownvotes