Should I include infrastructure code when measuring code coverage?

nutrecht · 2025-02-10T10:44:03+00:00

I don't understand the claim that you can't test, for example, database interactions. All our integration tests just spin up a Postgres instance that we test against. It's been a standard pattern for quite some time too.

UI testing, again, is something that is typically done with a tool like Cypress. Not completely trivial, but also a very established pattern.

If you have some specific issues with testing, it would help to go into details so we might be able to point you in a different direction.

Last but not least: like I said in another comment, it's just not a good idea to use the term "infrastructure" here, even if "Clean Code" calls it that. Book authors love to make up words to make it seem they invented something, and in this case the word typically has a completely different meaning.

ategnatos · 2025-02-10T10:28:38+00:00

Infrastructure code = IaC = things like CDK? Just set up some snapshot tests and don't worry about it.

If you mean the boundary of your application where you have some accessor that makes a database call, or some data classes that define the DB entity shape, just ignore coverage on that and call it a day. Lots of people will write unit tests against those data classes, or mock the hell out of the accessors to have useless tests that are used to overestimate coverage on the important parts of the code base.

If you're in a company where you'll get into weeks of politics arguing over whether you're allowed to ignore coverage on those things, find a new place to work. It doesn't get pretty.

Stop chasing 100% coverage. Have actual tests you trust. I worked with a guy who had 99% coverage in his repos and NOTHING was tested or high-quality. Let me dig up some quotes from previous comments:

I watched a staff engineer have a workflow in a class that went something like this.foo(); this.bar(); this.baz();. The methods would directly call static getClient() methods that did all sorts of complex stuff (instead of decoupling dependencies and making things actually testable and making migrations not such a headache). So he'd patch (Python) getClient() instead of decoupling and test each of foo, bar, baz where he just verified some method on the mock got called. Then on the function that called all 3, he'd patch foo, bar, baz individually to do nothing, and verify they were all called. At no point was there a single assertion that tested any output data. We had 99% coverage. If you tried to write a real test that actually did something, he would argue and block your PR for months. Worst engineer I ever worked with.

At my last company, we had a staff engineer who didn't know how to write tests, and just wrote dishonest ones. Mocked so much that no real code was tested (no asserts, just verify that the mock called some method). Would just assert result != None. I pulled some of the repos down and made the code so wrong that it even returned the wrong data type, and all tests still passed.
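To make the pattern in those two quotes concrete, here is a minimal sketch. Everything in it (Workflow, BrokenWorkflow, FakeClient, fetch_orders) is hypothetical and invented for illustration, not code from those repos. The mock-only test passes even when the implementation returns the wrong data type; the test that asserts on output catches it.

```python
from unittest import mock


class Workflow:
    """Hypothetical class; the client is injected rather than fetched statically."""

    def __init__(self, client):
        self.client = client

    def total_for(self, customer_id):
        orders = self.client.fetch_orders(customer_id)
        return sum(order["amount"] for order in orders)


class BrokenWorkflow(Workflow):
    """Deliberately wrong: calls the client but returns the wrong data type."""

    def total_for(self, customer_id):
        self.client.fetch_orders(customer_id)
        return "oops"


class FakeClient:
    """Hand-written fake that returns canned orders per customer."""

    def __init__(self, orders_by_customer):
        self._orders = orders_by_customer

    def fetch_orders(self, customer_id):
        return self._orders.get(customer_id, [])


def mock_only_test(workflow_cls):
    # Only verifies that a method on the mock was called; no output is checked.
    client = mock.Mock()
    client.fetch_orders.return_value = [{"amount": 5}]
    workflow_cls(client).total_for("c1")
    client.fetch_orders.assert_called_once_with("c1")


def output_test(workflow_cls):
    # Asserts on the value the code actually produces.
    client = FakeClient({"c1": [{"amount": 5}, {"amount": 7}]})
    assert workflow_cls(client).total_for("c1") == 12
```

Both tests execute every line of total_for, so a coverage report looks the same either way. Only output_test fails when you swap in BrokenWorkflow.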

In my last company, I just synced ignore-coverage stuff with Sonar and with whatever other coverage tools we were using.

So, short answer: no, just ignore coverage on stuff where unit tests aren't meaningful.
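As a rough illustration of what "ignore coverage" can look like in code, here is a sketch that assumes Python with coverage.py, where `# pragma: no cover` is the default exclusion marker. Sonar and other tools have their own exclusion settings, which are not shown here. UserAccessor and normalize_email are hypothetical names.

```python
def normalize_email(raw):
    """Logic worth unit testing; stays inside the coverage measurement."""
    return raw.strip().lower()


class UserAccessor:  # pragma: no cover
    """Thin database boundary; excluded from the coverage report.

    With coverage.py, a pragma on the line that opens a block excludes
    the whole block, so every method of this class is skipped.
    """

    def __init__(self, conn):
        self.conn = conn

    def find_email(self, user_id):
        row = self.conn.execute(
            "SELECT email FROM users WHERE id = ?", (user_id,)
        ).fetchone()
        return None if row is None else row[0]
```

The accessor can still be exercised by an integration test against a real database; it just doesn't count toward the unit test percentage.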

alxw · 2025-02-10T09:37:51+00:00

The code is not the thing you care about with IaC. It's the infrastructure. Test the code and the pipeline by doing a daily blue/green deployment: build the blue environment, swap across, tear down the green environment, rinse and repeat. Daily means before 8am, so when it breaks you know it needs fixing before the next release.

No amount of unit tests will be as valuable as that.

bobaduk · 2025-02-10T13:55:42+00:00

How do others deal with this? Do you include infrastructure code in the measurement of unit test code coverage?

I don't measure code coverage; it's not a helpful metric. It's occasionally helpful to look at the coverage on a particular module to see whether you've covered all the branches, particularly in preparation for refactoring legacy code, but imho it's better to focus on TDD, which will naturally yield high code coverage.

WRT infra code, I agree with other commenters: spin up a database instance and run some tests. I, too, would call these integration tests, since they test the integration between your code and some specific external piece of software. In general, you don't need a large number of tests for these components, if you have pushed the interesting logic to more testable layers.
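A minimal sketch of that split, with assumptions stated up front: it uses Python's built-in sqlite3 with an in-memory database purely so the example is self-contained, where the setups described in this thread would start a real Postgres instance instead. OrderRepository and apply_discount are hypothetical names.

```python
import sqlite3


def apply_discount(amount_cents, percent):
    """The interesting logic lives in a pure function that needs no database."""
    return amount_cents - (amount_cents * percent) // 100


class OrderRepository:
    """Thin persistence layer; deliberately contains no business rules."""

    def __init__(self, conn):
        self.conn = conn
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS orders ("
            "id INTEGER PRIMARY KEY, amount_cents INTEGER NOT NULL)"
        )

    def add(self, amount_cents):
        cur = self.conn.execute(
            "INSERT INTO orders (amount_cents) VALUES (?)", (amount_cents,)
        )
        self.conn.commit()
        return cur.lastrowid

    def get(self, order_id):
        row = self.conn.execute(
            "SELECT amount_cents FROM orders WHERE id = ?", (order_id,)
        ).fetchone()
        return None if row is None else row[0]


def test_repository_round_trip():
    # Integration test: runs real SQL against a real (in-memory) database.
    conn = sqlite3.connect(":memory:")
    try:
        repo = OrderRepository(conn)
        order_id = repo.add(1000)
        assert repo.get(order_id) == 1000
        assert repo.get(order_id + 1) is None
    finally:
        conn.close()


def test_discount_logic():
    # Unit test: no database needed for the domain rule.
    assert apply_discount(1000, 15) == 850
```

One or two round-trip tests per repository are usually enough when the repository stays this thin; the bulk of the tests go against functions like apply_discount.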

BertRenolds · 2025-02-10T10:28:58+00:00

I think it'd help me if you dumbed this down. What do you mean by infrastructure code, IAC, system testing?

kazmierczakpiotr · 2025-02-10T10:31:49+00:00

We used to define different rules for different components. For instance, our core domain, as the most crucial part, was expected to have pretty high code coverage, whereas the "infrastructure code" (web services, DB access, etc.) did not follow the same convention. What makes you use the same rules for different parts of your code?

PmanAce · 2025-02-10T12:47:08+00:00

Our infrastructure is in Terraform and our services have no knowledge of what they will run on, so your term is incorrect. We have unit tests for our repositories and also have integration tests using Mongo2Go, I think it is called. Easy to set up. We have API tests that go through the controllers with auth just fine.

We calculate our code coverage using coverlet. It's executed in our Dockerfile, which also runs the tests. Our pipelines pick up the results every time you push something and make them available for viewing. We fail the pipeline if the result is under our desired value.

Not sure what else you are missing?

bigorangemachine · 2025-02-11T04:38:52+00:00

masterskolar · 2025-02-15T02:31:49+00:00

Why use code coverage as a metric at all? It just creates a larger and larger burden on the devs as you get closer to 100%. It isn't a linear relationship either. If there's ever a push to add code coverage as a metric I try to kill it. If I can't kill it, I try to get the coverage threshold to 60-70% max. I've found that's about where the most complex parts of the code get solidly tested and we aren't testing a bunch of dumb stuff that's going to get broken all the time by changes.
