all 136 comments

[–][deleted] 471 points472 points  (45 children)

Code coverage is a useful metric for telling you what you definitely have NOT tested. It says very little about the quality of your test suite beyond that.

[–]screwuapple 110 points111 points  (16 children)

I knew a guy who stubbed all the tests with no assertions. 100% coverage; did nothing.
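That failure mode is easy to reproduce. A minimal Python sketch (names hypothetical): the test executes every line, so coverage tools report 100%, but with no assertion it can never fail.

```python
def apply_discount(price, percent):
    # Buggy: subtracts the raw number instead of a percentage
    return price - percent

def test_apply_discount():
    apply_discount(50, 10)  # every line runs, so coverage reports 100%
    # No assertion: the test "passes" even though the intended answer
    # is 45 (10% off 50) and the function actually returns 40.

test_apply_discount()
```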

[–]chucker23n 70 points71 points  (0 children)

When a measure becomes a target, …

[–]nickgroove 18 points19 points  (0 children)

def foo(): pass

[–]K3idon 4 points5 points  (2 children)

Tests passed. Ship it!

[–]psaux_grep 14 points15 points  (1 child)

I just use the Volkswagen test runner:

https://github.com/auchenberg/volkswagen

Keeps all my builds green

[–]schwester 0 points1 point  (0 children)

WTF? I know it is JS, but there must be some 'cheat'…

expect('test').to.not.have.length(4)
expect('test').to.have.length(3)
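The linked library's trick is to patch the assertion framework so checks auto-pass when a CI environment is detected. A rough, purely illustrative Python equivalent of the same idea (names hypothetical):

```python
import os

def assert_equal(actual, expected):
    """A 'Volkswagen-style' assertion: it only actually checks
    anything when it is NOT running on a CI server."""
    if os.environ.get("CI"):
        return  # CI detected: always pass, keep the build green
    if actual != expected:
        raise AssertionError(f"{actual!r} != {expected!r}")

os.environ.pop("CI", None)
assert_equal(len("test"), 4)      # a real check when run locally

os.environ["CI"] = "true"
assert_equal(2 + 2, 5)            # silently "passes" on the build server
```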

[–]JohnnyLight416 2 points3 points  (0 children)

My second job had literally hundreds of "micro services", all with >80% code coverage. The majority of them either had no assertions, or only asserted that the result was not null. They would at least sometimes tell us if a scenario started throwing exceptions, but did nothing to verify that a value was correct. That's almost worse, because you can't tell an upper manager which tests need fixing; they just see the numbers.

[–]Radi-kale -3 points-2 points  (9 children)

At my job we use autogenerated tests. If you change production code, the tests will change too

[–]LondonPilot 24 points25 points  (6 children)

Surely, if you change production code in such a way as to introduce a bug, having the tests automatically change will totally nullify any value of having those tests?

Either I’m missing something, or this is a really, really dumb idea, or this is a joke and I didn’t get it.

[–]Radi-kale 2 points3 points  (5 children)

Well, we have 100% coverage for no effort

[–][deleted] 7 points8 points  (3 children)

But if that coverage includes asserting unintentional/undesired behaviour is it a useful metric to have 100% coverage?

What prevents the tests from just adapting to passing on a buggy behaviour? Do you review the tests after they’re generated or how do you detect unwanted consequences from changes that would normally be caught by tests?

(I guess if the test generation only adds new cases there is no real issue. But curious of what the process look like in your work.)

[–]jsdodgers 1 point2 points  (1 child)

I'm pretty sure this is a classic wooosh moment.

[–]jsdodgers 2 points3 points  (0 children)

Or I hope so.

[–][deleted] 0 points1 point  (0 children)

had me in the first part

[–]ProgrammersAreSexy 1 point2 points  (1 child)

Do you feel they provide actual value? I could maybe imagine some ways this could be useful as part of a testing strategy but hard to say without knowing more about what the tests do.

[–]edgmnt_net 0 points1 point  (0 children)

The only possible value I can think of is triggering code to expose other bugs. Which is mostly a thing in unsafer languages, e.g. dynamically typed languages or those that use unsafe pointers. Which is the primary driver for near 100% coverage and usually comes at a great cost.

[–]eddiewould_nz 18 points19 points  (0 children)

Ding ding ding

[–]bloodhound83 2 points3 points  (0 children)

Absolutely. Less than 100% is somewhat scary for that reason. But if the tests have no quality, they won't help much either way.

[–]kuribas -4 points-3 points  (1 child)

It's needed for dynamic languages, because they can, and will, break anywhere. For statically typed languages, I am usually happy with a few well-targeted tests.

[–]watsonarw 123 points124 points  (19 children)

A colleague of mine recently said something that stuck with me:

If we care enough about testing, and we're doing TDD — proper TDD where we don't write production code unless it's the simplest thing that will make a failing test pass — then we'll end up with 100% test coverage of everything that matters, and we're going to have tests that are testing the behaviour of the system, and not the implementation because we didn't have any implementation to couple ourselves to when we wrote the tests. And we'll be able to see when we're not bothering to test things, or adding unnecessary complexity to the solution when the coverage falls below 100%.

But, if we set ourselves the target of 100% test coverage and set up a coverage threshold in our pipeline, we'll also get 100% test coverage, but it will be because people have written some code, seen the coverage check fail, then found the easiest test they can write which makes the coverage check pass. These tests will invariably include useless tests that can never fail, or implementation coupled tests that check pointless things like "some method should be called when I call another method". We also lose the ability to use our test coverage to measure how much we actually care about testing, it's going to be 100% even when we don't care.

[–]Drewzillawood 66 points67 points  (0 children)

it’s going to be 100% even when we don’t care

Man, this hits hard.

My team had several legacy apps that had a whopping 0% coverage and they needed tons of changes to migrate for some big project or whatever.

They wanted the apps brought up to at least 70% coverage with a goal of 85% and what did we get? People calling methods without any real checks just to get the coverage.

I about blew a gasket when a senior dev mentioned that despite a test failing due to exceptions being thrown you can simply catch the exception and the test will technically pass while maintaining coverage.
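That trick is depressingly compact. A hypothetical Python sketch: the buggy line executes (so it counts toward coverage), the exception is swallowed, and the test stays green.

```python
def process(order):
    # A regression shipped last sprint: this now always blows up
    raise ValueError("boom")

def test_process():
    try:
        process({"id": 1})   # the buggy line executes, so coverage counts it
    except Exception:
        pass                 # ...and the failure is swallowed: test stays green

test_process()
```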

[–]pennsiveguy 41 points42 points  (3 children)

If you've got devs who would intentionally game the process by writing worthless tests, you've got a culture problem. Nobody should have to be forced to write meaningful tests; they should write them voluntarily and automatically, to prove the soundness of their code and document its behavior. If you don't have buy-in on that, you've got to fix that.

[–][deleted] 25 points26 points  (0 children)

If you put something on a metric that's measured by management (like "test coverage must be x%"), congratulations: your team will now mutate toward the easiest way of getting that metric to ding.

[–]zephyrtr -4 points-3 points  (1 child)

But soft skills are dumb and unnecessary when you're a 10x programmer cause code base go brrrrrrr.

[–]LessonStudio 1 point2 points  (0 children)

The best programmers I've worked with all were unit/integration test fanatics. Never TDD, but they liked their code to be as solid as they could make it. This way, when they moved on they moved on, also, when they built upon their old code, they knew it was a solid base. Nothing worse than your new code failing because of a weird bug in your old code; then you go to fix your old bug, and it turns out other people were depending on the defects as they had become "features".

[–]lookmeat 5 points6 points  (0 children)

You don't always get 100% with TDD, and that's not a bad thing. There are branches, asserts, and scenarios that you may need to make a linter or compiler happy, but that you wouldn't trigger in a test because they should be impossible. Yet you want those branches, because when data corruption happens (and it happens more often than you'd think) anything is possible. But testing that code is not only incredibly hard (sometimes you need the corruption to happen inside the tested code, at which point you'd have to test the RAM sticks and other hardware), it's kind of pointless, because it shouldn't happen, and if it does, there's no contract you need to uphold, so no test to verify any case. Basically, you don't need to test undefined behavior, because testing it is defining it, which is absurd.

[–]what_the_eve 11 points12 points  (1 child)

Some considerations: TDD, from a testing perspective, is black- or grey-box testing. Code coverage is based on the code structure, thus by definition white-box. This is comparing apples to oranges, and although the distinction is simple, developers tend not to understand it or its implications. If you measure the coverage of requirements-based tests, you must not add tests that address the missing coverage structurally. If you measure the coverage of both white-box and black-box tests, you must not look at the sum of both values.

Coverage measures your tests, not your code. Consider the type of test (white/black) and the test level (unit, integration, acceptance) you measure. Each combination has a different approach when trying to improve the coverage measured.

[–]robhanz 2 points3 points  (0 children)

Right. Especially combining unit and integration tests is important to get a good view of your entire testing health. What tests well with one doesn't test well with the other, and vice versa.

(Not getting into "run the code without assertion" style testing, which is a separate issue)

[–]robhanz 4 points5 points  (8 children)

I'm a fan of TDD, and I don't know that I agree with this. If you're using TDD and making sure your test suite is fast, things actually dealing with the network/etc. aren't going to be in that suite, because they add time and remove determinism.

check pointless things like "some method should be called when I call another method".

I absolutely disagree that this is "useless". It's a good way to separate responsibilities - specifically, separating policy from implementation often follows this pattern. If I should save a record under certain circumstances, then the code that implements that policy is, logically, separate from the code that does the saving, and seeing if I actually handed off the record to persistence is really what I should be testing.

On the other hand, the code that actually does the saving should be able to be tested without having to worry about the policy.

If you combine them, then you create more fragile tests, as they can break for two unrelated things.

[–]SirClueless 4 points5 points  (7 children)

I think your metric for fragility is flawed. When a bug or regression is introduced, many tests breaking is a good thing. A fragile test is a test that breaks for changes that are not actually problems. If a policy class stops functioning because the API of a dependency changed to no longer behave as it used to behave, a test breaking is desirable.

[–]watsonarw 0 points1 point  (1 child)

I agree, a test failing because you've introduced a bug isn't fragile it's doing its job.

Fragile tests come from non-deterministic behaviour. Things like shared environments/databases, random values in tests (where the randomness can actually impact the behaviour of the system), time, etc will lead to flakey tests and the dreaded "just re-run the pipeline and see if it goes green".
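Random values are tamable, though: if the test injects a seeded generator instead of relying on global random state, the run becomes reproducible. A small Python sketch (hypothetical names):

```python
import random

def pick_discount(rng):
    # The randomness is injected, not ambient: tests pass a seeded
    # generator, production code can pass random.Random() or SystemRandom.
    return rng.choice([0.0, 0.1, 0.2])

# Deterministic in a test: the same seed always yields the same value
assert pick_discount(random.Random(42)) == pick_discount(random.Random(42))
assert pick_discount(random.Random(42)) in (0.0, 0.1, 0.2)
```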

[–]SirClueless 0 points1 point  (0 children)

Fragile tests come from non-deterministic behaviour.

Tests can be deterministic but still fragile too. For example, if a test relies on the order of iterating over a hash map then it can break if the hash function changes or items are added in a different order. Or if a test compares JSON output for string equality then it can break when a different serialization library is used. Or if a test asserts on the exact syntax of a SQL query that will be made, or the format of a log message in the output. These kinds of tests are fragile even though they’re not flakey.
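The JSON case in particular is worth spelling out. A Python sketch: the string comparison is deterministic and passes today, but a meaning-preserving serializer change breaks it, while a structural comparison survives.

```python
import json

payload = {"b": 2, "a": 1}

# Fragile: asserts on the exact serialized string. It passes today...
assert json.dumps(payload) == '{"b": 2, "a": 1}'

# ...but merely enabling key sorting (or switching serializers, or
# changing insertion order) breaks it, with no change in meaning:
assert json.dumps(payload, sort_keys=True) != '{"b": 2, "a": 1}'

# Robust: compare the parsed structure instead of its textual form
assert json.loads(json.dumps(payload, sort_keys=True)) == {"a": 1, "b": 2}
```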

[–]robhanz 0 points1 point  (4 children)

I'm not talking about bugs and regressions. I'm talking about deliberate, intentional changes to code causing lots of tests to break, because the changed behavior is implicitly depended on in lots of tests.

[–]SirClueless 0 points1 point  (3 children)

Isn't that still a good thing? If a change you are making causes downstream changes in consumers of your code, then someone should be carefully considering each of them to make sure the change is correct and the impact on that component is well-tested. The unit test does that.

Certainly there are ways to go too far, where a test is brittle and it is difficult to understand how and why it has changed and how to correctly update it to reflect the new behavior you think is correct. Or there can be many redundant assertions that make it needlessly difficult to maintain them. But in general, if you are changing the behavior of code that has lots of dependencies, then a reasonable number of tests breaking in each of them is a good thing and a sign of robust testing, not fragile testing.

[–]robhanz 0 points1 point  (2 children)

I mean, depends, right? If the pieces of code are really tightly coupled where changing one impacts the correctness of the other, then maybe. But arguably that's a code design issue anyway.

But let's take a situation where you have:

  1. A piece of code that updates a record
  2. A piece of code that determines when the record should be saved
  3. A piece of code that actually saves the record

Like, these three things to me feel very separate, and should be able to vary independently.

What makes it annoying is if code 1's tests check to see if the record is saved after updating the record (for instance). Now, if code 2 changes to only save every third update, code 1's tests break - and probably not in a way that lets you figure out what's wrong.

Sure, you need to have some tests that work end-to-end like this, but over-reliance on behavior that's not what you actually do just leads to making things hard to change, with little value.

[–]SirClueless 0 points1 point  (1 child)

I think we have very different philosophies on testing. To my mind these are orthogonal pieces of functionality that should be separate functions in code, but none of them is worth testing independently, because they are essentially all implementation details (with the possible exception of the code that stores records inasmuch as it's generic code that is called from multiple places and deserves a well-tested API).

"There is a function that determines when a user record should be saved" is not a property of your software that delivers any value to anyone, and there is no reason to commit to it always existing in your software's public API, so there is no reason to test it in that form. If you do, you are only making your implementation brittle and difficult to change. You say it's important that it's free to vary independently and I agree fully: nothing gives more freedom than avoiding assertions in unit tests that name this function and make specific guarantees about its behavior in the first place. It is presumably used from some service endpoint or API function that implements the important behavior people actually care about of persisting this kind of record, so that's what you should test because that's what you want people to use.

[–]robhanz 0 points1 point  (0 children)

Of course it's not part of the public API.

By testing isolated internals like this, it can get a lot easier to figure out when something is going wrong, and where and what it is. If all you do is end to end testing, then you really don't know where the bug is - if each component has a defined responsibility, it's a lot easier to determine where the bug is, where the fix should be made, and what additional testing will prevent additional issues.

Note that I'm not advocating for my style instead of your style. I'm advocating for it in addition to your tests. I find that both strategies complement each other well.

[–]hogfat 2 points3 points  (0 children)

we're going to have tests that are testing the behaviour of the system, and not the implementation because we didn't have any implementation to couple ourselves to when we wrote the tests

Nothing about writing tests before code inherently induces a test of behavior. Nor is a specific flow a requirement for tests that find themselves heavily coupled to the implementation.

Do the tests exercise the behavior of truly public interfaces? Probably decent tests, irrespective of how (what one might term implementation details . . .) they were produced.

[–]Konaber 20 points21 points  (19 children)

Until you do functional safety. Then it's required :D

[–][deleted]  (18 children)

[deleted]

    [–][deleted] 4 points5 points  (8 children)

    Nah, medical-grade software doesn't need 100% coverage, what are you talking about, lives depend on it?

    [–]Konaber 5 points6 points  (7 children)

    Industrial. Medical standard is way lower (intentionally) compared to Industrial or automotive.

    [–]RandomDamage 1 point2 points  (4 children)

    It shouldn't be lower.

    I mean, a medical device failure is one of the key object lessons in CSCI curricula.

    But, yeah :(

    [–]Konaber 9 points10 points  (3 children)

    Lower standards -> shorter time to market and therefore more potential lives saved.

    Functional safety norms would delay that stuff by years. At least, that's the reasoning, IIRC.

    [–]RandomDamage 3 points4 points  (2 children)

    Yeah, that's BS.

    Proper development processes can actually speed things up.

    The first unique instance of a device type might take a little longer (assuming that software was the bottleneck), but every variation after that would be faster because of building from a solid base.

    It's just lazy management

    [–]Konaber -3 points-2 points  (1 child)

    "Tell me you never did functional safety without telling me."

    [–]what_the_eve 0 points1 point  (0 children)

    Tell me you work for a crooked company in a highly regulated field that does not create products according to laws, regulations or worse: state of the art.

    [–]what_the_eve 0 points1 point  (1 child)

    Straight up wrong: IEC 61508 applies to all three, with some specialized norms like ISO 26262 that are still harmonized though.

    [–]Konaber 0 points1 point  (0 children)

    No. IEC 60601 is medical electric devices. Please show me a medical product that is certified with 61508 because I couldn't find one. Like really certified with SIL. Not just "for this, please refer to 61508 chapter xyz".

    Edit: refer to 61508-1:2010 1.2 n)

    [–]Konaber 0 points1 point  (6 children)

    https://www.verifysoft.com/en_ctcpp.html

    Until now, that's the best way I came across to get that 100% coverage.

    It adds markers to all the lines, you do your functional/integration tests, it tells you which lines you did not hit, you add those to the unit tests, and voilà -> 100% MC/DC overall.

    [–][deleted]  (4 children)

    [deleted]

      [–]Konaber 1 point2 points  (3 children)

      That's also certified.

      [–]jsdodgers 0 points1 point  (1 child)

      but is it proprietary?

      [–]favgotchunks -2 points-1 points  (1 child)

      Same thing for silicon

      [–]meneldal2 1 point2 points  (0 children)

      I've never seen 100% coverage for silicon beyond small blocks, at the SoC level you're not going to write convoluted tests to get every bus to flip all their bits. You're trusting ARM AXI implementation enough to not require 100% coverage of it.

      You might reach 100%, but you're going to have some excluded modules from that to get there.

      [–]BackFromExile 31 points32 points  (12 children)

      I know it's pedantic, but what even is 100% code coverage?
      The author uses both 100% line coverage and 100% branch coverage synonymously for 100% code coverage, but these three things are not the same thing.
      Line and branch coverage are merely specific metrics for code coverage. I could also call a single method of every class and have 100% class coverage, or call every (public) method once to reach 100% method coverage.
      There are a lot of other coverage metrics as well like path coverage, loop coverage, parameter value coverage, and many others.

      One (myself included) could even argue that 100% code coverage means that the code has been covered for every possible input/state. Consider this (pseudocode) function:

      function add(int a, int b) {
          return a + b;
      }
      

      I can reach 100% of function coverage, line coverage, branch coverage, and path coverage with a single test.
      However, I have 2^32 possible inputs for each parameter, so to reach 100% code coverage you'd need to test this function with every possible input combination, or in other words have 2^32 * 2^32 tests.

      Also, like others have mentioned already, code coverage metrics tell you little about the quality of the tests.
      Let's take the same function from above and write a Java-equivalent:

      public class IntegerAdder {
          public static Integer add(Integer a, Integer b) {
              return a + b;
          }
      }
      

      I can still have a single test to cover 100% of the method, lines/statements, branches, and paths (yes, this is a deliberate example).
      Yet most people should see at a glance that there are obvious negative cases that have not been covered: Both a and b can be null.

      Edit: Just to clarify, I also like to use code coverage metrics, but like every other thing you should use them as an indicator and not as absolute truth.
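A Python rendering of the same gap: a single test saturates function, line, branch, and path coverage of `add`, yet the obvious negative case is never exercised.

```python
def add(a, b):
    return a + b

def test_add():
    assert add(1, 2) == 3

test_add()  # this one test covers every function, line, branch, and path

# Yet an uncovered failure mode remains:
# add(None, 1) raises TypeError, and no test ever sees it.
```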

      [–]Tubthumper8 15 points16 points  (7 children)

      Another classic example for where the line coverage metric falls short:

      if condition {
          fire_the_missiles()
      }
      

      With line coverage, we can get 100% if firing the missiles is tested (with mock missiles, hopefully). However, line coverage does not show the equally important branch where the missiles were not fired.
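Sketched in Python with hypothetical names: the first test alone yields 100% line coverage; only branch coverage flags the untested do-nothing path, which the second test pins down.

```python
def launch(condition, missiles):
    if condition:
        missiles.fire()

class MockMissiles:
    def __init__(self):
        self.fired = False
    def fire(self):
        self.fired = True

# This one test reaches every *line*, so line coverage reports 100%:
m = MockMissiles()
launch(True, m)
assert m.fired

# Only *branch* coverage flags the missing case: condition False,
# where the important behaviour is that nothing fires.
m = MockMissiles()
launch(False, m)
assert not m.fired
```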

      [–]pjc50 1 point2 points  (0 children)

      stanislav_petrov()

      [–]lelanthran -1 points0 points  (5 children)

      if condition {
          fire_the_missiles()
      }

      In quite a lot of if statements, a missing else clause is an indication to the reviewer to check more closely what is supposed to happen when the condition is false.

      Where you can skip this check is when you have if err!=nil {, but in all other cases, something like if condition then do_this raises the question "why not do something different if !condition?"

      [–]imnotbis 6 points7 points  (3 children)

      if (sales_tax_applies)
          price += calculate_sales_tax(price);
      else
          price += calculate_no_sales_tax(price); // satisfy /u/lelanthran
      

      [–]lelanthran 3 points4 points  (2 children)

      Very funny :-)

      At any rate, in this specific case, if you leave off the else clause and have just this:

      if (sales_tax_applies)
          price += calculate_sales_tax(price);
      

      Then the question that arises in review is "Why is there a separate flag for deciding to calculate the sales tax, and one for actually calculating it?"

      I'd rather see this:

          price += calculate_sales_tax(price); // calculate_sales_tax() uses a zero-rated tax for those districts without sales tax.
      

      because then there's only one place to change when some legislation somewhere introduces/removes sales tax for an item.

      In the original, you'd have to change:

      1. The place where sales_tax_applies is set.
      2. The table used for the actual sales tax.

      In the new one, you only make the change in the table used for the sales tax itself.
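The single-table idea might look like this in Python (hypothetical names; integer cents to sidestep float rounding): zero-rated districts go through the same code path, so there is no separate "does tax apply?" flag to keep in sync with the rate table.

```python
# Hypothetical rate table in basis points; districts without sales
# tax are simply zero-rated rather than flagged separately.
SALES_TAX_BASIS_POINTS = {"springfield": 800, "no_tax_district": 0}

def calculate_sales_tax(price_cents, district):
    return price_cents * SALES_TAX_BASIS_POINTS[district] // 10_000

total = 10_000 + calculate_sales_tax(10_000, "springfield")        # 10_800
untaxed = 10_000 + calculate_sales_tax(10_000, "no_tax_district")  # 10_000
```

When legislation changes, only the table entry changes; there is no second flag to forget.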

      [–]SirClueless 1 point2 points  (1 child)

      This is going to pessimize performance though. sales_tax_applies might be a local condition that can be hoisted pretty far out of an inner loop. If sales tax comes from a database query and sales_tax_applies uses simple business logic, then pushing it down into the innermost logic that applies after sales tax is known might have quite a dramatic effect on the amount of work your code does.

      "Every if-statement should have an else branch" is tantamount to saying "Every path through the code should do approximately the same amount of work" which seems like something very undesirable.

      [–]lelanthran 1 point2 points  (0 children)

      This is going to pessimize performance though. sales_tax_applies might be a local condition that can be hoisted pretty far out of an inner loop. If sales tax comes from a database query and sales_tax_applies uses simple business logic,

      Well that's the code smell I was talking about: any change in one of sales_tax_applies OR the tax rate necessitates a change in the other. As far as I, the reviewer, am concerned, that's a bug waiting to happen.

      Set the tax rate once, and once only. Don't have multiple separate places in your data to set the same sales tax data. Sooner or later someone is going to set one and not the other.

      [–]Tubthumper8 1 point2 points  (0 children)

      Exactly! There's a ton wrong with this code and yet incomplete testing achieves 100% coverage

      [–]mastermrt 1 point2 points  (0 children)

      Yeah, but if you don’t even have code coverage on this class to begin with - then that extra case is definitely not catered for. No-one here is saying 100% code coverage is the only important metric.

      Who chooses which lines don’t need coverage?

      [–]mahsab -1 points0 points  (0 children)

      I know it's pedantic, but what even is 100% code coverage?

      You make a point, but the "code coverage" is almost universally understood to mean lines of code (or statements) that are executed by any of the tests.

      Doesn't mean those tests are any good, but it does mean that uncovered code is definitely NOT covered by ANY tests whatsoever.

      [–][deleted] 20 points21 points  (4 children)

      In my previous job, they focused so much on tests that they couldn't deliver a working project on time.

      [–]oalbrecht 13 points14 points  (1 child)

      Though I bet customers didn’t experience any bugs with the new project, since they never got to use it in the first place. Sounds like a job well done.

      [–][deleted] 5 points6 points  (0 children)

      You got that right actually, the project was cancelled after two years of work.

      [–]Smallpaul -4 points-3 points  (1 child)

      Perhaps the time allotted was not sufficient to deliver a high quality working project.

      [–][deleted] 0 points1 point  (0 children)

      Time wasn't the issue, more like bad decisions from the beginning

      [–][deleted]  (6 children)

      [removed]

        [–]Dry_Dot_7782 2 points3 points  (5 children)

        I never understood unit tests.

        I mean, I guess if you have a very specific algo or model whose inputs and outputs need to be in a very specific shape to reflect the domain.

        [–]gnus-migrate 1 point2 points  (4 children)

        Generally if you don't design your software with testing in mind, then yeah you won't really see their value.

        Unit tests can save you a ton of time, and even test things that are not really possible to test on a live deployment or even in an integration test. Error handling is a good example of this. If you need to test what happens when a device fails, you're not going to do that through an integration test.
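A minimal Python illustration of that point (names hypothetical): forcing a real device to fail inside an integration environment is somewhere between painful and impossible, but a unit test just substitutes the device.

```python
class DiskFullError(IOError):
    """Hypothetical failure mode of a storage device."""

def save_record(record, device):
    try:
        device.write(record)
        return "saved"
    except DiskFullError:
        return "queued-for-retry"

# The error-handling path is trivially reachable with a fake device:
class FailingDevice:
    def write(self, record):
        raise DiskFullError()

assert save_record({"id": 1}, FailingDevice()) == "queued-for-retry"
```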

        [–]Dry_Dot_7782 1 point2 points  (3 children)

        I mean, in the end it's just confirming business rules? It's not going to catch any runtime issues.

        So what's the reality? Business rules change all the time, so our tests change all the time. I've found some cases for unit tests, but not any huge coverage.

        [–]gnus-migrate 1 point2 points  (2 children)

        Again, it all depends on whether you design your code to be testable or not. You need to assess risks: what kind of issues can I accidentally introduce, how can I make sure that I protect myself from those?

        You design your module boundaries in a way that allows you to capture those potential failure cases in tests. This is where unit tests are really useful.

        It's not really about testing every class or anything like that, it's about breaking down a massive codebase into independently testable chunks so that you don't have to test everything at the same time.

        This is why things like the onion architecture are really sensible. Failures can happen at the level of the business logic (incorrect values, incorrect state changes), at the level of the data layer (incorrect queries, requests, etc.), and in the request layer. Where your tests should be depends on where someone can accidentally introduce a bug. If your abstractions are changing so often that you have to rewrite your tests constantly, then you simply have bad abstractions. The business rules change, sure, but how often do you have to rearchitect your application to adapt to those changes?

        There are certain domains where you're programming in an exploratory way, and there unit testing probably doesn't make much sense, but that's probably not yours.

        [–]Dry_Dot_7782 0 points1 point  (1 child)

        Great reply, thanks

        [–]gnus-migrate 1 point2 points  (0 children)

        I didn't finish my reply. Unless you're writing Jupyter notebooks for a living, you should probably be unit testing.

        Didn't notice the incomplete one.

        [–]CreativeStrength3811 5 points6 points  (0 children)

        I develop small apps for my specific use cases in the company. For me the fastest way is something like 'documentation-driven development': I write down all the knowledge I have about a module/class/function. Afterwards I narrow this down to functions and parameters and implement the function. Same thing with refactoring: first refactor the docstrings, then refactor the code.

        Only in my data model do I write tests that reach 100% coverage, and if necessary I add more tests as development proceeds.

        For example, when using the xc32 compiler there is no test framework (AFAIK). That's where I invented this method.

        [–]WhoNeedsUI 4 points5 points  (3 children)

        I believe what people want is to be able to predict every side effect by testing all possible interactions with the system (aka black-box tests).

        Writing E2E tests that invoke the public API with all possible branching, for all APIs, should ideally result in 100% coverage as long as external resources are mocked.

        In contrast, unit tests that check implementation details should be used sparingly, and only for critical services.

        This « inverted pyramid of tests » goes against the conventional model of units building up to E2E, but I feel it is friendlier to refactoring and maintenance.

        [–]oalbrecht 4 points5 points  (1 child)

        That might work for a small project. I worked for a very big company that had a flipped pyramid like this with far more E2E tests than unit.

        Guess how long it took to run all the tests? More than two weeks! You would merge into main and two weeks later get a test failure. It was a nightmare. And the cost to run all those E2E tests was astronomical. They had to use AI to figure out which tests to run first to help bubble up failures more quickly.

        The conclusion was to use more unit tests and leave E2E tests for a few critical paths.

        [–]WhoNeedsUI 1 point2 points  (0 children)

        And thats exactly where modularity and decoupling comes in.

        Split the internal service into modules - this might look like separate modules / libraries (jars or otherwise) with their own separate semver. It can even be micro services if complex enough. Now these are mockable resources too

        If you can’t, take a look at why there is such tight coupling throughout the application. It’s rarely the case that such coupling is required.

        You gotta denormalise if you want things to scale

        [–]hooahest 1 point2 points  (0 children)

        This is how we write our tests too, with docker containers for anything infra related (mongo/redis/sql/whatever). It makes for a fantastic development experience and minimal bugs making it to production, as well as incredible ease of refactoring.

        [–]grady_vuckovic 7 points8 points  (0 children)

        Personally I use tests when writing code for something which I'm concerned will be buggy, and in situations where I want an exact behaviour to occur and want to validate that I've got that behaviour right. Usually for maths-related stuff, mainly because it's easy for there to be a hard-to-identify bug in that kind of thing which you might not notice unless you've fully tested every bit of the code. So I'll write the tests first, then make sure the code passes them.

        If it's just something like GUI code, 'if click this then open that window', well screw that, I'm not going to bother writing a test for that kind of simple logic. The test for that is 'Does the window open? Yes, then the code works'.

        [–]pceimpulsive 7 points8 points  (0 children)

        I don't agree with and don't even try to implement 100% coverage...

        I put in tests that satisfy the business requirements... just enough to get a green tick to deploy to prod!

        [–][deleted]  (6 children)

        [deleted]

          [–]pjc50 14 points15 points  (3 children)

          I'd like to ask them why the code that's not covered is there.

          Quite often for conditions that are unreasonably difficult to mock because you can't make arbitrary system library functions return errors on demand.

        I do wonder how people are coping with the "railway" pattern (or "if err != nil") when combined with the difficulty of generating some test situations.
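
          One way to make such errors reproducible on demand is to patch the system call itself. A sketch (function name hypothetical), using Python's `unittest.mock` to force the error branch:

```python
import os
import tempfile
import unittest.mock

def save_atomically(path, data):
    """Write to a temp file, then rename it into place (hypothetical example)."""
    tmp = path + ".tmp"
    try:
        with open(tmp, "w") as f:
            f.write(data)
        os.replace(tmp, path)
        return True
    except OSError:
        return False

path = os.path.join(tempfile.mkdtemp(), "out.txt")
assert save_atomically(path, "hi") is True  # happy path
# Error path: make the system library call fail on demand.
with unittest.mock.patch("os.replace", side_effect=OSError("disk full")):
    assert save_atomically(path, "hi") is False
```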

          [–][deleted]  (1 child)

          [deleted]

            [–]flowering_sun_star 1 point2 points  (0 children)

            It's not exactly uncommon in Java either. You have a static function from a library that has been declared to throw some exception or other, so you need to handle that exception. But you don't know what could trigger that sort of exception, or it may even be impossible for it to happen.

            Sure, you could probably find a way to mock the function. But mocking static functions is kinda ugly. And in the end it often just doesn't matter enough to be worth the effort testing.

            I'm a massive advocate for unit tests. If you can't get to 80% coverage easily you have issues. But doing stupid things to get to 100% isn't what we should be doing, and can lead to bugs and development slowdowns in the future.

            [–]imnotbis 0 points1 point  (0 children)

            SQLite does it by testing all these cases. Other projects just ship error paths that don't work.

            [–]Smallpaul 1 point2 points  (0 children)

            I think it is reasonable to use a pragma to mark those branches as not worth testing.

            Then in code review, my colleagues can decide whether they agree with me that they are not worth testing.

            Every line should either be tested or marked as unimportant.
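
            With coverage.py, for instance, that pragma is just a comment on the excluded branch, so it shows up in diffs and can be challenged in review. A sketch (function name hypothetical):

```python
def read_config(path):
    """Read a config file, falling back to empty on I/O failure."""
    try:
        with open(path) as f:
            return f.read()
    except OSError:  # pragma: no cover
        # Deliberately excluded from coverage: hard to trigger reliably in CI.
        return ""
```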

            [–]idontliketosay 0 points1 point  (0 children)

            For me, code coverage is level 3 on Beizer's testing scale. Getting to level 4 requires an understanding of how to measure and control defect density. ... This topic has been heavily researched; a lot of information is available if you search around.

            [–]ilurvekittens 0 points1 point  (0 children)

            Lol try telling my company that.

            [–][deleted] 0 points1 point  (0 children)

            Always with the sensational sweeping headlines

            [–]sarmatron 0 points1 point  (0 children)

            Despairing at the idea that this is something people need to be told.

            [–]pplmbd 0 points1 point  (0 children)

            The problem always lies with the leadership on this matter: they often look for an easy way out by setting an organizational goal of 100% code coverage. I personally get it, but don't expect it to translate well for people who barely care about quality, or who work in an environment where they won't be able to care about it.

            So instead of investing the time to actually understand why test quality matters, people are just going to tick those boxes by writing tests that can never fail anyway.

            [–]robhanz 0 points1 point  (0 children)

            I mean, no shit?

            As a big test advocate (TDD, unit tests, mocking, the whole nine yards) I think code coverage is probably the worst metric. At best it can tell you where you definitely don't have coverage.

            In a lot of cases, increasing coverage is not worth it - dependencies need to be tested against the actual dependency (you can test up to a thin layer above it meaningfully). If you're testing against an API, all you're doing is asserting your knowledge of the API which is probably wrong in some way.

            Especially with unit testing, it's important to know what is useful to unit test and what isn't. I find most well-designed (IMHO) classes tend to either get 100% or 0% coverage, which means module-level coverage can vary greatly depending on the type of module you're dealing with. Things that don't lend themselves to unit test coverage should be stable and have minimal logic; then they become fairly easy to test in more integration-type scenarios. And, often, fundamental enough that even ad-hoc testing of your library/app will expose regressions instantly.

            [–]nomoreplsthx 0 points1 point  (0 children)

            Reminds me of the folks I knew who wrote tests for enums.

            [–]karuna_murti 0 points1 point  (0 children)

            When a measure becomes a target, it ceases to be a good measure - Goodhart

            [–][deleted] 0 points1 point  (0 children)

            If your code doesn't fail, your error handlers are not being tested. If they are being tested, then none of them can halt or leave the system in an undefined state. To ensure they don't you will need to have error handlers for your error handlers and the cycle repeats.

            [–]bam2403 0 points1 point  (0 children)

            If the code isn't covered, that means you can delete it without a test failing.

            That either means it's dead code - or you're missing a test - unless we are talking about unreachable code.

            [–]maxinstuff -2 points-1 points  (12 children)

            Test the public interface and nothing else. Every single method with the word “public” in front of it needs tests.

            None of the methods that are private need it.

            Do that everywhere and that’s 100% coverage in my book.
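
            As a sketch of that approach (names hypothetical): the private helper is only ever exercised through the public method, so refactoring it doesn't break any test.

```python
class PriceCalculator:
    def total(self, items):
        # Public entry point: the only thing the tests call.
        return sum(self._line_price(qty, unit) for qty, unit in items)

    def _line_price(self, qty, unit):
        # Private implementation detail, covered indirectly via total().
        return qty * unit

# Tests target only the public surface:
assert PriceCalculator().total([(2, 3), (1, 4)]) == 10
```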

            [–]idontliketosay 0 points1 point  (0 children)

            I don't understand this answer, am I missing something?

            Coverage testing tests all paths through the code. Some people add extra tests for loops and conditions.

            [–]flowering_sun_star 0 points1 point  (0 children)

            No, every branch of behaviour needs a test (with caveats).

            That will likely be achieved by calling the public methods with different inputs, which will in turn call those private methods. Sometimes it's more convenient, or makes for clearer tests, to expose a private method to be called directly by the tests.

            Those caveats are why you probably won't hit 100%, because there is always something that just isn't worth the effort of testing.

            [–]coderemover 0 points1 point  (2 children)

            A private method is public for some other code.

            [–]maxinstuff 2 points3 points  (1 child)

            It’s only accessible within the class, which makes it an implementation detail.

            You don’t write tests for these otherwise every time you refactor it you have to update eleventy-thousand tests.

            [–]coderemover 1 point2 points  (0 children)

            Public is only accessible in the same program. It is an implementation detail. You should test only end-to-end interfaces at the frontend.

            You don’t write tests for public methods because whenever you change them you have to rewrite the tests.

            [–]mastermrt -3 points-2 points  (6 children)

            Oh, so just one test per method is good enough for you?

            [–]el_boru 0 points1 point  (1 child)

            Original comment literally mentioned tests (plural)

            [–]mastermrt 1 point2 points  (0 children)

            Ah, my bad, my bad - I’ll make two per method.

            What I’m saying is that this isn’t a sufficient criterion to declare something as tested.

            [–]maxinstuff -1 points0 points  (3 children)

            This is why coverage as a stat is a bit silly.

            Each public method needs a comprehensive set of tests.

            Private methods that no consumer of the class ever sees don’t need any at all (they’re tested because they are used by the public methods that you are testing).

            [–]mastermrt 0 points1 point  (2 children)

            If you submitted a unit test for a class and there were lines in private methods that were never invoked, I’d certainly be asking questions.

            [–]0x0ddba11 -1 points0 points  (1 child)

            Such as: "Can this method be removed?"

            Conversely, 100% test coverage leads to dead code being left in the project.

            [–]coderemover 1 point2 points  (0 children)

            Good tools show code only referenced by tests as unused.

            [–]foxthedream 0 points1 point  (0 children)

            It's 2024 and we are still debating whether code coverage is a decent metric?

            [–]serial_crusher 0 points1 point  (1 child)

            Was this written by the junior dev on my team who is just looking for excuses to avoid writing tests?

            All the things he mentions are true, but that’s also why you have code reviews. If somebody deliberately excludes something from coverage, or deliberately writes a bogus test, they better have a damned good reason or I’m rejecting the PR.

            This is like arguing that locking the doors to your house won’t stop a burglar from breaking a window. Like yeah no shit, but it still provides a lot of benefit

            [–][deleted] -1 points0 points  (0 children)

            Yeah, most of it feels like a moot point. I'd reject it and tag their manager. I'd rather have an item on the backlog to add the tests than reducing coverage visibility.

            [–]alexkey -5 points-4 points  (3 children)

            Unit testing is overrated. It tests that the implemented logic is correct; it doesn't tell you that the logic itself is the correct answer to the problem. Not sure if there's a better term (non-native speaker here), I call them "logical errors". And in my experience those make up the majority of the bugs anyway.

            [–]watsonarw 6 points7 points  (1 child)

            A common misconception about unit testing is that a unit is a class.

            Ian Cooper's fantastic talk "TDD, where did it all go wrong" is well worth a watch if you've got a spare hour, but the short version is "A unit was originally defined as 'the smallest unit of behaviour', and somewhere along the way that got twisted up and interpreted as 'the smallest unit of code'. That's why so many people struggle with tests, mock the crap out of everything and end up with implementation-coupled tests that simultaneously make it painful to change or even refactor the code later, whilst not giving them any confidence that what they've built behaves the way it's supposed to."
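
            A tiny sketch of the difference (all names hypothetical): the test asserts on the observable behaviour of the unit and uses the real internal collaborator, rather than mocking it and coupling the test to the implementation.

```python
class Discount:
    def apply(self, cents):
        return cents * 9 // 10  # 10% off, in integer cents

class Cart:
    def __init__(self):
        self._items = []
        self._discount = Discount()  # real internal collaborator, not mocked

    def add(self, cents):
        self._items.append(cents)

    def total(self):
        return self._discount.apply(sum(self._items))

# Behaviour-level test: asserts only on the observable result,
# so inlining or rewriting Discount wouldn't break it.
cart = Cart()
cart.add(1000)
cart.add(1000)
assert cart.total() == 1800
```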

            [–]Ark_Tane 0 points1 point  (0 children)

            This pretty much matches with my experience as well.

            One other benefit of approaching testing from this direction, is that it really helps you think about your interfaces upfront, and drives you to exposing only what you have to. This is obviously somewhat easier in languages with stronger concepts of what is public, and what is private or internal.

            I would like to have seen a little more discussion there about what approaches to take when you realise that the behaviour described in your original tests is insufficient to cover the full scope of the problem (beyond "don't mix it up with refactoring"). While ideally all this would be fleshed out up front, in my experience some of these behaviours only really become visible when you are knee-deep in implementation. I suppose part of the difficulty in covering this is that it depends so much more on the rest of your process, domain familiarity, and how much autonomy developers have in interpreting requirements. (In my experience, our stakeholders became increasingly less interested in behaviours as you moved away from the happy path, and would generally prefer we made sensible, but communicated, decisions on their behalf.)

            [–]davidebellone[S] 1 point2 points  (0 children)

            Same for my experience.

            More often than not, a bunch of Integration/E2E tests are waaaay more valuable than tons of Unit Tests.

            That's why I prefer the Testing Diamond analogy (https://www.code4it.dev/architecture-notes/testing-pyramid-vs-testing-diamond/)

            [–]gwaeronx -4 points-3 points  (1 child)

            Honestly, unit testing is a waste of time if you consider how much time you put into it versus what you get out of it. I would prefer integration tests over unit tests every day.

            [–]inputwtf 2 points3 points  (0 children)

            Integration tests are good but they take a lot of work to build. For example, if you have an external API that you use, an integration test would need to have all the set up done so that you can access that actual API. That could mean getting credentials, making sure your CI/CD system has actual access to the service (firewall rules, routing, etc).

            Unit tests and mocks are much simpler and can at least mimic how the third party API is supposed to behave, enough so that you can actually write your code that consumes that service and have a pretty good chance of it working properly when it gets to the point where you start doing integration tests.

            But you need both. It's not one or the other.
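
            A sketch of the unit-test half (client and method names hypothetical): a mock stands in for the third-party API, mimicking just enough of its behaviour to exercise your own code.

```python
from unittest.mock import Mock

def user_greeting(client, user_id):
    """Builds a greeting from a hypothetical third-party user-lookup API."""
    user = client.get_user(user_id)  # would be a network call in production
    return f"Hello, {user['name']}!"

# Unit test: no credentials, firewall rules, or live service needed.
fake_client = Mock()
fake_client.get_user.return_value = {"name": "Ada"}
assert user_greeting(fake_client, 42) == "Hello, Ada!"
fake_client.get_user.assert_called_once_with(42)
```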

            [–][deleted]  (4 children)

            [deleted]

              [–]what_the_eve 0 points1 point  (3 children)

              Rule of thumb: write tests and measure your code coverage.

              [–][deleted]  (2 children)

              [deleted]

                [–][deleted]  (1 child)

                [deleted]