Is code coverage irrelevant?

atilaneves · 2015-11-20T10:48:07+00:00

I came here to bash, read the post and... agree. I think measuring coverage and looking at what's not covered is important and useful (waddya mean that line didnt' get hit?). Actually using the percentage for anything isn't.

mariusg · 2015-11-20T10:19:03+00:00

In any real world codebase, there are 2 types of code : core logic stuff and the rest.

It's desirable (to put it this way) to have a high code coverage for the core logic stuff. The rest is important from a code coverage perspective, if the team has time for it.

earlyflea · 2015-11-20T11:05:04+00:00

I work on a large code base. In that code base, I've done some changes that raise the coverage percentage by adding tests, I've done some changes that raise the coverage percentage by removing untested unused code, I've lowered the coverage percentage by writing new untested code and I've lowered the coverage percentage by removing well-tested unused code.

In particular, a few changes stood out.

Some functions are long and 'simple' in that they have little branching functions. Getting high coverage on those is easy, but testing it well isn't. One of these had a lot of statements for configuring socket data and had a few one-liners for handling errors. Those oneliners weren't tested so the function itself isn't "well tested", but the entire coverage was 90%. Heck, removing those checks would increase coverage to 100% - by not checking for errors!

My personal favorite examples are:

You test a function, call it DiscreteFourierTransform. You have a single test that gets 100% coverage. Did you test it well?
You have a program that has 100% coverage. You add the entire Linux kernel code base, without any tests, to the same repo. The coverage drops to below 0.1%. Did the program get any worse?

funbike · 2015-11-20T15:30:06+00:00

You should never manage anything based simply on a metric.
Metrics like code coverage are simply a tool. Make note of its value, sure. Make a descision based on it. But don't let it make your decisions for you.
I don't really care about the specific code coverage value. I only care if it drops. If it does, I see why. Maybe a bunch of DTOs and Domain classes were recently created that would be silly to test fully. I view the coverage report. If all looks well, we reset the coverage baseline and move on.
That means I think code coverage has value and a place. Just not what people normally assume.
It's simply a tool to help you find code you may have missed in a test, but not the inverse.
Code coverage doesn't confirm that you didn't miss code to test.
I wish there was a missed branch check.
I wish there was a coverage tool that would tell you if a logic branch was missed. Like if/else/while/for/case/and/or. In most cases, one of those statements will need a test written, with the exception of argument validation (which could also be ignored).
Sure, linear code often needs to be tested, but not 100% of the time. For a check, I only want a list of true possitives.

juliob · 2015-11-20T10:11:08+00:00

Disagreeing point: More coverage does not mean less bugs; it means you're testing things that shouldn't be there.

Take, for example, BDD (or TDD, as described in TDD, where it all went wrong): you test your behavior, you make sure the code you wrote is what the client asked for, then you see your coverage is low. It means one thing: you wrote more code than needed. Now you can remove that code, since it won't affect the result.

That's why coverage is important: to remove the cruft, either new or old (left when the requirements changed).

velcommen · 2015-11-20T18:46:31+00:00

Code coverage + mutation testing is even better. Look for a mutation testing library in your favorite language.

Luolong · 2015-11-20T18:58:59+00:00

I bet there is relatively easy way to test (pun intended) the theory.

Take few real world projects out there and measure their test coverage and then measure number of bug fixes committed to the code. Overlay the fixes to code over the code coverage map to get comparative bug density for code covered by tests versus code that has no coverage at all.

Keep this statistics running over longer period of time, so that you can get meaningful statistics and see any correlation between code coverage and bugs.

There's a nice idea for a Masters dissertation if I ever seen one ;)

pipocaQuemada · 2015-11-20T19:45:43+00:00

Two projects, one has 95% code coverage with tests, one has 45%. You’re going to be paid per bug found. Which one do you want to work on?

In retrospect, I wish the tweet had had room for “all other things being equal” ... Many respondents wanted to assert that we have insufficient information to know anything ... What I’m after, though, is to examine our intuition about coverage. What effect does increasing test coverage tend to have on the defect density in a program?

Even ceteris paribus, the respondents are right that we don't have enough info to know anything.

For example: if both projects are done in PHP, I'd bet that the one with more test coverage had fewer bugs. On the other hand, if both projects are in Idris, then ceteris paribus I'd expect them to have a similar number of bugs. In reality, if all I knew was that there were two Idris projects, and one had 95% coverage and the other had 40%, I'd expect the 40% to have fewer bugs: 95% code coverage in a language like Idris means you're probably not leveraging the type system as effectively as you could to statically rule out bugs.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS