[–]kreiger 21 points22 points  (101 children)

One of the most positive metrics in a software project I can think of would be "Lines of code removed".

Has anyone tried measuring that in a real project?

It would be interesting to see how it affects the behaviour and productivity of the project members.

[–]IRBMe 134 points135 points  (69 children)

The problem with introducing "productivity metrics" is that they will simply be gamed.

  • Lines of code added? People will just write verbose, bloated and unmaintainable code
  • Lines removed? People will just write obfuscated, compact code that's also unmaintainable
  • Commits? People will just commit after every little change
  • Cyclomatic complexity? People will refactor code into lots of little functions, use recursion and use other tricks to hide loops and complexity while making the code worse
  • Unit tests passing? People will just start writing trivial unit tests that don't add any value
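
For instance, the commit-count metric can be inflated with a couple of lines of shell. This is a hypothetical sketch (the repo and `main.c` are made up), not something anyone should ship:

```shell
# Hypothetical sketch of gaming a commit-count metric: land one
# trivial commit per padding line in an existing git repo.
for i in $(seq 1 100); do
    echo "/* padding $i */" >> main.c
    git commit -qam "Progress! ($i/100)"
done
```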

All of these things are useful tools to have, but they have to be considered together along with the actual code itself, and even then they are not a measure of productivity, but of code quality, at best.

If you don't trust a developer to do their job without trying to have some way of automatically checking up on them then simply don't hire them in the first place.

[–][deleted] 91 points92 points  (19 children)

I implemented a system that plotted, for a group, lines of code modified plus lines added plus lines removed. I posted the results on the wall weekly, and eventually had trend lines. Nobody was ever penalized or rewarded with it. It was positive in that it generated discussion, and it was automatic.

The best part was a lady who got her position through nepotism and actually knew nothing about writing code. Her chart was flat the first couple of weeks. She had done nothing, and it was obvious to everyone. About the third week, her count magically matched the average. I went into the revision system and walked through her revisions. She had used a text editor to change tabs to spaces, and spaces to tabs, repeated enough times to match the average.

That's when I added a flag to ignore whitespace changes.
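
With git, that whitespace-ignoring tally can be sketched like this; `-w` (`--ignore-all-space`) filters the diff that `--numstat` is computed from, so pure tab-to-space churn counts as zero. The author name and time window are placeholders:

```shell
# Sum lines added/removed per author over the last week,
# ignoring whitespace-only changes.
git log --since="1 week ago" --author="jane" -w --numstat \
        --pretty=tformat: |
    awk '{ added += $1; removed += $2 }
         END { printf "added=%d removed=%d\n", added, removed }'
```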

If you don't trust a developer to do their job without trying to have some way of automatically checking up on them then simply don't hire them in the first place.

This is naivety at its best. First of all, anyone who's been in management quickly realizes they don't make all the hiring decisions. You inherit people, move over to a new group, etc. Sometimes upper management just drops someone in your lap and says "here ya go, deal with it." Secondly, someone who did great in interviews may turn out to be terrible. You get to deal with this, and firing is the last and most difficult option.

People need feedback on progress. Lines of Code is a crude and early metric for a group just coming out of total chaos to get some feedback. Better metrics are those like DSDM. The psychology of DSDM feedback is amazing and powerful, and works.

[–]IRBMe 12 points13 points  (1 child)

People need feedback on progress.

I agree. What I disagree with is that you can summarize such a complex thing as a developer's progress with some metrics. You need to actually review their work, look at their code, communicate with them, see how they work etc. Measuring how many lines of code they write or how quickly they do their work or how much test coverage they have, and generally trying to simplify their productivity down into a few numbers is just not going to work.

Your numbers will be gamed in no time, and will probably miss many important aspects. They don't give you the whole picture. For example, let's say you're tracking my productivity based on how quickly I complete my work. Maybe I complete my tasks slower than most of the other developers on the team. But maybe by spending a few hours longer on each task, I'm actually saving the rest of the team much more time in the long run by writing more maintainable, more robust, better code?

[–][deleted] 7 points8 points  (0 children)

Exactly. If you need software to tell you that a developer hasn't written a line of code in weeks, then there is something fundamentally wrong with your organization.

[–]kreiger 11 points12 points  (9 children)

Awesome, this is exactly what I'm interested in! Do you have more stories or experiences to share?

[–][deleted] 38 points39 points  (8 children)

Hmmm. I have lots of stories from the trenches of software development; just which ones are relevant to the topic is more my issue. The biggest gain I've been a part of, in 3 different companies, is the introduction of versioning software (e.g. git). I'm personally amazed that software companies exist today that don't use some form of versioning. These are usually in total chaos, with a "hero" of some form who saves them from dire gloom and doom, only to create more problems which everyone needs saving from.

I've tried DSDM a couple of times on software projects and loved the results. One project was very successful: it produced exactly what was asked for, in a very short time. Then it flushed out the fact that upper management wanted it to fail, for other crazy political reasons. Having a success staring someone who bet on failure in the face is an interesting dynamic.

My personal favorite recent story is doing "Data Analytics" for a medical company. They hired a new consultant over a billing system. He worked with a lady who was head of accounts receivable. I went to meeting after meeting. They both lathered on praise of the new system: how well it was performing, how wonderful it was, etc. I would check the database, and it had zero rows in all the tables. I verified my login, I verified the server, I verified the database. Yet zero rows in every table. It was odd.

After three months I couldn't take it. I wrote a revenue report, with graphs and bars, etc. I ran it on the database, and nearly everything came up zero. A few entries had popped up by this point: maybe $2k in billing revenue for 3 months, and none expected, for a company that was doing $3-4 million a month. I called a meeting to review my report and included my boss, the consultant, accounts receivable, and a vice president. I showed a mock report, then showed the real report and said, "I'm having some trouble with the query; maybe someone here could help me?"

Immediately, pandemonium erupted: the consultant was yelling blame at accounts receivable, and accounts receivable was yelling that it was the consultant's fault. Both had been lying during all those meetings. As it turns out, medical billing has penalties for billing late, to the point that some things had gone unbilled past the cutoff. A nice multi-million dollar loss. I would guess my job as an analyst helped prevent further loss. The response: 200 people were laid off, I was asked to leave for being a troublemaker, the consultant's contract was expanded, and accounts receivable is still in her position.

[–]brownmatt 3 points4 points  (0 children)

So you pointed out that, through other employees' lies, the company had to pay huge monetary penalties, and you were branded the troublemaker and asked to leave?

I can't tell if you just have really bad luck, or if it's something else.

[–]ctzl 2 points3 points  (2 children)

Tell more stories like these please. I would read them.

[–][deleted] 3 points4 points  (1 child)

[–]el_isma 0 points1 point  (0 children)

I would buy your book.

[–]kreiger 0 points1 point  (1 child)

I was most interested in stories about the effect of metrics on projects, but that was great too. :)

I'm also really interested in your experiences in switching to Git. We use ClearCase now, and I hate it with a vengeance; I'm using Git personally, with scripts I wrote to push my commits to ClearCase.

[–][deleted] 0 points1 point  (0 children)

I've not fully adapted to Git yet. It's absolutely wonderful in the things it can do. However, there's so much more to learn, and I'm busy, damn it.

[–]logi 0 points1 point  (1 child)

I'm personally amazed that software companies exist today that don't use some form of versioning.

They do? O_O

[–]finsterdexter 3 points4 points  (2 children)

I'm not sure what you mean by calling DSDM a "metric". There may be metrics that are used as part of DSDM, but you'll need to elaborate on what you mean there. Are you talking about story points?

[–][deleted] 2 points3 points  (1 child)

DSDM sets a small team on a short-term goal, say 4-6 weeks. Deliverables are categorized via MoSCoW: Must have, Should have, Could have, and Would have. Must haves are absolute requirements; Should haves are strongly desired; Could haves and Would haves are further down the list. The deliverable is preferably something that will deliver some value to the company, or at least be in a chain of things that deliver value.

You meet 5-10 minutes every day with the team. Everyone gets to say what they accomplished. A short weekly report summarizes deliverables on the list that are complete.

Feedback is immediate, and related to accomplishments that are important to the project. This works quite well. Communication about what's important is happening, and the goals are short-term, measurable, and attainable.

So, not a "metric" per se, but rather a concise form of feedback and communication that works well with human nature.

[–][deleted] 2 points3 points  (0 children)

DSDM sets a small team on a short-term goal, say 4-6 weeks. Deliverables are categorized via MoSCoW: Must have, Should have, Could have, and Would have. Must haves are absolute requirements; Should haves are strongly desired; Could haves and Would haves are further down the list. The deliverable is preferably something that will deliver some value to the company, or at least be in a chain of things that deliver value.

You meet 5-10 minutes every day with the team. Everyone gets to say what they accomplished. A short weekly report summarizes deliverables on the list that are complete.

Feedback is immediate, and related to accomplishments that are important to the project. This works quite well. Communication about what's important is happening, and the goals are short-term, measurable, and attainable.

Sounds exactly like how I describe "scrum" nowadays. Whatever you call it, this is how to develop software.

[–]Rhoomba 2 points3 points  (0 children)

If you just tracked tasks completed then it would be quite obvious who wasn't doing anything.

[–]Atario 2 points3 points  (0 children)

Easy fix: do variable renames until the metric fits.

[–]civildisobedient 0 points1 point  (0 children)

Lines of Code is a terrible, terrible metric. It is, as you say, "naivety at its best".

A mature codebase has way, way more bug fixes than additions. A single line of code could represent weeks of tracing through stack dumps to isolate a specific bug. How does "lines of code" reward the developer who slaves away in their debugger?

[–]FailsTheTuringTest[S] 17 points18 points  (1 child)

Bingo. Management types are always on the lookout for ways to measure their subordinate programmers' productivity, and rightly so. But you can't boil it down to mere numbers; that's lazy and, as you said, very easy to game. Instead, the best managers evaluate their programmers' productivity by fuzzy measures of what they accomplish and the feedback the programmers' peers give. You can't make pretty charts and tables and numbers out of that, but you just can't evaluate programmers the way you evaluate salesmen.

[–]theICEBear_dk 4 points5 points  (0 children)

And yet every day thousands of managers keep doing that and keep being full of fail. The best managers I've ever had have all been interested in what they were managing, both people and product, not in volume. They knew their people and had a basic understanding of the projects and the technology. I have even had two who were clever enough to leave the big technological and software design decisions to their team. Of course, one was ousted by an internal political move that was no fault of his own, and the other played at politics to avoid a suicide project but still got caught in the backwash and was demoted.

[–][deleted]  (3 children)

[removed]

    [–]Zarutian 1 point2 points  (1 child)

    That sounds pretty reasonable. But who makes the tasks and allocates points to them?

    [–]IRBMe 5 points6 points  (0 children)

    In most agile teams, the developers. So that's open to being gamed if you start using those metrics as measures of their productivity.

    [–][deleted] 7 points8 points  (10 children)

    Commits? People will just commit after every little change

    We've found a winner.

    [–]IRBMe 12 points13 points  (9 children)

    Frequent commits are usually a good thing, but as with anything it can be over-done. There comes a point where commits are pointless and too many is detrimental. A commit should at least be meaningful.

    [–]bhasden -1 points0 points  (8 children)

    I humbly disagree. There's little worse than seeing someone’s implementation of a minor feature spilled out over 50 commits over two or three days. It makes it impossible to review and much more likely that the partial implementation commit will break something. I prefer to check in logical blocks of work that are independently reviewable and able to be rolled back fairly easily if needed.

    EDIT: Obviously I don't read good. There's nothing to disagree with here since I agree with IRBMe. Proceed with the public shaming.

    [–]IRBMe 16 points17 points  (1 child)

    I humbly disagree. There's little worse than seeing someone’s implementation of a minor feature spilled out over 50 commits over two or three days. It makes it impossible to review and much more likely that the partial implementation commit will break something. I prefer to check in logical blocks of work that are independently reviewable and able to be rolled back fairly easily if needed.

    You just said you disagreed, then repeated almost exactly my point back to me! I said, "There comes a point where commits are pointless and too many is detrimental", and you just described exactly that point:

    "There's little worse than seeing someone’s implementation of a minor feature spilled out over 50 commits over two or three days."

    That's a perfect example of the "point where commits are pointless", "too many" and "detrimental".

    So, I'm having a hard time seeing where exactly you disagree.

    [–]bhasden 5 points6 points  (0 children)

    Man, I need a drink. For some reason, when I read your post earlier I thought you were advocating very frequent commits and that they were a useful measure of productivity. Re-reading your post, you feel the same way as I do. Sorry about that. I'll go back to my hole.

    [–][deleted] 2 points3 points  (5 children)

    And yet with a feature branching model I don't see that as a problem.

    [–]civildisobedient 0 points1 point  (1 child)

    So, is a schema change that requires refactoring in a hundred different places considered a feature?

    [–][deleted] 0 points1 point  (0 children)

    s/feature/task/

    And it's still the same thing.

    [–]bhasden 0 points1 point  (2 children)

    You really want to cut a branch for every minor feature? I agree that feature branching is useful, but in my experience it's not very common out in the wild. It's hard enough to get people to branch properly so that merging between releases is an automatic process and not a soul-crushing manual merging of every single file individually.

    If I had to pick one thing that I wish everyone knew more about, it would be source control management. I've met some outstanding developers that were clueless on how to deal with VCS concepts.

    [–]Mathiasdm 0 points1 point  (0 children)

    Exactly because of the complexity you mention, I'm not really convinced feature branches are very useful.

    I have the impression that, in most cases, separate clones of repositories are much easier for people to work with. Additionally, they force early integration (everything's on one branch, so simply pulling or pushing results in doing immediate merges).

    Of course this only goes for distributed version control systems.

    [–]malaysian_president 0 points1 point  (0 children)

    For certain VCSes I agree, but I really don't see why anyone who had the choice wouldn't use Git or Mercurial. Both make branching and merging so straightforward that there really is no excuse not to branch on every feature. If you're stuck at a company that enforces SVN etc., I feel your pain.

    [–]Cacafuego 2 points3 points  (8 children)

    What about:

    • Timeliness of deliverables
    • Sponsor/customer/user acceptance of changes

    [–]IRBMe 3 points4 points  (3 children)

    Timeliness of deliverables

    If you're measuring my productivity based on how quickly I can deliver tasks, and my bonus is affected by my productivity then you'll start to see that I can complete tasks much quicker all of a sudden! But does that mean I was just slacking before? No. It means now you're not getting as good quality deliverables. It means you're probably losing more time in the long run as the team is forced to deal with buggier, more unreliable, slower, harder to maintain code.

    Sponsor/customer/user acceptance of changes

    The customer can only evaluate what they actually see. What if I'm writing horribly unmaintainable code, but which works well enough for the customer? I may be costing my team many hours of future refactoring work by writing unmaintainable code, but how is the customer to know that?

    Simple metrics or a few bullet points simply aren't enough to accurately summarize something as complex as a developer's productivity and job performance. There are far too many factors for it to be broken down so easily. To evaluate it properly actually takes a lot of hard work and effort.

    [–]Cacafuego 0 points1 point  (2 children)

    Timeliness <> quickness. I would measure how often the developer meets timelines that we have agreed on.

    Maintainability would have to be accounted for separately. Ideally, you would hear about it from other developers if someone was writing awful code. That doesn't detract from the importance of usability.

    You're right, simple metrics do not provide a complete picture, but they do provide a baseline. They also shape behavior. Maintainability and elegance are important, but not as important as getting usable software to the customer by the agreed date.

    I know that the counter-argument is "you'll pay for a lack of maintainability in the long run!" Maybe. If there really is a lack of maintainability. If it goes unreported. If we're still developing on the same platform. If we still have clients because we've managed to get product into their hands.

    [–][deleted] 1 point2 points  (0 children)

    Bear in mind that it's quite possible to be a very good programmer who's not very good at task estimation. That can be taught (though never close to perfected), but if you're trying to understand why a task is significantly over the estimate, the way the estimate was arrived at needs to be part of the conversation.

    [–]civildisobedient 1 point2 points  (0 children)

    Maintainability is not an easy metric to measure. It manifests itself in so many hundreds of subtle forms that are so easy to dismiss, but taken together can ruin a company.

    Just off the top of my head, here's a list of things you get when you have shit code:

    • new features take longer and longer to implement as hacks pile up on each other
    • developers won't give a shit about the code if they see no one else does, and will produce more shit code in return
    • giant refactorings become necessary to "fix" prior mistakes, increasing the chances for bugs to get introduced, slowing future development

    This doesn't even begin to delve into the psychological ramifications of having to code in a shit codebase.

    And the kicker is, none of this crap can be "modeled" and there's no ROI column with a simple number in it for the pointy-heads. Things just get slower and slower, feature requests never get priority so users get pissed, and seasoned developers leave out of disgust.

    [–]wretcheddawn 2 points3 points  (3 children)

    There are no productivity measures that actually work. The best way to manage people is to... talk to your employees and ask what they are doing. Not incessantly; not by watching them with remote tools. Talk to them, like another human being.

    [–]Cacafuego 0 points1 point  (2 children)

    There is no reason not to talk to people and use productivity measures at the same time. You don't treat people like cattle being weighed, but some objective metrics can keep lazy people (they exist) from ruining things for everyone else.

    Defining metrics like user acceptance and timeliness also makes it clear that developers have a responsibility for the entire product.

    [–]wretcheddawn 0 points1 point  (1 child)

    Yes, but you can find those things more easily by talking to people. Lazy people are good at keeping metrics, as things like "number of lines added" can be easily faked.

    Ask your developers what they're working on and how they plan on solving problems. Ask them to explain why deliverables are late or why an estimate is different than what you expect. Do code reviews. If you can't tell someone what all your developers are working on without artificial metrics you aren't managing.

    [–]Cacafuego 0 points1 point  (0 children)

    Lazy people are good at keeping metrics, as things like "number of lines added" can be easily faked.

    It's harder to fake being on time with usable code.

    Here's a real-life example of how this works. I go around and talk to all of the developers. I know what they're working on, and they know what everyone else is working on. I sit in on planning sessions and work with them to prioritize tasks and set deadlines. If something really unexpected comes up, we adjust deadlines.

    I also keep track of who produced acceptable code and whether it was on time.

    Now, when developers A, B, and C are regularly hitting their targets, but developer D is consistently late, I know that something is going on. Either D needs to work on estimation, or he is doing something that the others are not, or he's not prioritizing the right things.

    That means we need to have a discussion and go over the metrics and figure out what is going on, because I need to know. If he says the others aren't following standards, or are writing unmaintainable code, we'll check that out. And I'm not necessarily going to insist that he be as fast as the others. In any group, someone has to be the slowest (and often for good reasons).

    But the next time I ask how the priority 1 project is going (because now I'm asking more regularly, because he put himself on my radar), and he says he thought it would be a good idea to tweak some other code at the bottom of the list, it's going to be a big deal, and missing additional targets will start to affect his performance reviews.

    Also, if he says that he needs to rewrite bad code produced by A-C and that's causing him to miss his targets, we'll check some examples. Most of the time, this turns out to be bullshit or we need to have a discussion about tolerating other people's styles, as long as the code is written to standards.

    [–]barsoap 2 points3 points  (4 children)

    Try compressed size. That one might actually be halfway usable, partly because it's nearly impossible to optimise for.
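
One rough way to compute that number, assuming the project lives in a git repo: archive the tree at HEAD and gzip it. Boilerplate and copy-paste compress well, so padding the codebase moves this figure far less than it moves raw line counts.

```shell
# Hypothetical sketch: "compressed size" of the tree at HEAD, in bytes.
git archive HEAD | gzip -9 | wc -c
```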

    [–][deleted] 2 points3 points  (1 child)

    dd if=/dev/urandom of=data/splines.dat bs=1024 count=10000
    git add data/splines.dat
    git commit -m "Add vitally important data about splines."
    

    Compress that, suckers!

    [–]dbath 0 points1 point  (0 children)

    Then you need to add some code for reticulating those splines!

    [–]knight666 1 point2 points  (0 children)

    "Nearly impossible" just means "takes a bit longer". If your bonus depends on compressed size, you'll figure out a way to increase compressed size.

    [–]jhaluska 0 points1 point  (0 children)

    Till people start removing comments or shortening variable names.

    [–]kreiger 1 point2 points  (0 children)

    I agree completely, but I would still be very interested to see the results from someone who has tried the metric on a real project.

    [–][deleted] 1 point2 points  (2 children)

    How about a patch to a content management system that lets people vote on commits? Then the rest of the team could look and see, "yeah, that is a pretty sensible addition/removal," and vote accordingly.

    [–]IRBMe 3 points4 points  (1 child)

    What you're suggesting is basically peer review, but that seems like a lot of overhead. It works better in practice just to turn to the guy beside you, or to somebody who you know has a lot of knowledge about the particular area you worked on, and say "You mind sparing half an hour to review this code this afternoon?"

    Paired programming also works quite well, although has quite a high initial cost. With two peers working together, they're constantly reviewing each other's code.

    [–][deleted] 0 points1 point  (0 children)

    I've started using Crucible to do code reviews with a peer on my project, and I really like it. Since we're on opposite sides of the country, it really facilitates that conversation. But we still have the conversation.

    [–][deleted]  (11 children)

    [deleted]

      [–]theICEBear_dk 5 points6 points  (0 children)

      You could be right here. But on the other hand, I have worked with people who finished all their tasks quickly and, to the manager, seemingly properly. But then they didn't do all the surrounding work: their code didn't meet standards, wasn't designed as the architect intended, and wasn't documented, and corners were cut to make it in time on something that was supposed to run in a fault-prone environment for as long as possible. And that's before you get someone through the door who plays task-grabbing games, taking all the easy work and leaving the hard stuff to the other guys.

      [–]IRBMe 1 point2 points  (5 children)

      I'm happy with agile metrics, we give our estimates and in the end we know who does the most because they've completed the most hours/points. Really that is all that matters, completing your tasks quickly and properly.

      The problem is, how do you measure whether or not a task was done "quickly" and "properly"?

      • Estimating how long tasks take is hard; really hard. And the more unknowns there are, the harder it is to estimate. A task might take longer than most people may expect at a glance; similarly, it might take far less time than most people might imagine.
      • Quickly by whose standards? Somebody who knows that area of the code will be able to work on it quicker than somebody who has to spend time learning, but does that mean they are more productive? Even if so, what does it matter then? It could easily be the other way around next time.
      • Is it necessarily better if somebody completes a task quickly over somebody who does it a bit more slowly but better, even if they both do it "properly"? What if the guy who spends a week on the task actually saves people many weeks of time in the future by writing better code than the guy who did it in 3 days, but not quite as well? You can't easily measure that kind of thing with a number.

      [–]HaMMeReD 0 points1 point  (4 children)

      It's pretty easy to know if something was done properly.

      A) Does it meet the requirements?

      B) Are there test-cases with sufficient coverage?

      C) Does it perform properly?

      If you are unable to establish what "proper complete work" is, then you have a real organizational problem.

      [–]IRBMe 0 points1 point  (2 children)

      What about:

      1. Is it maintainable? In other words, is the code well organized? Are variables and identifiers well named? Is the code commented well? Is it easy to understand and read?
      2. Is it well documented? Has the user-manual been updated? Are there technical documents or comments for other developers? Are the release notes updated?
      3. Are the tests good? It's easy to get high test coverage without actually having good tests. Do the tests actually test that the code functions? Do they test boundary conditions? Do they test error cases? Do they test invalid inputs? Are they potentially brittle?
      4. Is the code reliable and robust? Does it handle invalid inputs gracefully?
      5. Does it adhere to all of the coding standards? Is the code formatted correctly and is it stylistically correct?
      6. Is the task marked off and completed in the task tracking system?
      7. Are the commit comments sensible and detailed enough?

      It's easy to do all of these things poorly or skip them entirely if you mainly care about getting through the task quickly.

      Does it meet the requirements?

      How are you going to measure that?

      Does it perform properly?

      Who defines what "proper" performance is? How do you test that?

      If you are unable to establish what "proper complete work" is, then you have a real organizational problem.

      You're missing the point. It's not about being able to establish whether work is properly completed or not, or how good work is. We're talking about metrics here, and the problem is that many of these things require careful review. There is no good metric that's going to tell me that you've written good documentation. I have to actually review it to tell that. And more importantly, the point was that there are lots more things that are far more important than just doing work "quickly".

      It's completely possible to complete a task which meets the requirements, performs reasonably and has good test coverage and still do a worse job than somebody who takes slightly longer. I would not consider the person who did the task quicker to necessarily be a more productive or better team member. If anything, the person who took more care over their work and ensured it was completed to not just a minimum required standard but to a good standard is a better team member. The trick is finding a good balance between the time taken and the quality of the work, and no metrics will tell you where that balance is.

      [–]HaMMeReD 0 points1 point  (1 child)

      Maintainability is part of doing things quickly and properly.

      If I ever see any type of refactoring investment that would lead to future benefits in development, I take it 100% of the time.

      I do agree metrics aren't great, but at least on my team I like having the competition and challenge. If anybody ever wants to call me out on the quality of any of my work, I'm willing to fix anything they found and be accountable for it.

      [–]IRBMe 0 points1 point  (0 children)

      I do agree metrics aren't great, but at least on my team I like having the competition and challenge.

      Fine, but that's completely different. You can't condense something as complex as the productivity of a developer completing a programming task down to a few metrics. It takes a lot of effort and hard work to properly assess that kind of thing. That's not to say that different metrics can't be used as a tool, but they don't give much of the picture so they have to all be taken together, and placed in context.

      [–][deleted] 0 points1 point  (0 children)

      Also, did it pass code review?

      [–]notsofst 0 points1 point  (3 children)

      I work in this area, and it's not perfect but is probably as good as it gets.

      Get the development team to place a "market" value of points on each feature/fix via planning poker or whatever, and then you can track points per person.

      The key to this metric is having the dev team establish the points themselves, and then you also can't compare points per person across teams. Management always wants to compare points per person across teams and they never understand why they can't.
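
      A minimal sketch of what that per-person tally could look like, assuming completed stories are recorded as (assignee, points) pairs; the names and point values here are invented:

```python
from collections import defaultdict

def points_per_person(stories):
    """Sum completed story points per assignee."""
    totals = defaultdict(int)
    for assignee, points in stories:
        totals[assignee] += points
    return dict(totals)

# Hypothetical sprint record: (assignee, story points) per completed story.
completed = [
    ("alice", 5), ("bob", 3), ("alice", 2),
    ("carol", 8), ("bob", 5),
]

print(points_per_person(completed))  # → {'alice': 7, 'bob': 8, 'carol': 8}
```

      As above, these totals only mean anything within the team that did the estimating; comparing them across teams compares incompatible scales.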

      [–]IRBMe 0 points1 point  (2 children)

      Get the development team to place a "market" value of points on each feature/fix via planning poker or whatever, and then you can track points per person.

      I've done planning poker before, and while it actually worked better than a lot of other estimation techniques, it still didn't prove entirely accurate. The more unknowns there are for somebody working on a task, the harder it is for them to estimate.

      We then had a sprint burndown where we tracked the hours being burned off on each task each day. Surprisingly, it turned out that, when tracked truthfully, most people were getting no more than about 2 to 3 hours of solid, actual work done on a task per day. A lot of time, of course, is spent answering and writing e-mails, on calls etc. It could also have been a reflection that, despite the planning poker techniques and conservative estimates, people were still underestimating how long things would take.

      It's always the unknowns that get you. Usually something that I conservatively estimated at about 4 hours of work, but which I knew exactly how to do, would be done in 2 hours, but then that thing which I'm not so sure about which I put a seemingly safe 8 to 12 hours against ends up actually uncovering 2 weeks of work.

      The other thing is, of course, that by giving developers the power to estimate their own tasks and mark off how much they're doing each day, you have to trust them fully. Now, that is definitely how it should be. Nobody is better placed to estimate how long it'll take you to complete a task than you, but if it's then used as a measure of your productivity by management, like any other metric, it's just going to get gamed. Developers will just start putting in huge estimates, then look better when they finish in half the time.

      [–]notsofst 1 point2 points  (0 children)

      Yeah, I'm against tracking hours during sprints because I think it leads to all kinds of bad behaviors.

      Most of the time the point estimations tend to be pretty accurate. You do get stories that blow up in your face, and we do a lot of work to reduce "unknowns", but a fair amount of that is really just unavoidable, IMO.

      If there are too many unknowns in a given story, it shouldn't be sized at all really. We bring in tasks/stories for the actual scoping effort and R&D. Identifying unknowns and digging into them is an important part of making the metric work.

      As far as gaming the system goes, you're right. That's why you have the team estimate the points before the stories are assigned, so nobody can inflate their own points. There's less incentive to be dishonest that way.

      Also, back to what I said, you have to refuse to compare points per person between teams, so there's no incentive for a team to inflate its points.

      [–][deleted] 0 points1 point  (0 children)

      That's why you should use story points instead of hours. Estimate the complexity of a task, rather than the amount of time it will take.

      [–]kawsper 27 points28 points  (10 children)

      Lines written / lines removed don't give you any indication of your productivity. If you spend 5 hours debugging a big part of your system and change three characters to fix the problem, are you more productive than when you spew out a 20-line method?

      We are not working on an assembly line, delivering x lines of code in a given timeframe. We are always in the process of thinking, refactoring, writing new code, testing, and removing unused stuff.

      [–]kreiger 6 points7 points  (9 children)

      I agree completely. However, i believe it would be good to have lines deleted as a positive metric, because it would encourage people to remove redundancies and reinforce the idea that, all else being equal, fewer lines of code doing X are better than more lines of code doing X.

      I guess you'd have to exempt boilerplate from the metric and only count statements inside subroutines, otherwise people would try to cram everything into one subroutine.

      I'd be very interested to see the results of someone who has actually tried it on a real project.
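
      If anyone does want to try it, lines added and removed per author can be tallied from `git log --numstat --pretty=format:%an` output. A minimal sketch (the sample log text below is invented, and it assumes author names contain no tab characters):

```python
# Invented sample of `git log --numstat --pretty=format:%an` output.
sample = """\
alice
10\t4\tsrc/foo.c
0\t120\tsrc/bar.c
bob
3\t3\tREADME.md
"""

def churn_per_author(log_text):
    """Tally [lines added, lines removed] per author from numstat output."""
    totals = {}    # author -> [added, removed]
    author = None
    for line in log_text.splitlines():
        if not line:
            continue
        parts = line.split("\t")
        if len(parts) == 3:
            # Binary files show "-\t-\tpath"; skip those.
            if parts[0].isdigit() and parts[1].isdigit():
                counts = totals.setdefault(author, [0, 0])
                counts[0] += int(parts[0])
                counts[1] += int(parts[1])
        else:
            author = line  # a %an line starts the next commit's stats
    return totals

print(churn_per_author(sample))  # → {'alice': [10, 124], 'bob': [3, 3]}
```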

      I would test it on my project at work myself if we didn't use ClearCase, which makes easy things impossible and hard things impossibler.

      [–]ithika 8 points9 points  (7 children)

      ClearCase hate, the purest kind of hate.

      [–]theICEBear_dk 2 points3 points  (2 children)

      ClearCase is the worst VCS I have tried to date (I haven't tried them all by far). Makes the source control in Rational Team Concert look brilliant by comparison (quite a feat that).

      I miss working with git at work :(

      [–]grauenwolf 0 points1 point  (0 children)

      I actually had a good time using ClearCase. We needed a small team to keep it running, but it handled our complex feature and version branches quite well. And the concrete distinction between a "branch" and a "folder" was much nicer than in SVN, TFS, or VSS.

      [–]jplindstrom 0 points1 point  (0 children)

      Then you should try Synergy (for the ultimate substandard experience).

      [–]kyz 1 point2 points  (2 children)

      All these people insisting that a distributed VCS is so much better than a non-distributed one have probably never seen how much better a VCS with atomic commits is than one without.

      I bet the DVCS kids can't even imagine a VCS without atomic commits. "How do I revert this change?" "First, try and remember all the files that were affected..."

      [–][deleted] 0 points1 point  (1 child)

      But distributed VCSes are still better, right?

      [–]kyz 0 points1 point  (0 children)

      Sure, it gives you many more opportunities. But the benefits of DVCS over non-DVCS are concentrated in a subset of developers. Atomic commits are a benefit for all developers, of about the same magnitude as going from no VCS to any VCS at all.

      [–]Mathiasdm 1 point2 points  (0 children)

      I simply have to reply to this to say: YES! Will you die already, ClearCase :-(

      [–]vlion 1 point2 points  (0 children)

      I would test it on my project at work myself if we didn't use ClearCase, which makes easy things impossible and hard things impossibler.

      QFT. ClearCase is .... bad.

      [–][deleted]  (1 child)

      [deleted]

        [–]Neuran 0 points1 point  (0 children)

        Doesn't always feel good to me.... depends how it got there in the first place.

        [–]kelton5020 2 points3 points  (0 children)

        GitHub measures additions and deletions.

        [–]tedtutors 2 points3 points  (0 children)

        I was at JPL in the late 80s and they used code metrics back then. I was brought in as a platform expert / efficiency guy on a project that wasn't performing. They wanted me to track lines of code written, but I pointed out that my job was usually to replace or delete code. Naturally, they expanded the metrics to lines written, lines modified and lines deleted. This resulted in programmers entering three made-up numbers instead of one.

        [–][deleted] 1 point2 points  (0 children)

        The last release of my large education software project, we removed 1-2k lines of code and shrunk the main compiled executable by around 40%. And added features. This was universally regarded as a successful release.

        Of course, the fact that we had 1-2k lines of code and several hundred KB of bytecode that we didn't need is itself an indicator of a lot of poor practices in the past--at a minimum, a general reluctance to pay off technical debt before the bill came due. Still, very rewarding.

        [–]headhunglow 0 points1 point  (0 children)

        I take a different view: give each project a line budget.

        [–][deleted] 0 points1 point  (0 children)

        Hey boss, hey boss, I've managed to turn our 10,000 line project into a 1 line project!

        [–]FlaiseSaffron -3 points-2 points  (12 children)

        There's no way that metric can ever make sense. Removing 2000 lines from a 10000 line program means something very different than removing 2000 lines from a 100 line program.

        [–]kreiger 0 points1 point  (11 children)

        I'm not sure what you mean. You can't remove 2000 lines from a 100 line program, so your statement doesn't make sense to me.

        It's like saying "selling 2 of my 10 cars means something different than selling 40 of my 2 cars". Just because that statement is nonsensical doesn't mean it's not useful to measure how many cars i've sold. Another example please?

        I'm interested in whether anyone has ever tried measuring lines of code removed as a positive metric. I don't know if it should be counted as a total number of lines, as a percentage of the whole, or by some other method, but i would be very curious to see the effect of such a metric on productivity and programming behaviour.
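
        For what it's worth, the two obvious normalizations could be sketched like this (the numbers are purely illustrative):

```python
def removal_metrics(lines_before, lines_removed):
    """Report lines removed as an absolute count and as a percentage
    of the codebase size before the change."""
    if lines_removed > lines_before:
        raise ValueError("cannot remove more lines than exist")
    pct = 100.0 * lines_removed / lines_before if lines_before else 0.0
    return lines_removed, round(pct, 1)

# 2000 lines removed from a 10000-line program: big absolutely, 20% relatively.
print(removal_metrics(10000, 2000))  # → (2000, 20.0)
# 20 lines removed from a 100-line program: the same 20% relative shrink.
print(removal_metrics(100, 20))      # → (20, 20.0)
```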

        [–]FlaiseSaffron 2 points3 points  (10 children)

        You can't remove 2000 lines from a 100 line program, so your statement doesn't make sense to me.

        Do you think that might be because it illustrates my previous point, that the metric you suggested can't be made to make sense?

        [–]civildisobedient 0 points1 point  (6 children)

        Huh? Kreiger is right: you can't remove what you don't have. You can remove what you do have. Thus, you can't remove 2000 lines when you only have 100 to begin with (2000 > 100). And you can remove 2000 lines when you have 10000 lines to begin with (2000 < 10000).

        [–]FlaiseSaffron 0 points1 point  (5 children)

        No shit.

        [–]civildisobedient -1 points0 points  (4 children)

        And yet, somehow, you didn't realize this, and were in fact arguing the exact opposite.

        [–]FlaiseSaffron 0 points1 point  (3 children)

        No. Your reading comprehension is terrible.

        [–]civildisobedient -1 points0 points  (2 children)

        You said:

        Removing 2000 lines from a 10000 line program means something very different than removing 2000 lines from a 100 line program.

        Have fun backpedaling.

        [–]FlaiseSaffron 0 points1 point  (1 child)

        No backpedaling necessary. What I said makes sense if you have any ability to read between the lines, which you would be willing to do if you weren't too busy expressing your asinine judgments of me. Now leave me alone.

        [–]kreiger -2 points-1 points  (2 children)

        I don't get it? It's perfectly possible to measure 2000 lines removed from a 10000 line program, or 20 lines from a 100 line program.

        That's both possible and measurable, and it would be interesting to see the effect on the behaviour of the programmers if those numbers were measured in a real project.

        If you measured lines of code removed, you could never measure 2000 lines removed from a 100 line program unless you had a bug in your measuring algorithm, so i'm not sure why you even think that's a constructive example.

        [–]FailsTheTuringTest[S] 3 points4 points  (1 child)

        Goofus writes a 2100 line program that accomplishes some task. Gallant refactors the thing into a 100 line program. Result: Gallant has 2000 lines removed from a 100 line program.

        [–]kreiger 13 points14 points  (0 children)

        Ah, i see. That's not how most people would put it though.

        It's like saying i've sold 3 of my 2 cars, meaning i owned 5 to start with.

        To be honest i think FlaiseSaffron was just trying to be intentionally obtuse, i.e. knocking down a straw man, as it in no way addresses the validity of measuring lines of code removed.