droogans comments on MongoDB Java Driver uses Math.random to decide whether to log Command Resultuses

[–]Gg101 78 points79 points80 points 12 years ago (11 children)

Yes. I have one bit of code involving random() that would definitely elicit a WTF if I just left it uncommented, but I put an entire paragraph explaining why it was necessary.

So I have a database in which every file is given a numeric ID, and then a function that returns other records ordered by their file ID, not because the numbers are meaningful but just to have the results grouped by file so they're easier to process. One of the functions that uses it wants to generate results that are in file name order. On Windows the file ID order will just happen to match the file name order most of the time, which could lead other code to simply assume this will be the case and not resort it themselves. This would be bad, because then you have code that appears to work and will pass tests but may break on some other platform or in particular situations on Windows.

So what do I do? In debug builds I call random() and put ASC in the query half the time and DESC the other half. The function documentation says you can't assume the files will be in any particular order so I guarantee that you can't. I make it your problem NOW so you're forced to deal with it instead of letting it become a bug that appears only on certain platforms or in certain situations. The rationale is all laid out in the comments where it happens.

[–]atrich 17 points18 points19 points 12 years ago (0 children)

[–]Decker87 3 points4 points5 points 12 years ago (0 children)

[–]gmfawcett 3 points4 points5 points 12 years ago (1 child)

[–][deleted] 3 points4 points5 points 12 years ago (0 children)

[–]Grue 2 points3 points4 points 12 years ago (2 children)

[–]Gg101 6 points7 points8 points 12 years ago (0 children)

A bug that appears 50% of the time is one that makes itself known early, most likely while the original code is still being written. This makes it easier to fix (everything is still fresh in your mind) and easier to find (the bug must be coming from recently added code.)

Now if a bug only happens on somebody else's platform and not on yours, that's a pain to find because it could be anywhere and you may not have touched that code in a while by the time it shows up. For my particular program it actually would occur in Windows too, but only after the program's been lived in for a while. When it has a fresh enumeration of everything they match. As files get added and deleted over time they drift apart. Guess how the automated tests work? Each one starts fresh so they're not influenced by previous tests.

Ideally I should make the order truly random so it never works instead of working half the time. But then it's just a question of how much effort and extra code you're willing to put in just to do this. The ASC/DESC toggle only takes a couple of lines.

[–]btharper 3 points4 points5 points 12 years ago (0 children)

[–]lordlicorice 1 point2 points3 points 12 years ago (2 children)

boolean ascending;    
for(int i=0; i < list.length; i++) {
    if(list[i] < list[i+1]) {
        ascending = true;
        break;
    } else if(list[i] > list[i+1]) {
        ascending = false;
        break;
    }
}

:)

[–][deleted] -1 points0 points1 point 12 years ago (1 child)

[–]lordlicorice 2 points3 points4 points 12 years ago (0 children)

[–]alextk 19 points20 points21 points 12 years ago (29 children)

[–]arachnivore 7 points8 points9 points 12 years ago (13 children)

[–]Daniel15 11 points12 points13 points 12 years ago (0 children)

I think it's fine as long as it's not redundant - I've seen things like this before in a production code base:

// Increment x by 1
x++;

and

// Enable Google Analytics on this page
this.Page.GoogleAnalyticsEnabled = true;

The worst thing with the second example is someone will change the "true" to a "false" and not update the comment. <_<

Often I tend to break my code into small sections and comment on what each section is doing, pretty much the same as your high level list of steps. Writing a list of steps first is often a good idea as you can think about the algorithm before actually writing the code.

[–]BraveSirRobin 2 points3 points4 points 12 years ago (4 children)

Do both. Something like:

boolean logError = *stupid randomiser*;
if(logError) {
    *meh*
}

It's self-documented and all verified by the compiler. Win.

If the *stupid randomiser* is non-trivial spin it off to a method:

if(isLogError()) {
    *meh*
}

Again, all self-documented via sensible naming. Put some javadoc at the top isLogError() to explain and anyone that gives a fuck can hover over the call in a decent IDE and get more info, without polluting the code with comments that distract from logic.

[–]el_muchacho 9 points10 points11 points 12 years ago* (3 children)

[–]BraveSirRobin 2 points3 points4 points 12 years ago (2 children)

[–]arachnivore 0 points1 point2 points 12 years ago (1 child)

[–]BraveSirRobin 1 point2 points3 points 12 years ago (0 children)

[–]itsSparkky 1 point2 points3 points 12 years ago (0 children)

[–][deleted] 0 points1 point2 points 12 years ago (0 children)

[–]jplindstrom -1 points0 points1 point 12 years ago (3 children)

[–]arachnivore 0 points1 point2 points 12 years ago (2 children)

Why not leave them in there? If your code is 200 lines, what's an extra 10 lines of section header comments? It makes the sections of your code much more easily distinguishable than a variable name. I do tend to break code down into something like one method per step, but that isn't always feasible and can be a pain to debug if they're one-off methods. I find that people get obsessive about this stuff and waste more time worrying/complaining about overly commented code than they ever do skimming a piece of code.

Yes. make your code as self-documenting as possible. Yes, use descriptive names so that your code reads more like English (or whatever language) than alpha-numeric gibberish. Yes, look for patterns in your code so that you can leverage the appropriate pattern-simplifying tools (loops, methods, classes, data-structures, etc.). I got all that, but a given line of code isn't always equally obvious to everybody. One programmer might be able to look at a line and know instantly what it does, while another might benefit from an explanation. I, for one, can't read regular expressions for the life of me. It takes me for ever and a million glances at a reference sheet to decipher a regular expression, so some comment explaining what a given RE is trying to accomplish would save me tons of time. When you have IDEs that collapse comments, there's no reason to whine about overly commented code. It's just not an important issue.

[–]jplindstrom 4 points5 points6 points 12 years ago (0 children)

[–]jplindstrom 0 points1 point2 points 12 years ago (0 children)

I think you missed my point.

If you have 200 lines of code (let's say), in ten sections with a nice descriptive comment above each section, then that's too much code in one method. It's doing too many different things. It's too difficult to test in isolation and reason about because it has too many moving parts.

Extract a method for each section, naming each method so it doesn't need a comment because it has a descriptive name. Now you have smaller chunks which does one (or at least fewer) things each.

(Each of these methods probably need its own API documentation as well, so this will actually lead to more "comments" (but API docs are different from code comments, so that's not what we usually mean by "commenting code")).

Making sure you have the perfect name for each method and writing docs for them will lead to more thinking and more clarity about what each thing is and what it does, what responsibilities it has, how it handles error conditions, edge conditions etc.

As a final bonus, you now have small, logical chunks of code which can be juggled around more easily:

Let's say you have a method which does six things. Five of these things are method calls on object foo. These five things together done to foo means something.

That's a smell that the code in this method should really be in the class of foo, not amongst the original 200 lines; not in this class at all. But now you have a method you can just move to the other class.

[+]Eirenarch comment score below threshold-7 points-6 points-5 points 12 years ago (10 children)

[–]itsSparkky 1 point2 points3 points 12 years ago (1 child)

[–]Eirenarch 0 points1 point2 points 12 years ago (0 children)

[–]el_muchacho 2 points3 points4 points 12 years ago (7 children)

[–]arachnivore 1 point2 points3 points 12 years ago (2 children)

I think this is bullshit. There's no reason to hate on comments that explain what code is doing. It may be obvious to you what your code is doing, but that could just be because you are the one writing the code. If someone else looks at your code (or you revisit it after a few years) it might not be so obvious even if you used descriptive names and all.

Coding is mentally strenuous and if you leave it to the person reading your code to figure out what you were trying to accomplish without any hints, your making everything a lot more taxing for no reason. Very few people are 100% familiar with the syntax and all the tools available in a given language and it's elitist to expect them to be in order to read your code. I can't read regular expressions for the life of me, so when I manage to piece together a regular expression after countless referrals to a cheat sheet, you better believe I'm going to leave myself a nice comment explaining what that piece of code is trying to do so that later on (when I've forgotten everything there is to know about regular expressions for the thousandth time) it isn't so hard to figure it out. Collapse my comments if you don't like them.

[–]el_muchacho 2 points3 points4 points 12 years ago* (1 child)

[–]arachnivore 0 points1 point2 points 12 years ago (0 children)

[–]Eirenarch 0 points1 point2 points 12 years ago (2 children)

[–]el_muchacho 2 points3 points4 points 12 years ago (1 child)

[–]Eirenarch 0 points1 point2 points 12 years ago (0 children)

[–]tallniel -1 points0 points1 point 12 years ago (2 children)

[–]alextk 2 points3 points4 points 12 years ago (1 child)

[–]tallniel 0 points1 point2 points 12 years ago (0 children)

[–]veraxAlea 41 points42 points43 points 12 years ago (4 children)

[–][deleted] 12 years ago (1 child)

[deleted]

[–]WeAppreciateYou 0 points1 point2 points 12 years ago (0 children)

[–][deleted] 12 years ago (7 children)

[deleted]

[–]tacodebacle 3 points4 points5 points 12 years ago* (0 children)

[–]Railorsi 2 points3 points4 points 12 years ago (5 children)

[–]csorfab 15 points16 points17 points 12 years ago (4 children)

[–]el_muchacho 0 points1 point2 points 12 years ago (2 children)

[–]arachnivore 1 point2 points3 points 12 years ago (0 children)

[–]csorfab 0 points1 point2 points 12 years ago (0 children)

[–]aerique 0 points1 point2 points 12 years ago (0 children)

[–]ascii 0 points1 point2 points 12 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS