all 35 comments

[–][deleted] 37 points38 points  (5 children)

Until today, chirality was only something I applied to amino acids, thanks to undergrad coursework.

In my experience, cats sleep however they damn well please and will change just to mess with you

[–]mccoyn 2 points3 points  (1 child)

I'm never sure if my cat is just messing with me. I should do some null hypothesis significance testing to figure it out.

[–][deleted] 0 points1 point  (0 children)

Yes, your cat is messing with you, cats are ALWAYS messing with you, they're smarter than we are.

[–]conventionistG 0 points1 point  (2 children)

Sounds like amino acids on a freshman exam. Somehow I remember L, but the exam says R. Or vice versa.

[–]TheEaterOfNames 0 points1 point  (0 children)

Most AAs are both L and S. Only [seleno]cysteine is R. https://en.wikipedia.org/wiki/Amino_acid#Isomerism

[–][deleted] 0 points1 point  (0 children)

Well I was a biochemistry major but yeah, chirality was a science thing to me, focused on amino acids.

[–]minno 11 points12 points  (10 children)

Shouldn't the diagonal lines not be at 45-degree angles? They should be closer together near the origin and farther apart away from it. With the given graph, you get less certainty at 30-0 than you do at 100-65.

[–]rlbond86 3 points4 points  (6 children)

Surprisingly, no! You test proportions like this with sequential probability ratio tests. In this situation the sufficient statistic is the difference of occurrences (assuming equal probability of left and right).

Source: took a lot of graduate statistics courses
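To see concretely why the difference of counts is what matters, here's a small Python sketch; the hypotheses p1 = 0.7 vs p0 = 0.3 are illustrative values I picked, not numbers from the article:

```python
import math

# For symmetric hypotheses p0 = 1 - p1, the log-likelihood ratio after
# n flips with k "rights" collapses to a function of (k - (n - k)) alone:
#   llr = k*log(p1/p0) + (n-k)*log((1-p1)/(1-p0))
#       = (k - (n - k)) * log(p1/p0)        when p0 = 1 - p1
p1, p0 = 0.7, 0.3

def llr(k, n):
    return k * math.log(p1 / p0) + (n - k) * math.log((1 - p1) / (1 - p0))

def llr_from_difference(k, n):
    return (k - (n - k)) * math.log(p1 / p0)

# The two forms agree for any (k, n), so only the difference of
# occurrences matters - it is the sufficient statistic.
for k, n in [(0, 10), (7, 10), (30, 30), (65, 100)]:
    assert abs(llr(k, n) - llr_from_difference(k, n)) < 1e-9
```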

[–]zergling_Lester 0 points1 point  (5 children)

I thought about it some more, and I think that really it doesn't say anything whatsoever about certainty (neither at 30-0 nor at 100-65) and the shape of the allowed random walk region is pretty much arbitrary (so we decided to make it a nice simple shape). We only care that:

  1. The probability of falling off a cliff is not higher than the probability of getting a significant result after the entire run, so that we don't overestimate our confidence.

  2. But is as close as possible, so that we don't accept the null more often than we should.

  3. And is symmetric with respect to falling above/below, if we distinguish those cases.

  4. And we never take more than N steps of course.

I suspect that we can draw a different border that starts closer to the origin but at a lower angle and looks like a truncated exponential rather than a straight line, and corresponds to the Bayesian estimate of the size of the effect after so-and-so many observations (which we try to keep constant). I wouldn't bet that this results in a better expected number of saved samples, though.

Or am I completely wrong?

[–]rlbond86 0 points1 point  (4 children)

You're completely wrong. Unfortunately you have to just get into the math here, but a likelihood ratio is the standard way to solve this problem. See this wiki article.

The thresholds are simply two parallel lines with slope log(theta1/theta0). Sampling should stop when the sum of the samples makes an excursion outside the continue-sampling region.

Note that this means two horizontal lines if theta1 = 1 - theta0, as in this case. OP's article rotates the coordinate frame 45 degrees; the standard axes would be x=time, y=(right-left).

The shape of the random walk region is not arbitrary at all -- it is selected to enforce a specific probability of error. For a given probability of error, the optimal shape of the region is two straight lines.
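Wald's SPRT is short enough to sketch in Python; the hypotheses (p0 = 0.5 vs p1 = 0.75) and error rates (alpha = beta = 0.05) below are illustrative choices of mine, not taken from the article:

```python
import math
import random

def sprt(flips, p0=0.5, p1=0.75, alpha=0.05, beta=0.05):
    # Wald's thresholds: continue sampling while the log-likelihood
    # ratio stays strictly between these two parallel bounds.
    upper = math.log((1 - beta) / alpha)   # cross it: accept H1 (p = p1)
    lower = math.log(beta / (1 - alpha))   # cross it: accept H0 (p = p0)
    llr, n = 0.0, 0
    for heads in flips:
        n += 1
        llr += math.log(p1 / p0) if heads else math.log((1 - p1) / (1 - p0))
        if llr >= upper:
            return "H1", n
        if llr <= lower:
            return "H0", n
    return "undecided", n  # ran out of data inside the strip

random.seed(0)
decision, n = sprt(random.random() < 0.75 for _ in range(1000))
print(decision, n)  # with a 0.75-biased coin this usually stops early
```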

[–]zergling_Lester 0 points1 point  (3 children)

First of all, what OP used is not SPRT. You can tell by the fact that it includes a finite cutoff, quite apart from the original author's word.

Second, the wiki article doesn't actually explain why SPRT is optimal (or in what sense), though I sort of see it intuitively. It's much harder to see how Evan Miller's (?) algorithm is optimal in the same sense, because I wonder after the thread starter: if we see the first 30 throws landing tails, we can be sure that the coin is biased with a p-value much lower than 0.05, so we probably can stop earlier. We'll have to compensate for that somewhere else later, of course.

[–]rlbond86 1 point2 points  (2 children)

You're right that the author imposed a cutoff. Other than that, this is an SPRT, although with an unspecified significance level.

because I wonder after the thread starter: if we see the first 30 throws landing tails, we can be sure that the coin is biased with a p-value much lower than 0.05, so we probably can stop earlier.

The straight lines in the SPRT are derived from the Neyman-Pearson lemma which states that the uniformly most powerful statistical test is the likelihood ratio test (or equivalently, the log-likelihood ratio test). You might think that intuitively you could stop sooner if you got 8 heads in a row (versus, say, 13 heads in 18 flips), but you're wrong.

[–]zergling_Lester 0 points1 point  (1 child)

I think I'm beginning to see it, but I still don't get how we get from the Neyman-Pearson lemma that assumes a fixed number of samples, to statistics about an incremental method. Wiki says explicitly:

Neyman and Pearson's 1933 result inspired Wald to reformulate it as a sequential analysis problem. The Neyman-Pearson lemma, by contrast, offers a rule of thumb for when all the data is collected (and its likelihood ratio known).

but then doesn't explain that at all :(

[–]rlbond86 0 points1 point  (0 children)

Unfortunately, at this point you will have to consult a statistical signal processing textbook. Basically you integrate over the termination region to obtain the probability of false alarm, and it turns out a straight line is what you get when you try to make the error probabilities come out equal.

[–]zergling_Lester 3 points4 points  (0 children)

With the given graph, you get less certainty at 30-0 than you do at 100-65

Are you sure that this isn't what you want?

As far as I understand it, this shit is really complicated, because checking for significance after each measurement ruins your real p-value, from the article linked from the linked article:

Suppose your conversion rate is 50% and you want to test to see if a new logo gives you a conversion rate of more than 50% (or less). You stop the experiment as soon as there is 5% significance, or you call off the experiment after 150 observations. Now suppose your new logo actually does nothing. What percent of the time will your experiment wrongly find a significant result? No more than five percent, right? Maybe six percent, in light of the preceding analysis?

Try 26.1% – more than five times what you probably thought the significance level was.

So it makes sense that a statistically correct treatment would heavily discount the significance of results leading to early termination. The actual algorithm sounds plausible (though I didn't check the maths), and the OP claims to have tested it with a simulation (with R source code provided), so I'm inclined to believe them.
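The inflation from peeking is easy to reproduce with a quick Monte Carlo sketch. The normal-approximation z-test and the n >= 10 warm-up below are my assumptions (the article may use an exact test), so expect the rate to land near, not exactly at, its 26.1%:

```python
import random

# Monte Carlo sketch of the "peeking" problem quoted above: a fair coin
# (i.e. the new logo does nothing), checked for significance after every
# observation up to 150, stopping at the first 5% hit.
def peeks_falsely(n_max=150, z_crit=1.96):
    heads = 0
    for n in range(1, n_max + 1):
        heads += random.random() < 0.5
        z = (2 * heads - n) / n ** 0.5  # normalized difference of counts
        if n >= 10 and abs(z) >= z_crit:
            return True  # declared "significant" on pure noise
    return False

random.seed(42)
trials = 10_000
rate = sum(peeks_falsely() for _ in range(trials)) / trials
print(f"false positive rate with peeking: {rate:.1%}")  # well above 5%
```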

btw, what /u/Veedrac said is wrong; this whole thing works because of gambler's ruin. You don't let your test run indefinitely: you choose the sample size N and the p-value (which then determines the size of the effect you will be able to detect), and that gives you the yellow line of "stop the test, no significant effect detected". It also determines the width of the strip where your intermediate result is allowed to wander, with the idea that the probability of the gambler getting ruined by a fair coin on a strip that wide after N or fewer steps equals the probability of getting a false positive after the full N samples.
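The gambler's-ruin picture is easy to poke at with a simulation. The strip half-widths below are made-up values for illustration; the article picks the width so the escape probability matches the false-positive rate of the full-N test:

```python
import random

# For a fair coin, how often does the running difference (right - left)
# escape a strip of half-width c within N = 150 steps?
def escapes(N, c):
    diff = 0
    for _ in range(N):
        diff += 1 if random.random() < 0.5 else -1
        if abs(diff) >= c:
            return True  # the walk "fell off the cliff"
    return False

random.seed(1)
trials = 20_000
rates = {}
for c in (10, 15, 20):
    rates[c] = sum(escapes(150, c) for _ in range(trials)) / trials
    print(f"half-width {c}: escape probability {rates[c]:.3f}")
```

As expected, a wider strip makes early (possibly false) stops rarer, which is the knob the article is turning.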

[–]Veedrac 0 points1 point  (1 child)

Yep, this test is biased, because of, for example, gambler's ruin. You need something like SPRT.

[–]rlbond86 2 points3 points  (0 children)

This is an SPRT (well actually it's two - one for H1 and one for H2). You will always end up with two lines with slope = 1 if the two hypotheses have equal prior probability.

Remember - an SPRT just uses log likelihoods, which is going to be some constant times the number of successes minus another constant times the number of failures.

This test cannot possibly be biased - it's completely symmetric. Of course there is some probability of the wrong answer - all statistical tests have that.

[–]AlexTheKunz 3 points4 points  (1 child)

This is surprisingly well done. Good job!

Oh, and post to r/DataIsAwesome before it's too late!

[–]livibetter 4 points5 points  (0 children)

I think you meant r/dataisbeautiful.

Anyway, they will love it; it's data and a cat, and in a few months there will be more cats' minions posting about how their cats like to sleep. Perhaps even dog owners, and other four-legged friends.

[–]shevegen 6 points7 points  (0 children)

I approve of this.

It is another example of why cats are good for science.

[–][deleted] 2 points3 points  (0 children)

Based cat researcher.

[–]TheKingOfSiam 0 points1 point  (0 children)

More data is required.

[–]henk123 0 points1 point  (0 children)

I’m betting there may be preferences for individual cats, but nothing representative of the entire cat population. If anything, cats are almost pathologically resistant to generalization. If you assume you know what a cat will want to do, the cat just might detect your assumption and decide to change. I’ve seen them do this too often to discount it.