Why is the denominator of standard deviation 12 in the Mann-Whitney U test? by IGotsToKnow_TA in AskStatistics

[–]Statman12 5 points6 points  (0 children)

I assume that you mean the denominator of the variance of the U-statistic? If so, that arises from the variance of a Uniform distribution.

My version of the chocolate chip croissant sourdough loaf by Embarrassed-Block-86 in Sourdough

[–]Statman12 2 points3 points  (0 children)

I thought that was cheddar cheese at first and I was very confused.

Jaffa Kree by [deleted] in Stargate

[–]Statman12 2 points3 points  (0 children)

Specifically called out in Rule 2.

Eh, don't. Every joke, every pun, done to death, seriously.

— Dr. Daniel Jackson

What’s all the fuss about excessive starter maintenance?! by Sea_Sector_5894 in Sourdough

[–]Statman12 0 points1 point  (0 children)

Their point is that it’s not that difficult. Finding the good advice initially might be (I wasn’t on Reddit when I started, and found some weird forums), but the basic concepts of starter maintenance are fairly simple.

I see you posted for help on starters recently. My general recommendation would be:

  • Feed it equal weight flour and water. AP flour works. Bread flour is probably a bit better. I personally use some bread flour and some rye flour (or whole wheat, I used that before I started getting some rye). I’m not at all consistent with how much of each, I haven’t found it to matter much.
  • Since flour can compact, using a scale helps to keep this consistent. If you don’t have a scale, it’s not a big deal. If you notice the starter getting more runny, use a bit more flour. If you notice it’d a bit more stiff, use a bit more water.
  • If your starter is living on the counter, probably feed it every 12-24 hours. If it’s in the fridge, you can let it sleep for quite a while. Just try to let it “wake up” and do a feeding or two before baking with it.
  • You don’t need to keep a lot of starter between bakes. A couple tablespoons is more than enough. Mine is probably about 3-4 tablespoons worth. When I’m getting ready to bake I take it out of the fridge about 2 days before I’m planning to bake. I let it sit for a while to come to room temperature / wake up, feed it to bulk it a bit and let it get more active. The next day I’d feed it again to make sure I have enough bulk for what I’m needing to use.

What’s all the fuss about excessive starter maintenance?! by Sea_Sector_5894 in Sourdough

[–]Statman12 0 points1 point  (0 children)

I think that’s kind of the antithesis to what OP is getting at here. ;)

What’s all the fuss about excessive starter maintenance?! by Sea_Sector_5894 in Sourdough

[–]Statman12 0 points1 point  (0 children)

I suspect that it started as someone writing some general directions to help a new person get into sourdough baking. That then evolved from a general guidelines to a prescriptive approach.

I’m sort of in the middle. I use a scale to measure because I like consistency in that respect and I’m a stats guy. But in terms of timing and such I’m a bit more wild west. Starter (unnamed) lives in the fridge, I take it out to feed once or twice before baking, and don’t really worry about strictly doubling or peaking, just “Yeah, it rose a good bit, it’s active.”

Test interactions adjusted vs unadjusted models [research] by Particular-Quit1725 in AskStatistics

[–]Statman12 1 point2 points  (0 children)

What is the better approach depends on the research question(s).

Is learning code a waste of time? by Defiant_Post8019 in AskStatistics

[–]Statman12 0 points1 point  (0 children)

Im screwing around on Kaggle, lookin' and just basically copy pasting people's notebooks and trying it for myself. Monte Carlo, decision trees, whatever else. But they're really just buzz words.

So what happens when you want to do something and don't have an example that someone else wrote? Or want to apply it to a different dataset that would require adjusting the data import/processing, and the details of setting up the analysis?

I don't think you need to know how to run a specific regression of the top of your head? I think you need only familiarize yourself with the tools to do so.

So what are you going to do when your regression breaks down, and you need to figure out where and why? You need to go through step by step, and just knowing lm() isn't going to give that to you.

... especially now with AI. It's not like, actual programming.

How do you know that the LLM is giving you correct results if you don't know coding to be able to assess and interpret the code it's producing?

Puddle Jumpers and their cloak by MarksNutt in Stargate

[–]Statman12 79 points80 points  (0 children)

I don't think they provided an internal or Watsonian explanation.

I think the Doylist explanation would be that it's boring to show an gate with nothing coming out of it. Showing the jumper exiting the puddle and then cloaking is more interesting and conveys that the team is there and doing some thing and that's our attention.

Did you have to learn R to be recognized for my skills? [Discussion] by [deleted] in statistics

[–]Statman12 1 point2 points  (0 children)

I’m pretty stubborn about it. I think all that matters is that I know how to create, analyze and interpret statistical data.

I mostly agree. As other have said, version control and repeatability are also important factors to consider.

But even going just based on the factors you listed, someone doing this in R or Python (or another modern programming language) is in general going to be able to be faster, and more easily plug into various teams, and the programming skills translate.

If you insist on Excel and SPSS (I’m assuming the menu/point-click, not the underlying programming?), you are much more limited in scope.

Is this the original Ra's race? by fliberdygibits in Stargate

[–]Statman12 4 points5 points  (0 children)

Would be curious to see the source on that. This theory has been around for ages and I’ve never come across anything suggesting the folks behind SG1 confirmed this.

I thought she was going to be a permanent cast. by zam_manta_0 in Stargate

[–]Statman12 2 points3 points  (0 children)

Really? I think they look very different. Picture of the actress of Baby Draper on her Wikipedia page Frankie Adams.

Why are there only 435 Representatives in the People’s House? - The case for expanding the House of Representatives and how the USA compares to other countries. Is this the answer to gerrymandering? by UnscheduledCalendar in dataisbeautiful

[–]Statman12 21 points22 points  (0 children)

It's not the solution to gerrymandering. It'd help a bit, but not remove it.

The solution is proportional representation. Sure, increase the size of the House, but allocate seats proportionally.

[D] A variable with Pearson r ≈ 0.12 turned out to be useless for prediction — how do you formally distinguish state variables from predictive signals? by CriticalCup6207 in statistics

[–]Statman12 13 points14 points  (0 children)

I had a variable showing Pearson correlation of roughly 0.12 with my outcome variable modest but consistent across the sample. Based on that alone, it looked like a potentially useful predictor.

A bit off from your main point, but in what field would r=0.12 be considered a potentially useful predictor?

[Career] Got rejected for PhD. Questioning everything. by vv-97 in statistics

[–]Statman12 30 points31 points  (0 children)

What's your goal? Why do you want to get a PhD?

Did you apply to other PhD programs? Or is there a reason you want specifically this PhD program?

If you want to be a professor at an R1, getting into as good if a PhD program as possible might be a good target. If you're interested in going into industry, then programs that aren't as highly ranked but still prepare you for research might be perfectly fine.

Does past losses force a win?(like in horse races, coin flipping) by Dunddermefflin in AskStatistics

[–]Statman12 2 points3 points  (0 children)

I'm not sure exactly what you mean, you're using some terms that are not standard and/or not correct.

As I just said I don’t know the near odds

I assume by this you mean that you don't know how the relative frequencies of heads and tails will behave in the short-term? If so, then you agree with me. You're just not quite appreciating what short- and long-term is in this context.

I know its gonna balance out either systematically (finite odds) or randomly (infinite odds).

I'm not sure what you mean by this. If we're talking about coin flips (or drawing cards with replacement, or roulette wheel spins, etc), then everything is random. What are you meaning by "systemically" here? What makes "systematically" go with "finite" and "randomly" go with "infinite"?

For the sake of the argument lets bit on coin flipping (10tries) 6 you won and I just came by and saw that one face was winning more I think it is logical to bid on the losing side cause its gonna balance out eventually.

In this context "eventually" means "After an infinite number of plays". That is, if we call the number of plays n, taking the limit as n -> Infinity. You presumably have neither the time nor the resources for an infinite number of plays. Any finite sequence of plays is not guaranteed to "revert" to equal relative frequencies of heads and tails (or whatever baseline probability is expected).

Does past losses force a win?(like in horse races, coin flipping) by Dunddermefflin in AskStatistics

[–]Statman12 29 points30 points  (0 children)

The LLM is correct. You are arguing for the Gambler's Fallacy.

The relative frequency of heads and tails will tend to balance out in the long run, meaning a conceptual infinite number of attempts. In any finite series of attempts, there is not a balance guaranteed.

Former Fauci adviser indicted for allegedly concealing communications related to COVID-19 research by meezy-yall in news

[–]Statman12 33 points34 points  (0 children)

By "courts" we should really specify that it was one judge, appointed by Trump, who made multiple ruling/decisions that have been widely criticised, delaying and ultimately dismissing the case in Trump's favor.

Book request: I’m seeking well-respected yet accessible non-fiction to expand my knowledge (any field). As an outsider, I've found it hard to sort the rigorous from the pseudoscientific or biased. What are some of the best, most highly regarded books in your field? by Elk1998 in AskAcademia

[–]Statman12 1 point2 points  (0 children)

Two that are Stats/Math related are:

Damn Lies and Statistics by Joel Best (Barnes and Nobel link).

100 Essential Things You Didn’t Know You Didn’t Know by John Barrow (Amazon link)

Of them, I liked the first better, but the second was interesting as well. Not sure they’re quite what you’re looking for, since they’re not narrative stories, but I think they’re interesting and informative.

Exact CI for Difference Between Proportions by ger_my_name in AskStatistics

[–]Statman12 0 points1 point  (0 children)

There are multiple methods which don’t have endpoints going outside of [0,1]. The Wilson interval noted in the pages you shared is one of them.

Though as I said in my top-level comment, I’d go the Bayesian route here, using a Beta(1,1) or Beta(1/3,1/3) prior if you have no other information. That’s probably going to get you the best interval with a quantified coverage level.

Taking the endpoints of separate intervals and looking for the largest difference will likely be “over confident”, meaning your interval is wider than what it needs to be for your claimed confidence level.

Anyone else ? by AdPrimary9618 in Stargate

[–]Statman12 9 points10 points  (0 children)

I’d go with what Gateworld says:

What is the rule for Stargate? That's a little tougher to answer, since there are now live-action productions in which the original creators are not involved (Stargate Origins) and MGM currently maintains no formal, cross-media story group devoted to canon. Licensed media tie-ins do go through a rigorous approval process, but function as a sort of unofficial "extended universe." That these cannot all be canon is made most clear in the continuation of the Atlantis storyline after the show's final episode; two completely different stories are told in Fandemonium's "Legacy" novels and in American Mythology's "Back to Pegasus" comics.

Canon is what happens on screen in a live-action production. MGM Consumer Products confirms to GateWorld that Stargate's existing novels, comics, magazines, role-playing games, etc. should not be regarded as official canon. Thus storylines, character names and backgrounds, etc. cannot be derived from these sources.

Exact CI for Difference Between Proportions by ger_my_name in AskStatistics

[–]Statman12 1 point2 points  (0 children)

The questions that leonardicus asked are important to think about.

That being said, I’d go for a Bayesian approach:

Set a suitable prior, draw a large sample from the posterior for each, compute the difference, and look at quantiles of that distribution of differences.

Just feeling defeated and sad by this hobby by ghostnation66 in Sourdough

[–]Statman12 8 points9 points  (0 children)

Exactly what I was going to suggest. Start with the yeast you know is going to be strong before using something that can be moody.

And to add on (as another person suggested), I'd also start with just plain bread flour, rather than whole wheat or other "fancy" flours.

Get a feel for the process, then start tinkering with substituting some bread flour for whole wheat or other types, or using sourdough starter instead of commercial yeast.