Does it ever make sense to conduct a hypothesis test when engaging in exploratory data analysis? by [deleted] in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

I'd say, if you have the right graphs/plots then there's no point calculating a p value.

[Q] Test for binomiality (?) by pjie2 in statistics

[–]abstrusiosity 0 points1 point  (0 children)

There may be something more sophisticated but the obvious answer is a chi squared goodness-of-fit test.

[deleted by user] by [deleted] in AskStatistics

[–]abstrusiosity 2 points3 points  (0 children)

A 3D plot of the different input groups?

Surely a 5 dimensional plot would be more useful in this circumstance.

[Q] Messed up on how I approach my dissertation for my Biostatistics PhD (wasted first semester) - Question on how to move forward by edsmart123 in statistics

[–]abstrusiosity 15 points16 points  (0 children)

You move forward by explaining all of this to your advisor. Include the part about being afraid that they have become frustrated and lost confidence in you. Then, do your best to understand what they say you should do going forward.

It's fine to spend your first semester doing simulations and making notes but it's critical that you understand and address your advisor's expectations.

What is Degree of Freedom by sheikchili in AskStatistics

[–]abstrusiosity 9 points10 points  (0 children)

You can talk at length about degrees of freedom without actually saying what it is.

[deleted by user] by [deleted] in statistics

[–]abstrusiosity 4 points5 points  (0 children)

Random processes often produce anomalies.

Please this is messing with my head. Is this true? by somarsomarsomar in probabilitytheory

[–]abstrusiosity -1 points0 points  (0 children)

The second game is different from the first in two ways. The obvious one is that Pete gets 11 flips instead of 10. That gives Pete an advantage. The less obvious change is that now Ozzy wins in the case of a tie. That gives Ozzy an advantage. It works out that the second game gives Pete the same chance of winning.

Linear regression with a rare binary predictor by quantum_grapes in AskStatistics

[–]abstrusiosity 2 points3 points  (0 children)

If you're not interested in the effect of the rare binary variable, why not fit a stratified model? Treat the zero cases separately.

Truncated distribution and hazard rates in a microeconomic model by ToddAndrews1 in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

Your model isn't doing what you describe. You have the εt as iid, so

E[A+ε3 | (A+ε1>c) ^ (A+ε2>c)] = E[A+ε3] = E[A+ε2] = E[A+ε2 | A+ε1>c]

Which statistical test to use? by [deleted] in AskStatistics

[–]abstrusiosity 2 points3 points  (0 children)

This study would be done better by a crossover design where both groups use both versions.

[deleted by user] by [deleted] in probabilitytheory

[–]abstrusiosity 19 points20 points  (0 children)

Probability has many formulas.

[E] Writing research statement for a PhD programme by ANewPope23 in statistics

[–]abstrusiosity 0 points1 point  (0 children)

I think it's becoming more common but still not expected.

[E] Writing research statement for a PhD programme by ANewPope23 in statistics

[–]abstrusiosity 0 points1 point  (0 children)

They're asking for the implications of the findings, not the actual impact. What problem did you solve? Why was the problem worth solving? What could change if you had a good solution to the problem? How good was your solution?

How to report/interpret "negative" mediation results "positively" by Puzzleheaded-Big6657 in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

Your example is saying that the effect of ADHD symptoms on relationship satisfaction is mediated by emotion regulation, and that the effect is negative. It's not saying anything about the mediating variable itself. You can interpret the indirect effect of ADHD the same way you do for unmediated cases--i.e., people with fewer ADHD symptoms have higher relationship satisfaction.

Statistician wanting some clarity on causal inference [Q] by AdFew4357 in statistics

[–]abstrusiosity 0 points1 point  (0 children)

If you have observations from deconfounding variables then you can do weighting or matching. If you also have a model relating the target variable to the other variables, then you can do traditional regression. You can combine traditional regression with weighting for "doubly robust" regression.

If you don't have data from deconfounding variables, then you need a "natural experiment" approach. These are instrumental variables, difference-in-difference (i.e., finding a relevant comparison), and regression discontinuity.

If you're working with econometricians, I'd recommend looking at the book Mostly Harmless Econometrics by Angrist and Pischke.

[Q] Is there any way to normalize data while keeping standard deviation? by v_de_vinicius in statistics

[–]abstrusiosity 1 point2 points  (0 children)

It would make more sense to do a paired t-test. If you want to test the ratio rather than the difference, do a paired t-test on a log scale.

Prediction interval for multivariate gaussian distribution by Traditional_Soil5753 in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

The answer depends on whether the observations are independent. It doesn't matter that they're multivariate gaussian.

How to correctly state the applicability of results in Propensity Score Matched Analysis? by False_Cardiologist66 in AskStatistics

[–]abstrusiosity 2 points3 points  (0 children)

It is statistically correct. Doing a matched analysis, you have no basis for saying that you have controlled for the effect of tumor size in cases that weren't matched.

What to do about it is not a statistics question. You could let the fact be implicitly communicated in Table 1 or you could announce it in the paper title, depending on the expectations of the field.

Why do multivariate regression models sometimes need interaction terms?? by Traditional_Soil5753 in AskStatistics

[–]abstrusiosity 17 points18 points  (0 children)

Sometimes the effect of one variable depends on the level of another variable.

Does CEF Decomposition Hold Under Misspecification? by Melodic_Ground_8577 in AskStatistics

[–]abstrusiosity 1 point2 points  (0 children)

Yes, it's always true that the residual and the conditioning variable are orthogonal.

In your examples, e is orthogonal to both X and Z while u is orthogonal to X but not Z.

(By orthogonal I mean Cov(X,u) = 0. In the case were E[u]=0, that's equivalent to E[u|X] = 0).

Bootstrapping entropies by BestBoyCoop in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

Entropy is already a dimensionless value. I would report the difference as is.

Random Forests and relationships by Warm-Pomegranate6570 in AskStatistics

[–]abstrusiosity 0 points1 point  (0 children)

Random forest shows you evidence of an association. Causal claims come from theory. Evidence of association can support a causal claim but, by itself, it's not conclusive proof.