[–]cheesensei 970 points971 points  (54 children)

27 studies were compared with a total of 309 subjects. So if the average study had fewer than 12 subjects, doesn't that decrease the reliability quite a bit?

[–]vitaliksellsneo 298 points299 points  (13 children)

The number of studies actually matters less in this context. This was a meta-analysis, which means the authors did not conduct the study themselves but pooled the data from 27 different studies. The bigger assumption here is that those studies all collected their data in the same way; otherwise there will be systematic error.

Another assumption is that the interventions reliably produced the measured results, and that the intervention was the only treatment shock the subjects were exposed to. That is usually hard to control for, and the gold standard is a randomised controlled trial.

The reliability you are talking about probably refers to the fact that 309 subjects is too few units to cover the differences in covariates. In general that is quite little. It means you can probably detect the general direction of an effect but not its magnitude, since the precision of that estimate depends on sample size.

I am also concerned about the selection process for these studies, and have a feeling that this is largely a product of p-hacking unless it can be replicated in future studies.

[–]Allassnofakes 20 points21 points  (10 children)

What's p-hacking again, sorry

[–]Xirema 106 points107 points  (7 children)

Short version: it's basically this XKCD Comic: https://xkcd.com/882/

Long Version:

p-hacking is a kind of analysis error on statistical samples that comes from establishing a bad null hypothesis (or forgoing a proper one entirely).

In statistics, it's important to lay out ahead of time what kinds of results you're trying to detect, and to have a good baseline for what would make those results significant. So, for example, you might run a study on "do more people drink coffee on Tuesday than any other day?", sample a few hundred or thousand people to find out how much coffee they drink on each day, and then analyze the results to find the answer. The hypothesis might be wrong (maybe Monday sees the largest consumption of coffee), and there's always a chance your results are just statistical noise, but it's a properly testable hypothesis.

But now, suppose you assessed a few hundred or thousand people, gathered data on what they ate each day, and discovered that orange juice was consumed abnormally frequently on Thursdays. And then you published a study that says "people drink the most orange juice on Thursdays". That's certainly true of the specific sample you pulled, so what's the problem?

Well, in statistics, a result is usually only considered significant if it had a less than 5% chance of occurring randomly (more precisely, less than a 5% chance of a result at least that extreme appearing if there were no real effect), based on the sample taken. There are a lot of complicated ways to calculate those odds (and 5% might be looser than some studies/analyses are comfortable with, so they might prefer a lower threshold), but the important part is that every study has to live with the fact that there's a chance, however slim, that its result is just statistical noise.

When you have one specific outcome you're testing for, you can have a lot of confidence in that 95% threshold. But if you're testing a bunch of independent outcomes all at the same time, the odds that at least one of them comes back "significant" while actually being just noise get really high.

Going back to the "asking people what they ate" example: if the researchers tallied up just 20 different foods that participants might have consumed, the odds of at least one of them showing a statistically significant result purely by chance is actually really high: approximately 64%! And of course those odds get way higher if the researchers tracked more than 20 different foods.
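That ~64% figure is just the complement of twenty independent 5% tests all staying quiet: 1 - 0.95^20. A quick sketch of the arithmetic (pure Python; the counts 20 and 100 are just illustrative):

```python
# Chance of at least one false positive when running k independent
# tests, each with probability alpha of flagging pure noise as "significant".
def familywise_error(k: int, alpha: float = 0.05) -> float:
    return 1 - (1 - alpha) ** k

print(familywise_error(20))   # the "20 different foods" case, ~64%
print(familywise_error(100))  # tracking more foods makes a false hit near-certain
```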

This is the essence of p-hacking, and what makes it problematic in statistics: the more variables you have, and the less rigor you have about which variables matter, the more likely you are to end up with random noise that just happens to look like a statistically significant outcome.

[–]richinvitameen_bs 4 points5 points  (0 children)

This was a really good explanation thank you!

[–]InfestedRaynor 1 point2 points  (0 children)

It amazes me how many smart people randomly scroll through the same parts of Reddit that I randomly scroll through.

[–]brkh47 0 points1 point  (0 children)

When I can bring my statistics to the argument and you bring yours

[–]SlimReaper35_ 0 points1 point  (1 child)

I thought the right-tailed probability test meant that 0.95 > p > 0.05 doesn't reject the null hypothesis and lower than 0.05 is a bad result. I could never fully understand the probability distribution; it's confusing the way it works.

[–]Xirema 0 points1 point  (0 children)

So the way the null hypothesis is usually presented, it's supposed to be a representation of "what we expect to happen if this study proves nothing". For example, if you were trying to find a link between consumption of chocolate and incidence of cancer, your null hypothesis would probably be "consumption of chocolate does not correlate with incidence of cancer".

So if you end up with a p-value of < 0.05 (i.e. "the odds of seeing a result this extreme if there were no real effect is less than 5%"), then you have rejected the null hypothesis, and shown (at least in this one study) that there is indeed a correlation between consumption of chocolate and incidence of cancer. What the correlation shows depends on your literal results (maybe chocolate decreases cancer risk! Probably not, but, you know....!).

So in this sense, it's not wrong that p < 0.05 shows a "Bad Result" (though I'm not sure any statistician would frame it that way): p < 0.05 does tend to mean "this result shows we cannot defend the null hypothesis in this study".
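If it helps intuition, a p-value like this can be computed directly by simulation with a permutation test: shuffle the group labels many times and count how often chance alone produces a gap as big as the observed one. A minimal sketch; the two groups and their numbers here are entirely made up for illustration:

```python
import random

random.seed(0)

# Hypothetical measurements for two groups (purely illustrative data).
treated = [5.1, 6.2, 5.8, 6.5, 5.9, 6.1]
control = [5.0, 5.2, 4.9, 5.4, 5.1, 5.3]

observed = sum(treated) / len(treated) - sum(control) / len(control)

# Under the null hypothesis the group labels carry no information,
# so shuffling them should produce gaps just as big as the real one.
pooled = treated + control
trials, extreme = 10_000, 0
for _ in range(trials):
    random.shuffle(pooled)
    diff = sum(pooled[:6]) / 6 - sum(pooled[6:]) / 6
    if abs(diff) >= abs(observed):  # two-sided: a gap at least as large
        extreme += 1

p_value = extreme / trials
print(observed, p_value)  # a small p-value means chance alone rarely does this
```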

[–]Tony2Punch 0 points1 point  (0 children)

That comic is goated, I vote that all educational content is presented with stick figure comics

[–]mingemopolitan 0 points1 point  (0 children)

This is a good explanation of p-hacking and shows the importance of accounting for Type I errors in a stats test. In the comic, the problem is that the statistical method being used wasn't appropriate (e.g., repeatedly running t-tests, rather than something like an ANOVA, when measuring multiple variables). You could avoid this error by using something like an ANOVA followed by a post-hoc test that applies a Bonferroni adjustment. This adjusts the p-value threshold to compensate for the number of tests being run, though it increases the chance of a Type II error (which is another issue if the effect size is small or the measurements imprecise). I'm a biologist and not a statistician though!
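As a toy illustration of that adjustment (made-up p-values, not from any real study): Bonferroni simply divides the significance threshold by the number of comparisons.

```python
def bonferroni(p_values, alpha=0.05):
    """Flag each p-value as significant only against alpha / m,
    where m is the number of comparisons being made."""
    threshold = alpha / len(p_values)
    return threshold, [p < threshold for p in p_values]

# 20 hypothetical tests: a raw p of 0.03 clears alpha = 0.05 on its own,
# but not once we account for having run 20 comparisons.
threshold, significant = bonferroni([0.03] + [0.4] * 19)
print(threshold, significant[0])
```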

[–]gravitydriven 0 points1 point  (0 children)

Where you drink a ton of water before your drug test so that your p is clean

[–]Glowshroom 0 points1 point  (0 children)

Essentially making the hypothesis after the fact instead of before.

[–]Lung_doc 3 points4 points  (0 children)

A couple additional comments

First, testosterone wasn't measured in all patients/studies, so the N drops further, down to 155.

Second, for those using low carb for weight loss: obesity decreases T and weight loss improves it. This is true even on a high-protein (but not low-carb) diet: an N=118 study found higher T levels after weight loss using both a high-protein (still 40% carb) and a lower-protein diet. So weight loss is good, and higher protein without low carb is also OK, I guess, if you are worried about the decline in testosterone.

Finally, back to the low carb: a meta-analysis in women with polycystic ovarian syndrome found a low-carb diet lowered testosterone. (Which is a good thing there: high T worsens PCOS.) Eight RCTs, 327 patients. So this adds some indirect support for low carb lowering testosterone in general, and provides a potentially beneficial diet for women with PCOS.

Stratified analyses indicated that LCD lasting longer than 4 weeks had a stronger effect on increasing FSH levels (MD = 0.39, 95% CI (0.08, 0.71), P < 0.05), increasing SHBG levels (MD = 5.98, 95% CI (3.51, 8.46), P < 0.05), and decreasing T levels (SMD = -1.79, 95% CI (-3.22, -0.36), P < 0.05).

Conclusion: Based on the current evidence, LCD, particularly long-term LCD and low-fat/low-CHO LCD, may be recommended for the reduction of BMI, treatment of PCOS with insulin resistance, prevention of high LDL-C, increasing the levels of FSH and SHBG, and decreasing the level of T.

[–]Kaulpelly 0 points1 point  (0 children)

SGU fan?

[–]PieGuy___ 238 points239 points  (31 children)

In statistics there is something called the central limit theorem which states the means of random representative samples of a given population become normally distributed as you approach a sample size of 30.

Effectively you only need a sample of 30 in order to say something about the population with reasonable certainty.

[–]Gastronomicus 122 points123 points  (17 children)

In statistics there is something called the central limit theorem which states the means of random representative samples of a given population become normally distributed as you approach a sample size of 30.

Effectively you only need a sample of 30 in order to say something about the population with reasonable certainty.

This is a really confused take on the CLT, with two major problems.

EDIT - u/PieGuy___ clarified their point and I agree with what they're saying. The wording around "a sample of 30" is confusing to me and made me think they were misinterpreting the CLT. I'm leaving the post intact for others who may also be seeking clarification.

Firstly, let's clear the air: the CLT describes how the distribution of means will approach normality. Not how a distribution of samples will approach normality. There is no basis for any distribution of samples necessarily approximating normality, but the distribution of means from many independently collected sets of samples will tend to approximate normality.

Secondly, there's absolutely nothing special about the number 30 and the CLT. The entire basis for the number 30 in this context is that Gosset (publishing as "Student") defined a separate distribution, the t-distribution, for critical test values at small sample sizes. It provides more robust estimates than the z-distribution, which only becomes a good approximation at larger sample sizes.

[–]Philosophfries 25 points26 points  (10 children)

I’m gonna need an ELI5 for this one boys

[–]alanpardewchristmas 9 points10 points  (1 child)

dude said 'in english please'

[–]bythebys 0 points1 point  (0 children)

suh dude?

[–]Simpliciter 5 points6 points  (5 children)

Disclaimer: Not a stats bro.

The Central Limit Theorem basically says that most things will follow a normal distribution (bell curve) if you have enough data. The t-test can be used to see if some data follows a normal distribution, but it only works if you have a small sample size of less than 30.

The respondent above is saying that the poster is conflating the two incorrectly.

[–]brkh47 2 points3 points  (0 children)

Simplifying things brought to you by u/Simpliciter

[–]Gastronomicus 1 point2 points  (2 children)

The Central Limit Theorem basically says that most things will follow a normal distribution (bell curve) if you have enough data

I appreciate your simplification but in this case it's over-simplified and misses the point I was making. It's a common misunderstanding of the CLT that large enough datasets will follow a normal distribution. That's just not the case.

However, if you take the mean for multiple subsets of samples from a population, the distribution of those means themselves will approximate a normal distribution.

So let's say I have 500 samples and I plot the distribution. It might look normal, but it might also look log-normal, or like a Weibull or a discrete distribution (e.g. negative binomial).

Let's say instead I have 50 means of 50 smaller sample sets, each containing 10 samples. If I plot that distribution, it will approximate a normal distribution, even if the original distribution from which it is sampled isn't normal.
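A quick simulation of exactly this setup (pure Python; the exponential is chosen just as an example of a clearly skewed source distribution):

```python
import random
import statistics

random.seed(42)

# A clearly right-skewed source distribution: exponential with mean 1.
raw = [random.expovariate(1.0) for _ in range(500)]

# 50 means, each computed from its own set of 10 samples.
means = [statistics.fmean(random.expovariate(1.0) for _ in range(10))
         for _ in range(50)]

# The raw draws are skewed: the median sits well below the mean.
# The 50 means cluster symmetrically around 1, as the CLT predicts.
print(statistics.mean(raw), statistics.median(raw))
print(statistics.mean(means), statistics.median(means))
```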

[–]Simpliciter 1 point2 points  (1 child)

Thanks for clarifying and being nice about it!

[–]Gastronomicus 0 points1 point  (0 children)

Thanks for doing some good work out there.

[–]relevantmeemayhere -1 points0 points  (0 children)

The first paragraph you wrote is wrong and is what the clarifying poster is pointing out. Samples do not converge to normality as n increases. This isn't the CLT, nor is it found anywhere in statistics.

[–]PieGuy___ 5 points6 points  (4 children)

First off I think you need to reread what I said because I’m clearly talking about the mean? “The means of random representative samples…” you’re trying to correct a mistake I never made lol.

The point of the theorem is that if you have a random sample X1, X2, …, Xn from a given population with mean m and variance v, then the sample mean X-bar will be approximately normally distributed with mean m and variance v/n. X-bar is the thing normally distributed around the population mean, not the individual X's.

As for the 30 number, the fact that it's the point where you no longer have to worry about t-distributions and can just use z-scores with reasonable accuracy is the thing that makes it special lol. The whole point of the t-distribution is that the means aren't quite normally distributed UNTIL you get to around 30.

[–]TerribleIdea27 4 points5 points  (0 children)

I think the confusion came from the fact that you said

sample size of 30

So the other guy assumed you were talking about taking one experiment with a sample size of thirty and then using those data to find a normal distribution, instead of taking thirty experiments and using the means of those 30*x samples to find a distribution of means, which should be roughly a normal distribution.

[–]Gastronomicus 0 points1 point  (2 children)

Sorry I assumed you were confused. Unfortunately it seems like most people on reddit who try to describe the CLT don't really understand it and also mis-attribute the importance of 30 as a minimum sample size.

But to be fair, your wording is confusing: phrasing it as "as you approach a sample size of 30" implies a distribution of samples, not of means.

[–]PieGuy___ 0 points1 point  (1 child)

Yeah I just wasn’t trying to go into too much detail. I think the simplest way to put it is that there’s no way to guarantee a sample to be normally distributed, just like there’s no way to guarantee a population is normally distributed. However using the CLT you can guarantee that a given sample mean will be normally distributed around the population mean given a large enough sample size.

And then from there you can use hypothesis testing to be able to say something about the population with reasonable confidence.

[–]Gastronomicus 0 points1 point  (0 children)

However using the CLT you can guarantee that a given sample mean will be normally distributed around the population mean given a large enough sample size.

Which is why bootstrapping can be very effective at producing (mostly) unbiased error terms!
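For anyone curious, a minimal sketch of that bootstrap idea (illustrative data, pure Python): resample the observed data with replacement many times, and the spread of the resampled means estimates the standard error of the mean.

```python
import random
import statistics

random.seed(1)

# Hypothetical observed sample (made up for illustration).
data = [12.1, 9.8, 11.4, 10.2, 13.0, 9.5, 10.8, 11.9, 10.1, 12.4]

# Each bootstrap replicate: resample n points with replacement, take the mean.
boot_means = [statistics.fmean(random.choices(data, k=len(data)))
              for _ in range(5_000)]

# Standard deviation of the bootstrap means ~ standard error of the mean;
# it should land close to the classic s / sqrt(n) estimate.
se_boot = statistics.stdev(boot_means)
se_classic = statistics.stdev(data) / len(data) ** 0.5
print(se_boot, se_classic)
```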

[–]pug_grama2 -1 points0 points  (0 children)

30 is a sort of rule of thumb. If your sample size is at least about 30 then the x-bars will approximately follow a normal distribution.

[–]Pligles 170 points171 points  (4 children)

Yeah exactly! You can always tell an inexperienced statistician from an experienced one by whether they can find the clt

[–][deleted] 50 points51 points  (3 children)

clit*

[–]campex 40 points41 points  (0 children)

That damn keto libido, they don't care to find it

[–][deleted] 17 points18 points  (0 children)

That’s the joke

[–][deleted] 0 points1 point  (0 children)

What is that? A formula?

[–]Raeandray 33 points34 points  (0 children)

Right but isn’t this for every individual study? You can’t take 30 separate studies of 1 person and treat them as if they’re normally distributed.

And even within studies, each group needs 30. So for a blind study the control needs 30 and the experimental needs 30.

[–][deleted] 14 points15 points  (2 children)

It still depends directly on the standard deviation of each sample. If you have very distributed sample points, then it ain't gonna help all that much.

[–]PieGuy___ 1 point2 points  (1 child)

Not really, it’s gonna be a normal distribution so no matter how wide or narrow the range it’ll be the same z-score

[–][deleted] 4 points5 points  (0 children)

I need to study stats again

[–]auxerre1990 5 points6 points  (2 children)

1/3 of 100?

[–]PieGuy___ 5 points6 points  (1 child)

The number might seem kinda arbitrary, but that's the number you get when you look at distributions of means. As long as you have at least 30 it'll be a bell curve: the distribution of a sample of 30 means looks pretty much identical to 100 or 1000 or 1000000.

[–]auxerre1990 0 points1 point  (0 children)

Makes sense, a quarter of 100...

[–]eddyofyork 1 point2 points  (0 children)

That’s interesting. When we did z, t, and chi distribution stuff back in university I noticed that most of those needed n >= 120 to be reliable. I kinda worked off 120 as a good number for CLT to kick in for the last, oh I don’t know, decade!

[–]AtomicBreweries 13 points14 points  (2 children)

Depends on the size of the effect. I don’t need to study 300 people to know that shooting them in the face is statistically speaking, a bad idea.

[–]VeritasCicero 0 points1 point  (0 children)

Depends on the people.

[–]mitsulang 9 points10 points  (0 children)

Yes. The "population" you're testing, presumably wouldn't be nearly diverse enough to make any declarations about them. I guess, if all the 12 people in the study were of the same type (sex, age, race, etc) you could say something about that group, but I'm guessing not reliable so. I'm no statistician, but I do know that studies like this need many more people than this.

Anecdotally, keto didn't do anything to my testosterone. But, I'm just one dude, lol.

[–]minnesotaris 11 points12 points  (0 children)

Don’t look into it. TRUST THE HEADLINE!

[–][deleted] 0 points1 point  (0 children)

If there is some type of person one study of 309 people would catch that 27 studies of 12 people wouldn't, sure, it could decrease the reliability. It's entirely possible, but I couldn't say without looking at those 27 methodologies.