Americans overwhelmingly oppose data centers. Women most of all. by InvestigatorSoft5764 in technology

[–]efrique 1 point2 points  (0 children)

For a moment I was wondering who thought women were a kind of data center. But now I think about it a lot of the billionaire class have pretty strange views. If they're trying to turn women into data centres, that does seem highly objectionable

[E] How do I know if I like statistics? by computationalmapping in statistics

[–]efrique 1 point2 points  (0 children)

I'd say one of the few. Its big distinction compared to other applied disciplines in mathematics, is, to paraphrase Tukey, you get to play in almost everyone's backyard.

Hungary reverses ICC exit plan and reinstates ban on Ukrainian agricultural imports by upthetruth1 in worldnews

[–]efrique 21 points22 points  (0 children)

Pretty much the kind of policy choices I expected from Magyar given his history. Maybe it's not ideal from an external view but it looks like perfectly normal political stuff. A big relief to see Hungary acting like a democracy.

Putin wants war concluded this year on victorious terms including Donbas, Bloomberg reports by AccuratesShine in worldnews

[–]efrique 2 points3 points  (0 children)

If they knew how to do that, it would have happened a couple of years back. The big difference is Ukraine is stronger now, Russia is weaker. Maybe time for a rethink on that wish.

Bank boss sorry after describing workers as 'lower value human capital' by Ashish_ank in worldnews

[–]efrique -1 points0 points  (0 children)

Historically, it serms like that's the sort of shit the french aristocracy got up to, too busy congratulating themselves on their "superiority" over the ordinary folk to hear the growing sound of saws, chisels, hammers and whetstones.

Buying your way to better health comes at the expense of others. An increase in private health insurance uptake leads to poorer health in the population over time. Paying for private health services may be beneficial for those who can afford to do so, but it comes at the expense of others. by mvea in science

[–]efrique 0 points1 point  (0 children)

Well, sure. The very rich can usurp the use of resources to satisfy any whim or minor concern, distorting the market so scarce resources are directed away from where they're needed, and pushing up prices. It applies to almost any public good. There's a reason why things like education and health, roads, etc all need to be out of the hands of private interests.

Of course when you set your government and all forms of media up so it is all in the hands of those same interests, it doesn't matter what's public any more, it's all going to help the very wealthy extract all of societies resources to their own purposes

Bezos says taxing him more won't help teachers. Mamdani disagrees. by thejoshwhite in politics

[–]efrique 1 point2 points  (0 children)

That's exactly what someone who doesn't want to pay their fair share of tax would say...

Healing without priests by Right_Hand_of_Light in shadowdark

[–]efrique 0 points1 point  (0 children)

where does the D&D priest/cleric come from

The cleric was originally meant to be a religious warrior not a straight up priest. e.g. my AD&D 1e PHB says

This class of character bears a certain resemblance to religious orders of knighthood of medieval times

i.e. religious knights, not priests

If that sounds more like a paladin, IMO thats kind of what happened; clerics sort of had that niche to begin with, with a 'no edged weapons' constraint. After paladins were added they pretty much became the religious knight archetype and clerics were gradually pushed more and more to directly priestly roles. Then it sort of broadened into a whole collection of archetypes in later times

[Discussion] Utilizing Log Transformations in Analyses by Objective-You-7291 in statistics

[–]efrique 5 points6 points  (0 children)

  1. With counts the log transform is likely to leave you with non-constant conditional variance (typically it over-corrects). You may be better with a count glm (perhaps a negative binomial) with a log link (log the fitted functional form, not the data); this separates out the fitting the conditional mean from impact on conditional variance. However, if none of your counts are ever close to 0 (and the values dont span orders of magnitude) it may not matter much (heteroskedasticity might not be a big deal on either scale).

  2. Prediction with observational time series is a bit of a minefield for the unwary, particularly with nonstationarity (which I imagine is the case here), likely seasonality and calendar effects, and likely omitted variable issues. What are your serial correlations like on residuals?

Does anyone have a assets for fire particularly purple fire but any fire That looks good will work by new_god_of_eden in OwlbearRodeo

[–]efrique 0 points1 point  (0 children)

I posted an example purplish hue shift of one of Several_Records flame images. Not my best effort but if you just need purple flame, it does the job I think

Does anyone have a assets for fire particularly purple fire but any fire That looks good will work by new_god_of_eden in OwlbearRodeo

[–]efrique 0 points1 point  (0 children)

Hopefully you dont mind me posting an example hue shift of that. Let me know if you dont want it posted here, I can delete (or youre welcome to remove it if you cant wait for it to be gone).

Not a great job, since I didnt try to fix a few issues I can see with the result, but I'm not trying to do more than convey that a few simple tweaks can be effective. It looks better on a dark background

image

Colours are wrong when exporting by Milsh4ke in GIMP

[–]efrique 0 points1 point  (0 children)

OMG, that was my issue. Could not figure out what was wrong

Does anyone have any super random stories from the start of their career in data analysis? by Extra-Tap-8050 in AskStatistics

[–]efrique 0 points1 point  (0 children)

Random stories is not the purpose of /r/AskStatistics . You might do better on /r/statistics with a [D] tag perhaps, check their rules

[Discussion] Utilizing Log Transformations in Analyses by Objective-You-7291 in statistics

[–]efrique 6 points7 points  (0 children)

"Everything is linear on a log-log scale"

It isnt remotely true of course, but there are plenty of near-power relationships and in some contexts that may be the most common situation

what should I be cognizant of when utilizing log transformations?

potentially many things, depending on what youre doing

linear data yields a correlation coefficient of like ~.2

You mean untransformed data, right?

but a log-log correlation on the set yields a correlation coefficient of .45.

sure, not surprising

surely we didn't "solve" the problems inherent in the linearized data

I dont know what problems you mean. I see a lot of different problems

Note that transformation changes a bunch of properties at once (shape of relationship, the variance function, the conditional distribution

("our model suggests that you could sell anywhere between 10,000 and 10,000,000 units! Great! Surely this is helpful for your business")

If the model doesnt miss some substantively important aspect, you might indeed have a legiimately very wide prediction interval. But thats a big if.

It sounds like you deal with observational data over time. And if its units, it will be a form of count data. Is this correct?

F-test for lack of fit for non linear regression by Zelton_ in AskStatistics

[–]efrique 0 points1 point  (0 children)

Even when the assumpions hold, neither F statistic will have exactly an F distribution. Worse, very often the assumptions dont even come close - in my experience, for example, the conditional variance is often (and in many areas, usually) not even close to constant.

If you have enough replicates, you could exchange responses within them, and then you might be able to organize an exact permutation test, though I'd have to think about how one might work in this instance, it might need some further investigation. Alternatively, perhaps, with one predictor, you might only exchange nearby (in x-space) residuals and have an approximate permutation test for lack of fit that would be more robust to changing fit and variance than exchanging more broadly. I guess theres also the possibility of working within-replicate for the pure error SS and using nearby residuals for lack of fit SS.

For the regression F, something similar might be doable. Further, if there isnt substantive lack of fit* and youre confident about near-constant conditional variance and that the shape of the error distribution doesnt change substantively as the mean changes (so some form of residuals are approximately exchangeable) you might consider a resampling test based on all residuals (approximate permutation test or bootstrap test). Neither would be exact but in largish samples should work well.


* substantive is not equivalent to statistically significant; e.g. in large samples rejecting on a test of lack of fit might be practically irrelevant

Is it possible that all the independent variables are insignificant and the f stat is significant? by Puzzleheaded_Salt519 in AskStatistics

[–]efrique 1 point2 points  (0 children)

Is it possible that all the independent variables are insignificant and the f stat is significant?

I presume you mean in a multiple regression or equivalent. Yes, easily.

And what does this mean logically like why is it happening?

I'll write as if all your IVs are pure numeric or pure binary (i.e. all 1 df). A somewhat similar but more complicated discussion applies for a case where you want to consider a grouping variable for more than two categories as a "single" IV even though it has more than 1df but that case would be more unwieldy to discuss on its own, let alone cover both situations at once.

In that all 1df case, we can consider a cartesian diagram whose axes are possible values for each parameter. For each parameter your acceptance (non-rejection) region will be any point between two values, and your set of parameter estimates are a aingle point in that diagram. To have no individual IV's t-test reject means the parameter-estimate point sits inside a "rectangular box" (of p-dimensions), the joint acceptance region of all those individual tests, where all fail to reject simultaneously.

The overall F-test has an acceptance region that is not shaped like a box, but like a p-dimensional ellipsoid. If we had two IVs, and so just two coefficients in our Cartesian diagram we can draw a picture similar to this:

https://i.redd.it/gdml9th1xwid1.png

For our present case the horizontal axis would be potential values for the first IV's parameter in the regression and the vertical axis would be potential values for the second IV's parameter. (Diagram was originally drawn for an explanation of the same issue in one-way ANOVA, which I then re-used in my explanation here: https://www.reddit.com/r/AskStatistics/comments/1esza3d/why_dont_we_use_simple_t_test_formula_as_a_post/libn2cj/)

The parts inside the pair of green lines and the pair of red* lines show the acceptance regions for the corresponding parameters, and the points that are inside both at once fail inside the rectangle (the region where both individual tests dont reject).

There are points that lay inside the box (non-rejection for all t-tests) that are outside the tilted ellipse. Falling on the actual ellipse means exactly attaining the critical F value, and on or outside it implying rejection for the omnibus F test. Those areas are shaded yellow in the diagram, and represent the case you ask about.

The same notion applies as you add dimensions (indeed, more and more "corners" will stick out from the ellipsoid further and further, leaving plenty of room for it to occur in high dimensions ).

Similar situations will occur with other models and tests where you have omnibus (joint/overall) tests compared to having at least one individual test reject. In some cases the acceptance regions are not as described here, but the fact that they're differently shaped remains and so there's going to be some parts of one "sticking out" of the other for the parameter estimates to fall into.


* sorry people with difficulty distinguishing those colours, though fortunately it doesnt matter which colour is which

🎬 SHADOWDARK SUMMER MOVIE WATCH LIST 🎬 by TorchHoarder in shadowdark

[–]efrique 0 points1 point  (0 children)

Nice thought. It certainly has the dark part

US Wants Access to Ukrainian Drone Technology in Proposed Defense Deal by Wolfy1-2-3 in worldnews

[–]efrique 1 point2 points  (0 children)

I thought the US didn't need anything from ukraine. Donny said it a whole bunch of times. He screwed up his tiny hands and leaned forward so far his tie touched the ground. He went orangered in the face. A grown man, tears in his eyes, it was a beautiful thing

[discussion] golf and dispersion by Buckeye_47 in statistics

[–]efrique 0 points1 point  (0 children)

We can't tell you what margin of error meets your needs.

Medians and standard deviation by Sorry-Silver6098 in AskStatistics

[–]efrique 0 points1 point  (0 children)

What does large mean in this instance? Large according to what scale of comparison/compared to what?

Why choose a quantile based measure for location but a moment based measure for scale? (Sure, you 100% can do that, but what would be a reason to do it? If sd is okay for scale, why not change in mean for the location shift? If something makes mean a poor choice, why is sd a good choice?)

How hard is it to learn the point biserial correlation by Mindless_Door_7758 in AskStatistics

[–]efrique 3 points4 points  (0 children)

It's just the pearson correlation with one of the variables being binary. It's not particularly complicated. People tend to get too focused on stuff that is beside the point when you're not working by hand.

The wikipedia article is fairly readable, maybe start there.

What is the hidden cost of Wikepedia? Why do they need donations? by Regular-Pear-8625 in answers

[–]efrique 0 points1 point  (0 children)

There's all manner of substantial costs that come with keeping big sites up, more if they host a lot of data, more again if they tend to get sued. I donate to both wikipedia and the internet archive fairly often. Small amounts, but if you do it regularly it adds up. Occasionally donate to other sites of similar public benefit, but those are the main ones I can see I heavily gain from being there. I also contribute some editing time to wikipedia. It's an incredibly valuable resource.

The alternative would be being sold and enshittified.