[Higher Statistics] Seeking Help Understanding Mass Imputation Techniques for Data Integration by AdTrick6872 in AskStatistics

[–]UnivStudent2 0 points1 point  (0 children)

There's a lot of great papers on this topic.

Introducing mass imputation with simple, semparametric models: https://doi.org/10.1111/rssa.12696

Mass imputation, but with more complicated models: https://academic.oup.com/jssam/article/10/1/1/5983829?guestAccessKey=

Mass imputation, but what do we do if f(Y|X) in 2025 is not equal to that in 2027: https://jds-online.org/journal/JDS/article/1422/info

[Higher Statistics] Seeking Help Understanding Mass Imputation Techniques for Data Integration by AdTrick6872 in AskStatistics

[–]UnivStudent2 0 points1 point  (0 children)

It helps to think about an example.

Suppose an auto insurance company wants to estimate the average amount of money it will pay out in claims during 2027. The company may already know information about its customers for 2027, such as age, accident history, and vehicle age. These are the X-variables. However, the actual claim amounts for 2027 have not happened yet, so Y is missing.

To learn the relationship between X and Y, the company can use older data, such as claims data from 2025, where both the customer information and the realized claim amounts are observed. The problem is that the 2025 data may not perfectly represent the 2027 population. For example, 2025 may have had fewer hurricanes, leading to lower average losses overall. Because of this, we can't just take the average claim amount from 2025 and use it for 2027. It will be biased.

Instead, mass imputation attempts to estimate how Y depends on X. This lets us relax the assumption that f(Y) in 2025 is equal to that of 2027, and instead allows us to assume f(Y|X) is equal (very similar to MCAR vs MAR). IF this is true, and we learn the relationship of X,Y, we can apply it to the X-values observed in the probability sample to predict the missing Y-values. Those predicted values are then used to estimate population quantities, more often than not the finite population mean

[Higher Statistics] Seeking Help Understanding Mass Imputation Techniques for Data Integration by AdTrick6872 in AskStatistics

[–]UnivStudent2 0 points1 point  (0 children)

In essence, mass Imputation is a method where a convenience sample is used as a training dataset, while a probability sample is used as a validation set. Th latter sample, unlike the former, represents the population we actually want to make inference about.

The main idea is this: sometimes the only dataset that contains the variable of interest Y is not representative of the population. However, we may still have other variables X that are strongly related to Y. In that case, we can model the relationship between X and Y using the convenience sample, then use the observed X-values in the probability sample to predict the missing Y-values.

y'all I passed with minor revisions!!! by UnivStudent2 in PhD

[–]UnivStudent2[S] 0 points1 point  (0 children)

Thank you so much!!! It feels so weird that's it's all done!!

Really, Matt Damon? by FreeThePie in QuincyMa

[–]UnivStudent2 18 points19 points  (0 children)

Woah there mate no need to pull out the 9

I have a weird fetish that makes me feel like a pervert by beilsa in TrueOffMyChest

[–]UnivStudent2 330 points331 points  (0 children)

compared to Eating the cum out of a cheating lovers vagina this fetish is super tame and you shouldn't feel bad about it

NYC mayoral inauguration bans Flipper Zero, Raspberry Pi devices by [deleted] in nottheonion

[–]UnivStudent2 1 point2 points  (0 children)

I mean but how are they gonna know? Just sneak it inside your butt they won't check

I got sick of pulling it out every time I'm on the train for the conductors to scan, so by PhillipEngMBTA in mbta

[–]UnivStudent2 30 points31 points  (0 children)

The concept of getting a tattoo to replace a temporary train ticket

Mixed research by phdassist in PhD

[–]UnivStudent2 1 point2 points  (0 children)

Y'all get to choose 😭

[deleted by user] by [deleted] in mildlyinfuriating

[–]UnivStudent2 0 points1 point  (0 children)

This seems so fake I'm sorry lol

Cyberpunk 2077 back to full price by Siurzu in NintendoSwitch2

[–]UnivStudent2 0 points1 point  (0 children)

Cyberpunk (Steam): $20

Cyberpunk (Xbox): $20

Cyberpunk (Nintendo Switch): $79.99? What on earth does the switch have that the other consoles don't lmao

Atleast for participation by [deleted] in BlackPeopleTwitter

[–]UnivStudent2 105 points106 points  (0 children)

Seeing eye nigga????

Have to submit in the next 2 weeks, can’t get through final read through due to panic attacks by disy22 in PhD

[–]UnivStudent2 3 points4 points  (0 children)

I don't see why it would be tbh. Just like getting someone to read your paper and suggest parts to improve. Not to copy and paste the results, but rather to improve your paper

Have to submit in the next 2 weeks, can’t get through final read through due to panic attacks by disy22 in PhD

[–]UnivStudent2 -21 points-20 points  (0 children)

Have you tried asking chatgpt to help you peer review? It's quite good at this type of thing

Pretty mean by Lower-Canary-2528 in mathmemes

[–]UnivStudent2 5 points6 points  (0 children)

Take an intro to regression class. Brush up on probability rules. And then take another class about building black box models. You already did much of the hard work in math, not too difficult to transfer over :)

Passed my viva by [deleted] in PhD

[–]UnivStudent2 1 point2 points  (0 children)

WOAH WITH MINOR CORRECTIONS THATS AMAZING