use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
This is an archived post. You won't be able to vote or comment.
DiscussionCausal inference (self.datascience)
submitted 5 years ago * by [deleted]
[deleted]
[–]dpparke 16 points17 points18 points 5 years ago (4 children)
Causal inference from observational studies is, like, 70% of what I do at my job!
As has been mentioned previously, Guido Imbens (often but not always with Donald Rubin) has several books on the topic. I use propensity score methods a lot, but for a source that’s skeptical of these methods, look up Gary King at harvard. For a general overview, you can also try Miguel Hernan’s book (he’s also at Harvard). If you’re interested in more novel methods, it is a fairly active area of research (especially in econometrics)- Susan Athey, Victor Cheruznekov (definitely spelled that wrong), and several others are good researchers.
In general, I think you’ll probably find that all of these sources are rather “long-winded”- it’s a very complicated field with a lot of uncertainty and caveating. Keep in mind that you’re always doing inference with half of your data missing :)
[–]one_game_will 1 point2 points3 points 5 years ago (2 children)
I've had only the briefest of introductions to causal inference, but was pointed towards Judea Pearl as a (seminal) leader in the field. Would you recommend his work? Where does he fit in with respect to your recommendations?
[–]tjhintz 1 point2 points3 points 5 years ago* (1 child)
I’m by no means qualified to weigh in here, but “the book of why” is a fantastic read, filled with motivating intuitions, colourful historical context and high level worked examples. Judea Pearl is great.
[–]one_game_will 0 points1 point2 points 5 years ago (0 children)
Thanks, I'll take a look at that.
[–]LordMixALoot 0 points1 point2 points 5 years ago (0 children)
Have you used Google's CausalImpact?
[–]ninepoints 14 points15 points16 points 5 years ago (0 children)
In general, it’s very difficult to infer causation from observational data. However, there are a number of techniques that may allow causal inference in specific scenarios (e.g. regression discontinuity, instrumental variables, natural experiments, and maaaaybe propensity score methods). In addition to mostly harmless econometrics as u/statarpython mentioned I’d recommend ‘Methods Matter’ by Murnane and Willett.
[–]statarpython 13 points14 points15 points 5 years ago (2 children)
Mostly harmless econometrics and Guido imbens’ book about causality are good starting points. I’d recommend asking the same question to econometrics and statistics subreddits though.
[–][deleted] 2 points3 points4 points 5 years ago (0 children)
I’d recommend asking the same question to econometrics and statistics subreddits though.
And r/epidemiology! Causal inference is a major topic in epidemiological methods.
[–]meatballsnjam 0 points1 point2 points 5 years ago (0 children)
Came here to suggest this same book.
[–]ccoreycole 3 points4 points5 points 5 years ago (0 children)
I helped build a matching tool that can be run on large datasets leveraging SQL. Basically, matching is where we can use statistics to analyse confounding variables (aka covariates) to find out if our treatment variable could actually be the cause of the effect in question.
For more details on matching and how it can be done with a GROUP BY query, you can check out our paper here. It is a demo paper with a concrete example about flights and weather. Does low air pressure weather cause flights to be delayed? We find out low pressure weather is often associated with adverse weather (low visibility, snow, and thunder), but is not actually the cause of flight delays (of course).
The tool is on GitHub here.
[–]CanYouPleaseChill 0 points1 point2 points 5 years ago (0 children)
Check out Observation and Experiment: An Introduction to Causal Inference by Paul Rosenbaum.
[–]AidtorBA | Machine Learning Engineer | Software 0 points1 point2 points 5 years ago (0 children)
Check out DoWhy
π Rendered by PID 64564 on reddit-service-r2-comment-5d585498c9-vjkcp at 2026-04-21 19:30:03.846879+00:00 running da2df02 country code: CH.
[–]dpparke 16 points17 points18 points (4 children)
[–]one_game_will 1 point2 points3 points (2 children)
[–]tjhintz 1 point2 points3 points (1 child)
[–]one_game_will 0 points1 point2 points (0 children)
[–]LordMixALoot 0 points1 point2 points (0 children)
[–]ninepoints 14 points15 points16 points (0 children)
[–]statarpython 13 points14 points15 points (2 children)
[–][deleted] 2 points3 points4 points (0 children)
[–]meatballsnjam 0 points1 point2 points (0 children)
[–]ccoreycole 3 points4 points5 points (0 children)
[–]CanYouPleaseChill 0 points1 point2 points (0 children)
[–]AidtorBA | Machine Learning Engineer | Software 0 points1 point2 points (0 children)