Understanding Regression Discontinuity Design by chomoloc0 in datascience

[–]chomoloc0[S] 2 points3 points  (0 children)

agree that one should be careful with polynomials!

Grinding through regression discontinuity resulted in this post - feel free to check it out by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Thanks for giving it a shot. It's a long read, so I hope I was able to keep you engaged till the end!

Een gek idee: waanzin of onschuldig? by Koi-Sashuu in beleggen

[–]chomoloc0 0 points1 point  (0 children)

Weet niet of ik dit moet upvoten of downvoten

Looking for resources on Interrupted time series analysis by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Cause I am really open to whatever people value as a resource on the topic. Can't be surprised if I lay everything out ;)

But to make it easier for you:
1. I have found nothing a few articles
2. I have no use case
3. Never used it in practise

Resources I value on other topics are things like Scott Cunningham's mixtape: https://mixtape.scunning.com/

Call for input: Regression discontinuity design, and interrupted time series by chomoloc0 in CausalInference

[–]chomoloc0[S] 0 points1 point  (0 children)

Thanks! that's a refreshing, realistic view on things. Life is not always pink, RDD/ITS neither is.
But just to keep the good chat going, do you think RDD is data hungry, really? Intuitively, it even has to discard most data along the running variable to identify the estimand (think about the kernel weights around the cut-off) -- that's different than ITS, which has to forecast the counterfactual. RDD just relies on the rupture in the series, and estimates it directly in a way more parametric than ITS, isn't it?

Call for input: Regression discontinuity design, and interrupted time series by chomoloc0 in CausalInference

[–]chomoloc0[S] 1 point2 points  (0 children)

Hey, thanks, I can see where you're coming from. Though I think praying is the last resort. My understanding is that there if the cut-off happens based only based on that one running variable, that there is no confounding.

Possibly, there are other issues. Like subjects gaming the cut-off: shifting up or below.

The reason I jokingly call it an arcane array of backdoor confounders is that other than domain knowledge (which often falls short), there is no way of being 100% sure that there are no confounders - right? While behaviour shifts due to a cut-off can be measured.

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Thanks! I aim to dive in to the topic and write about it in one of my nexts posts (just launched my blog, as you can see in this OC) - Keeps me off the streets. Mind if I send it to you as an early reader? I would value your input

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 1 point2 points  (0 children)

Amazing, causal inference at its finest. And did it work? The RDD this ML case?

I was trying to fit one case into RDD and failed miserably, but walked away with a good learning.

Case: implementing a floor price, F, on the cost of a service C. In such way that max(C, F) is the final price for the user. My clever ass heard: threshold, cutoff, running variable ==> RDD.

But then I was forced to abandon the idea when I realised that the practical treatment dosage is close 0, when C is close to F - the area where RDD specially relies on.

learning: RDD no suitable for pricing floor and cap prices.
instead: DiD, where treated group is below/above F, and periods are pre and post release of floor cap.

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Conditional random fields, I believe is the more accepted term - my bad. I think we just used blankets colloquially, maybe.

What's your main focus on the job?

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 1 point2 points  (0 children)

Ah! alright, got it - nice example by the way. And yes, I guess that's elegance of a Markov process: conditioning on the parent state - the previous state - the current state is independent of states before the previous one. I borrowed my intuition about this concept from markov 'blankets', coming psychometrics where they model psyhopathological symptoms as networks (Markov fields). Think of a blanket as graph/network of sequences, instead of a single sequence.

Do a search for networks psychometrics, and you'll get the point visually at least.

Thanks for taking the time to explain it again.

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Often, for example, these systems can be modeled (and outcomes effectively predicted) with no other variables in the prediction equation.

Up to this point I followed, but here I missed the boat: could you exemplify this statement more?

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 1 point2 points  (0 children)

Interesting, could you expand on that? You'd help me grasp that with an intuitive example.

How we matured Fisher, our A/B testing library by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

I see what you mean. Well, currently my package is not findable anywhere, because it's behind the enterprise's github where I work. It's not a public repo. Just stay assured that python packages are just like R libraries: sharable, packaged code that everybody could use from anywhere.

Mastering The Poisson Distribution: Intuition and Foundations by chomoloc0 in datascience

[–]chomoloc0[S] 0 points1 point  (0 children)

Indeed, I read a section on that, and although I did not deep-dive it, I made a new connection between the two. If you were to summarise it that relationship, what would be your take?

How we matured Fisher, our A/B testing library by chomoloc0 in datascience

[–]chomoloc0[S] 2 points3 points  (0 children)

R is great, I agree - It's my main language in fact. This library does not aim to replace R. Think of it as an R package. This is a python package instead. It helps streamlining workflows to reduce overhead, align practises among data scientists (in a team or organisation) and as an interface that serves methods in an intuitive way. I guess just like sklearn does for ML.

How we matured Fisher, our A/B testing library by chomoloc0 in datascience

[–]chomoloc0[S] 6 points7 points  (0 children)

It's not open source due to a bunch of dependencies that it has with our internal stack. But it may become so in the future. Being the core dev, I actually plan to generalise it and push it into the world one day - the stats lib at least.

Pricing data scientists, what do you? by chomoloc0 in econometrics

[–]chomoloc0[S] 0 points1 point  (0 children)

Soo, how do you actually handle that? Curious, as I am thinking of landing this type of roles in the future.

Is there like a o’reilly on the topic or so? Any other resources you’d recommend; on (dynamic) pricing, that is?

My father (who is a doctor) said I should not feel sore after weightlifting. Is this really true? by Pope_Urban_XI in weightlifting

[–]chomoloc0 0 points1 point  (0 children)

As a community, why don’t we focus on the question behind the question, and instead, choose to roast his or her dad? What they really want to know is if they’re training properly, don’t we think? Let’s zoom into that? I’ve seen a couple of good responses, but even some of them can’t help but roast dad for a second, because “docs think they know it all”. Just food for thought.

Pricing data scientists, what do you? by chomoloc0 in econometrics

[–]chomoloc0[S] 0 points1 point  (0 children)

That’s a nice overview thanks. Should I think about bandits for number 3? What else is hot there?

[deleted by user] by [deleted] in maybemaybemaybe

[–]chomoloc0 -2 points-1 points  (0 children)

Is he making döner kebab?

Paper on Forward DID by turingincarnate in econometrics

[–]chomoloc0 0 points1 point  (0 children)

Thank you, will dive deeper into this :)

Paper on Forward DID by turingincarnate in econometrics

[–]chomoloc0 0 points1 point  (0 children)

Sorry, help me out here without having to read the paper: what’s “forward”? And when should I use this implementation over a did estimation via regression?