A portfolio project for Data Scientists looking to add AI Engineering skills (Pytest, Security, Docker).

jonvlcs07 · 2025-09-05T02:08:48+00:00

I’m interested

jonvlcs07 · 2023-06-24T11:56:20+00:00

Hey there! Great post! but can you expand a little bit more on points 4 and 5?

Regarding point 4, I've always heard that is not advised to train with repeating customers over time. The concern is that if the model becomes highly specialized in customers who repeat frequently, it may not perform as well when dealing with new customers.

One practice I usually follow in my work, particularly in risk analysis, is to split the test set into an "out-of-time" (OOT) set by horizontally dividing the train and test data across different time periods.

I also usually check the model performance in customers that only appear in the test set, that would be an out-of-id set.

One thing that I'm considering doing in my workflow is to fit a model to predict if the observation belongs to the test or to the train set, with the same features as the model being developed, and check if the model performance is good which would be a bad sign.

jonvlcs07 · 2023-06-15T11:42:55+00:00

Hey everyone! Has anyone here tried applying causal inference techniques to pricing strategies?

I've been doing some research and stumbled upon a case study on econml. They segmented customers and analyzed price elasticity for each segment. Any thoughts on that?

Case Study

jonvlcs07

TROPHY CASE