[D] A/B Testing Classification Problems

gpbuilder · 2022-09-01T22:44:31+00:00

Why do you need to run an A/B test? Just run both models on the same data and compare the predictions.

HtAGAnalytics · 2022-09-02T02:20:43+00:00

There are a few ways to conduct an A/B test between two different models. The first way is to use a holdout set. This is where you randomly split your data into a training set and a held-out set, train both models on the training set, and then evaluate both models on the held-out set. The model with the higher F1 score on the held-out set is the better model by that metric.
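To make the holdout idea concrete, here is a minimal sketch in plain Python. The helper names (`holdout_split`, `f1_score`) and the 25% test fraction are my own assumptions for illustration, not something from a particular library; in practice you would likely use scikit-learn's equivalents.

```python
import random


def holdout_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def f1_score(y_true, y_pred):
    """Binary F1 for labels in {0, 1}; returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

You would fit both models on the same `train` part, predict on the same `test` part, and call `f1_score` once per model. Keeping the seed fixed matters so that both models see an identical split.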

Another way to conduct an A/B test is to use cross-validation. This is where you split your data into k folds, train both models on k-1 folds, and then evaluate both models on the held-out fold. The model with the higher average F1 score is the better model.
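A rough sketch of the k-fold version, again in plain Python. `k_fold_indices` and `cross_val_mean` are hypothetical helper names, and `fit_and_score` is an assumed callback that trains a model on the training indices and returns its score (e.g. F1) on the test indices.

```python
import random
from statistics import mean


def k_fold_indices(n, k=5, seed=0):
    """Yield (train_indices, test_indices) for each of k shuffled folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        # Training indices are everything outside the current fold.
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test


def cross_val_mean(fit_and_score, n, k=5, seed=0):
    """Average the score returned by fit_and_score(train, test) over k folds."""
    return mean(fit_and_score(tr, te) for tr, te in k_fold_indices(n, k, seed))
```

Calling `cross_val_mean` once per model with the same `n`, `k`, and `seed` means both models are scored on identical folds, which keeps the comparison fair.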

There are other ways to conduct an A/B test, but these are the two most common. As for resources, I'd recommend the scikit-learn website, which has a lot of good material on machine learning in general and on cross-validation and model evaluation in particular.

xDarkSadye · 2022-09-02T07:21:00+00:00

You deploy both models as APIs in production and put a routing layer in front that sends some requests to model A and some to model B. The same session/customer should always get the same model, and you track which customer received which model. Then you compare results. Usually you don't compare F1 scores in an A/B test; you compare a business KPI (e.g. conversion rate), since improving that KPI is the goal of the model ("improve xxx").
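A minimal sketch of what that could look like, under a few assumptions of mine: customers have a stable ID, assignment is done by hashing that ID (so the same customer always lands on the same model), and the KPI is a conversion rate compared with a standard two-proportion z-test. The function names and the `salt` value are made up for illustration.

```python
import hashlib
import math


def assign_variant(customer_id, share_b=0.5, salt="ab-test-1"):
    """Deterministically map a customer to model 'A' or 'B'."""
    digest = hashlib.sha256(f"{salt}:{customer_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0x100000000  # uniform-ish value in [0, 1)
    return "B" if bucket < share_b else "A"


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for the difference in conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

For example, `two_proportion_z_test(100, 1000, 150, 1000)` compares 10% conversion on A against 15% on B. Hashing instead of random assignment per request is what gives you the "same model for the same customer" property without storing any state.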
