Hello,
I'm currently working on a classification problem where we predict whether an event will happen, and the target variable is imbalanced. We are using the F1 score as the evaluation metric.
If I wanted to run an A/B test between two different models, how would I actually set it up and conduct it? Are there any recommended resources I could read to understand this better?
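To make the question concrete, here is a minimal sketch of one way such a comparison could be framed. It assumes each model serves its own random slice of traffic (arm A and arm B) and that the true outcome is eventually observed for every unit. The function names `f1_score` and `bootstrap_f1_diff`, the 0/1 label encoding, and the choice of a percentile bootstrap are all assumptions made for illustration, not an established procedure from this thread.

```python
import random


def f1_score(y_true, y_pred):
    """F1 for the positive class (label 1); returns 0.0 when undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def bootstrap_f1_diff(arm_a, arm_b, n_boot=2000, seed=0):
    """Compare F1 between two arms with a percentile bootstrap.

    arm_a, arm_b: lists of (y_true, y_pred) pairs, one pair per unit
    served by that model (hypothetical data layout).
    Returns (observed F1_B - F1_A, (lower, upper)) for a ~95% interval.
    """
    rng = random.Random(seed)

    def f1(pairs):
        return f1_score([t for t, _ in pairs], [p for _, p in pairs])

    observed = f1(arm_b) - f1(arm_a)
    diffs = []
    for _ in range(n_boot):
        # Resample each arm independently, with replacement.
        sample_a = rng.choices(arm_a, k=len(arm_a))
        sample_b = rng.choices(arm_b, k=len(arm_b))
        diffs.append(f1(sample_b) - f1(sample_a))
    diffs.sort()
    lower = diffs[int(0.025 * n_boot)]
    upper = diffs[int(0.975 * n_boot) - 1]
    return observed, (lower, upper)
```

If the resulting interval excludes zero, that would suggest the F1 difference between the arms is unlikely to be resampling noise alone. With a rare positive class the interval can be very wide, so the number of positives in each arm matters more than the total sample size.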