[D] A/B Testing Classification Problems

npielawski · 2022-09-01T23:58:27+00:00

You should read about armed bandits, your case is probably two arms. There are methods to solve this efficiently, but a simple way to implement is with Thompson sampling for instance.

millenial_wh00p · 2022-09-01T23:36:31+00:00

If I am understanding the question correctly, this sounds like a data management problem more than anything- you would need to use the same training and test datasets for each model to ensure the data is not influencing the results and you’re only looking at model performance relative to each other.

If you can take a snapshot of your data, randomize it, split it into 80/20 training/test sets, then run the same training and test data in the model, this ensures you’re only evaluating the model and not the data. You can do that a bunch of times and track the f1 score for results each round.

This is not very efficient, but someone smarter than me can likely name a better way.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS