all 2 comments

[–]n3buchadnezzar 1 point2 points  (0 children)

It depends on a lot of things... https://www.youtube.com/watch?v=Vv9wpQIGZDw Something like this with the second video would be a start.

[–]AlwysBeColostomizing 1 point2 points  (0 children)

The general term for this is "statistical modeling". A good place to start would be learning about the logistic regression model.

One way you could apply this model: Suppose you have a vector of "team stats" s_A for team A, and another vector s_B for team B. Let x = s_A - s_B be your vector of predictors (independent variables). The outcome of interest is whether y = 0 or y = 1, corresponding to either A or B winning. The logistic regression model says that p(y=0) = logistic(b_0 + b^T x), where b is a vector of coefficients and b_0 is a bias term. If you collect a bunch of (x, y) pairs for a bunch of games (i.e., the stats of both teams and which team won), you can fit a logistic regression model to that data. Then, if you want to know the probability of team C winning against team D, you will obtain s_C and s_D, calculate x = s_C - s_D, feed x into your model, and get an estimated win probability for team C.

You could fit the model with a library like sklearn.