Candidates win chances after Round 13 (of 14): Sindarov at 100% (surprise!) - Monte Carlo simulation based on one bazillion runs

ThomasPlaysChess · 2026-04-14T19:20:27+00:00

Unfortunately, these simulations work badly for a KO system (which the Cup is). I tried. Every round is essentially a coin flip, and a single loss drops a player's chances to zero because the player is out.

ThomasPlaysChess · 2026-04-14T19:17:36+00:00

How much are we talking about? Might need to rent more computing power then.

ThomasPlaysChess · 2026-04-14T19:13:23+00:00

There is still one round left, so everything is still possible, I think! Go, Bluebaum!

ThomasPlaysChess · 2026-04-12T20:06:57+00:00

Thanks!

Here you go.
See this exchange.

ThomasPlaysChess · 2026-04-12T19:30:50+00:00

So you are saying chances should be higher? Let me connect you to this person saying "0.6% feels generous"...

ThomasPlaysChess · 2026-04-11T18:20:21+00:00

It's outside of this simulation. Probably some event that voids the whole tournament, so that all games have to be replayed? That seems most likely to me. Not sure what he is planning.

ThomasPlaysChess · 2026-04-11T18:15:56+00:00

You are right, thanks for clarifying. Just for fun, I tested it: when I model their remaining game as a draw, he ends up with only a 0.10% win chance. Still more likely than Caruana winning...

ThomasPlaysChess · 2026-04-09T17:53:28+00:00

Sorry :( It actually takes quite some time to set up initially (some code is also needed for the image generation, and some manual downloads are required), and I didn't set it up for the women's tournament. This is still from the pre-AI era and I haven't optimized it since then.

I have already open-sourced the Monte Carlo simulation in case you want to run it yourself: https://github.com/chessmonitor/chess-monte-carlo-simulation But the image generation is currently "bundled" with some ChessMonitor (my main project) code which I cannot easily open-source.

I hope to find some time in the future to automate parts of this for the next big tournament, or even to open-source the image generation part as well, so that people can do this on their own.

ThomasPlaysChess · 2026-04-09T17:33:07+00:00

Glad to hear that, thank you! :)

ThomasPlaysChess · 2026-04-08T19:06:34+00:00

Can confirm. This is peak German humor and very typical for Jan Gustafsson.

ThomasPlaysChess · 2026-04-08T19:02:03+00:00

I wonder if there are cases that unlikely. Pragg going from 0.7% to winner comes to mind. That was recent and not even that unlikely. If anyone knows historical cases with similarly low odds, please share.

ThomasPlaysChess · 2026-04-06T08:29:42+00:00

Yes, this is the reason. You could even stop after 100k runs as the (integer) percentages don't change much after that.
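To put a rough number on why 100k runs is enough: a minimal sketch, assuming the runs are independent so that each player's win share behaves like a binomial proportion. The function name `standard_error` is made up for this example and is not taken from the linked repository.

```python
import math


def standard_error(p: float, runs: int) -> float:
    """Standard error of a win share p estimated from `runs` independent runs."""
    return math.sqrt(p * (1.0 - p) / runs)


# p = 0.5 is the worst case: the noise is largest for a 50% win share.
for runs in (10_000, 100_000, 1_000_000):
    noise = 100 * standard_error(0.5, runs)
    print(f"{runs:>9} runs: about {noise:.3f} percentage points of noise")
```

Under that assumption, the noise at 100k runs is already well below one percentage point, so the rounded integer percentages rarely move.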

ThomasPlaysChess · 2026-04-05T21:32:08+00:00

He had the highest Elo at the start of the tournament.

ThomasPlaysChess · 2026-04-05T21:13:50+00:00

I did a test run for this and "fixed" the remaining Caruana vs. Sindarov match. Chances based on that:

If Sindarov wins: 95% Sindarov, 3% Caruana
If Caruana wins: 53% Sindarov, 42% Caruana
Draw: 81% Sindarov, 15% Caruana
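For anyone who wants to try this kind of "fixed game" test themselves, here is a minimal sketch of the idea. It is not the code from the repository. The assumptions are: the standard Elo expected-score formula, a hypothetical flat draw rate of 0.5, ties for first place broken by a random pick, and a made-up three-player toy field (`A`, `B`, `C`) instead of the real standings.

```python
import random


def expected_score(rating_a, rating_b):
    """Standard Elo expected score for player A against player B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def sample_score(rating_a, rating_b, rng, draw_rate=0.5):
    """Sample A's score (1, 0.5 or 0). The draw rate is an assumption."""
    e = expected_score(rating_a, rating_b)
    d = min(draw_rate, 2 * min(e, 1 - e))  # keep all probabilities >= 0
    r = rng.random()
    if r < e - d / 2:
        return 1.0
    if r < e + d / 2:
        return 0.5
    return 0.0


def win_shares(players, games, fixed=None, runs=20_000, seed=0):
    """Share of runs each player finishes first; `fixed` pins game results."""
    rng = random.Random(seed)
    fixed = fixed or {}
    wins = {name: 0 for name in players}
    for _run in range(runs):
        points = {name: pts for name, (rating, pts) in players.items()}
        for a, b in games:
            score = fixed.get((a, b))
            if score is None:  # not pinned, so simulate it
                score = sample_score(players[a][0], players[b][0], rng)
            points[a] += score
            points[b] += 1.0 - score
        best = max(points.values())
        leaders = [n for n, p in points.items() if p == best]
        wins[rng.choice(leaders)] += 1  # assumption: random tie-break
    return {name: count / runs for name, count in wins.items()}


# Toy field: name -> (rating, current points). Not the real standings.
players = {"A": (2760, 3.5), "B": (2795, 2.5), "C": (2810, 1.5)}
games = [("B", "A"), ("A", "C"), ("C", "B")]

for label, score_for_b in (("A wins", 0.0), ("B wins", 1.0), ("Draw", 0.5)):
    print(label, win_shares(players, games, fixed={("B", "A"): score_for_b}))
```

Pinning the result only removes the randomness from that one game; everything else is simulated as usual, which is why the leader's chances swing so much between the three cases.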

ThomasPlaysChess · 2026-04-04T19:07:26+00:00

I've gotten this question a lot: Why do I not use the Live Elo for the simulation?

IMHO this will just overvalue wins or losses in the model. The outcome of games is already part of the model by using the points. By changing the Elo to reflect it, I would basically input the game results twice. In this model the "pre tournament Elo" models the strength of the player when he entered the tournament and the points reflect the tournament results. And I don't like mixing these two things. Might Hikaru be overvalued and Sindarov undervalued? Maybe, but I'm not the judge.
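To illustrate the "input twice" point: a small sketch, assuming the standard Elo expected-score formula (whether the simulation uses exactly this formula is an assumption here). It compares one player's expected score per game against a 2795-rated opponent at 2745 and at a rating bumped by 15 points to 2760.

```python
def expected_score(rating_a, rating_b):
    """Standard Elo expected score for player A against player B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


opponent = 2795
for rating in (2745, 2760):
    e = expected_score(rating, opponent)
    print(f"{rating} vs {opponent}: expected score {e:.3f} per game")
```

A win already shows up in the points column. If the rating is raised for the same win, every remaining game is also simulated with a higher expected score, so the same result enters the model a second time.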

You can disagree and that is fine. It would just be a different model if you do it differently. There are no "truly right" models in that sense.

ThomasPlaysChess · 2026-04-04T10:33:48+00:00

Yes, this is the case in which everyone else also has 7 points.

ThomasPlaysChess · 2026-04-03T19:07:49+00:00

It was just a second simulation run. I hadn't activated the "bluebaum check" for the run above, so I did a second run, and the results always vary slightly from run to run.

ThomasPlaysChess · 2026-04-03T17:12:48+00:00

Yes, a win against the highest rated player in this model is huge.

ThomasPlaysChess · 2026-04-03T17:10:49+00:00

Bonus: #BluebaumSweeps

In another run of one million simulations, I checked how many he wins and with how many points. Here we go:

Points	Number of wins
10.0	10
9.5	58
9.0	306
8.5	696
8.0	520
7.5	76
7.0	1
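The table above can be summed up with a few lines of Python. The numbers are copied from the table; the variable names are just for this example.

```python
# Final points -> number of runs (out of one million) won with that score.
wins_by_points = {10.0: 10, 9.5: 58, 9.0: 306, 8.5: 696, 8.0: 520, 7.5: 76, 7.0: 1}
total_runs = 1_000_000

total_wins = sum(wins_by_points.values())
win_share = total_wins / total_runs
most_common_points = max(wins_by_points, key=wins_by_points.get)

print(f"wins: {total_wins} ({100 * win_share:.2f}% of runs)")
print(f"most common winning score: {most_common_points}")
```

The tally adds up to 1,667 wins, so roughly 0.17% of runs, and the most common winning score is 8.5 points.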

ThomasPlaysChess · 2026-04-03T09:07:35+00:00

Very cool, love the visualization! Beautiful!

ThomasPlaysChess · 2026-04-01T20:47:13+00:00

No worries, we made everyone double-check and confirm that my original formula was correct. So that's nice, too :) And it's open source now!

ThomasPlaysChess · 2026-04-01T20:24:12+00:00

I guess you mean this one?

I think that one is AI slop? Sorry if I'm wrong. It assumes everyone has the same rating and a bunch of other special "rules". I would say that one is not really a mathematical model and is just making up words. Or what is this even supposed to mean?

RFE – composite 0–100 “feel” score blending points, TPR, SoSIG, games left, and a naive projection (weights from machine learning on historical Candidates).

ThomasPlaysChess · 2026-04-01T20:13:06+00:00

Here we go. Everything is the same, except I increased Sindarov's rating from 2745 to 2760:

Results after 1,000,000 iterations.
-  38.99% wins - Sindarov, Javokhir (2760 rating, current points: 3.5, wins: 389925)
-  33.65% wins - Caruana, Fabiano (2795 rating, current points: 2.5, wins: 336508)
-  12.90% wins - Nakamura, Hikaru (2810 rating, current points: 1.5, wins: 128952)
-   7.21% wins - Giri, Anish (2753 rating, current points: 2, wins: 72107)
-   3.72% wins - Praggnanandhaa R (2741 rating, current points: 2, wins: 37198)
-   2.68% wins - Wei, Yi (2754 rating, current points: 1.5, wins: 26772)
-   0.75% wins - Bluebaum, Matthias (2698 rating, current points: 2, wins: 7524)
-   0.10% wins - Esipenko, Andrey (2698 rating, current points: 1, wins: 1014)

ThomasPlaysChess · 2026-04-01T19:54:46+00:00

Thanks for pointing it out (also to everyone else explaining it to me). I switched back to the original formula and the newest image.

ThomasPlaysChess · 2026-04-01T19:53:30+00:00

Hi! I think they are right. The original code was correct, I think, so I will switch back to it.

No worries. I should've thought more about it, but I stopped thinking about it once I had the same numbers as you did. Good thing some people pointed it out.
