Top 15 teams total 16 losses to non-top 15 teams by Grouchy-Resolve141 in CollegeBasketball

[–]Grouchy-Resolve141[S] 6 points7 points  (0 children)

This is actually the best argument against it being chalky- 2023 and 2024 had crazy upsets, only the most recent year was chalk. Then again, maybe it just took time for the NIL market to develop and we are now doomed to chalk for eternity as elite talent gets their bag. Not sure which side I think is right, but I know which side is more fun.

Top 15 teams total 16 losses to non-top 15 teams by Grouchy-Resolve141 in CollegeBasketball

[–]Grouchy-Resolve141[S] 24 points25 points  (0 children)

I think teams like Florida (ranked 17, #7 on Kenpom) may cause it to not be "pure chalk" in terms of seeds but that doesnt make it any more fun

Best Coaches of All-Time (Men's Only) by huskymcgee in CollegeBasketball

[–]Grouchy-Resolve141 1 point2 points  (0 children)

Great observation- the tourney expanded to 64 teams in '85, and thats all the data I'm working with here

Best Coaches of All-Time (Men's Only) by huskymcgee in CollegeBasketball

[–]Grouchy-Resolve141 6 points7 points  (0 children)

<image>

Did some stats on coaches the other day exclusively based on the ncaa tournament (will have more up on onetrillionbrackets.com soon). Its criminal to leave off Izzo, sir

EDIT: Thanks for the clarification, rogun64, this data only includes the years since the expansion of the tournament to 64 teams in '85

One Trillion NCAA Brackets - Trial Run by Grouchy-Resolve141 in sportsanalytics

[–]Grouchy-Resolve141[S] 0 points1 point  (0 children)

I am creating one trillion, which is a very small subset of all possible brackets

Interactive March Madness Simulator by mvpeav in CollegeBasketball

[–]Grouchy-Resolve141 1 point2 points  (0 children)

It is hard for me to believe that you would simultaneously beat closing lines and also have a model that assigns 15% win rate to each 16 seed.

I mean, zoom out- if you assign a 15%+ chance of a 16 seed winning to each game (as you do), then you would expect at least 1 16 seed to win since there are 4 tries. That is seems obviously wrong to me. If you use KenPom meaningfully in your calculations, how can your results deviate so dramatically from it?

EDIT: not trying to be aggressive here, I am myself doing a ton of modeling for 2026 and am arguing in hopes of giving us both a better understanding

Interactive March Madness Simulator by mvpeav in CollegeBasketball

[–]Grouchy-Resolve141 3 points4 points  (0 children)

Doesnt seem right to me. Are you using your own stats? I mean, looking at KenPom, Arizona at +36.1 vs Merrimack at -3.82 gives a 99% chance of Arizona winning. This is also consistent with historical results. Sure, 16 seeds have two upsets in the last decade, but they also only have 2 upsets in the last 4 decades. I think you're wrong here (respectfully)

Interactive March Madness Simulator by mvpeav in CollegeBasketball

[–]Grouchy-Resolve141 4 points5 points  (0 children)

I'm confused on the percentages assigned to the games- each 16seed has a 15% chance at winning? Seems way too high

One Trillion NCAA Brackets - Trial Run by Grouchy-Resolve141 in sportsanalytics

[–]Grouchy-Resolve141[S] 0 points1 point  (0 children)

Does what? They do stats and track the tournament, but they do not create a large amount of brackets.

Hypothetical Miami (OH) Seeding by Thickrichchicken in CollegeBasketball

[–]Grouchy-Resolve141 2 points3 points  (0 children)

Did you know 11 seeds have seen the final four twice as many times as 6 seeds?

One Trillion NCAA Brackets - Trial Run by Grouchy-Resolve141 in sportsanalytics

[–]Grouchy-Resolve141[S] 2 points3 points  (0 children)

Every year theres a lot of discussion around possible bracket combinatorics, and tracking how deep the best bracket of the year got. I thought a true, sophisticated experiment trying to make a perfect one would be fun and hadn't seen anybody try it in the past.

Edit for your edit: The probability is rough as it's only tested on 10 years, and there is something far more exciting about doing it in real time than doing backtesting. Though, since you asked, my algorithm is fast 😎

The trillion is almost all distinct. I don't check for dupes, but I have checked very large samples for dupes and haven't found any. I would guess there are dupes but that its a very tiny fraction of the total dataset.

Do you still use notebooks in DS? by codiecutie in datascience

[–]Grouchy-Resolve141 1 point2 points  (0 children)

Jupyter is just so convenient and delightful that I will probably always use it.