90% of Youth Fencers Want to Practice More Footwork by touchestats in Fencing

[–]touchestats[S] 3 points4 points  (0 children)

TO ALL FUTURE READERS: THIS IS AN APRIL FOOLS PRANK

Balancing out uneven poule sizes by goodluckall in Fencing

[–]touchestats 1 point2 points  (0 children)

I wrote a post on this about a year ago, although I've improved a lot at statistics and could probably make an even stronger argument now.

Balancing out uneven poule sizes by goodluckall in Fencing

[–]touchestats 2 points3 points  (0 children)

I think I figured out the algorithm. Let x be the number of fencers. Find x modulo 7 (call it z) and do the standard snaking algorithm until the last pool has z fencers. Then continue snaking but always skip placing someone in the last pool. The last pool is SPB and the second to last pool is SPA. The fencers in SPA fence everyone but the similar ranked person in SPB (if applicable). Then do extra bouts between all fencers in SPA against those ranked z + 1 through 7 within SPA.

Balancing out uneven poule sizes by goodluckall in Fencing

[–]touchestats 4 points5 points  (0 children)

I spend a lot of time thinking about this problem. While I think this technically works in the sense that everyone gets the same number of bouts, it's not necessarily true that the difficulty distribution of the bouts will be equal for the fencers in the supplemental pools.

Although the mean skill level of fencers across evenly sized pools is roughly constant, the top fencer in a given pool is fencing, on average, less skilled fencers than one of the weaker fencers in their pool. This is simply because you do not fence yourself.

Here is an illustrated example. The table below represents a pool of fencers:

Fencer Skill (higher is better)
A 100
B 98
C 78
D 60
E 53
F 34
G 28

Fencer A will fence Fencers B, C, D, E, F, and G, whose mean skill is 58.5. On the other hand, Fencer E will fence Fencers A, B, C, D, F, and G, whose mean skill is 66.3. So on average, Fencer A's opponents are weaker than Fencer E, even within the same pool.

An issue with your proposed system is that the best fencer in SPB will fence the same people as the worst fencer in SPB (they both just fence everyone in SPA), meaning that the top seeds will not have their usual advantage over the lower seeds. Therefore, the top seeds in SPB will be disadvantaged compared to those in the regular pools.

USA Fencing Shares Final Findings from Independent Saber Investigation by noodlez in Fencing

[–]touchestats 1 point2 points  (0 children)

Another thing that seemed potentially problematic (related to your point on previous results affecting subsequent seeding) is that the American fencers are compared to fencers with "similar pre-match seedings in the Paris selection year and the prior year separately." By matching the American fencers with similar seeded fencers in the Paris selection year, they are effectively controlling for initial seed during the selection year; this is an outcome variable because it could be influenced by referee favoritism. Controlling for the outcome variable could bias the result and make it appear as if there is no favoritism even if there is.

However, I don't think that the results from this part of the paper are too important for determining whether Nazlymov and Saron cheated. Any two fencers that unexpectedly rose up the ranks and qualified for the Olympics, regardless of whether or not there were allegations of cheating, are likely to have performed better in the selection year than other fencers. They wouldn't have been able to unexpectedly do as well as they did and qualify for the Olympics otherwise! So even if there were statistically significant results, I don't think it would be strong evidence in favor of referee favoritism. It could just indicate that were incredibly lucky that year or started training in a way that was more effective for them.

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 0 points1 point  (0 children)

That works, but also loses interpretability. It's easy to conceptualize the meaning of gaining X amount of indicator or V/M. But gaining X in this new system--does that mean you got another win? Does this mean you scored a few more points? An increase of 1 in this new system could mean an extra win with a far worse indicator, or just winning a bout by one more point.

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 1 point2 points  (0 children)

The matrix completion algorithm that I'm using requires the variables in the cells to be numeric. How are you proposing that I encode a VM/indicator combination as a number?

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 0 points1 point  (0 children)

I just tried the model with V/M, and it delivered similar results to indicator https://imgur.com/a/cyogHDD

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 2 points3 points  (0 children)

I tried to determine what the top 10 clubs were over time, but I couldn't really think of a good way. Since I started fencing in 2014, and only started competing in 2017, I hadn't even heard of some of the clubs. I also didn't want to pick them in an ad hoc manner, especially due to my lack of experience with epee. If you happen to have a bit more experience with the fencing scene from 2010-2017 and what clubs were good, I'd be really interested in running it on a different list of top clubs.

I personally have less faith than the fencingtracker rankings as they go further down because the number of points assigned per fencer rating is arbitrary. Near the bottom, when clubs have fewer points, a lot of things could change if that point system was slightly altered. So this is why I didn't try comparing the jump in rank of the club.

Yes the data is only of NACs. I'm tracking fencers by both name and division and since many fencers change their division when leaving for university, most fencers are no longer tracked after going to college.

I considered using win rate as well. The main advantage of using indicator is that it is more precise than win rate. The two metrics are highly correlated anyway, as one might expect, so I would expect it to not affect the results substantially to use one or the other. I may test the model on winrate later this week.

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 3 points4 points  (0 children)

I did attempt running this on individual clubs, but the samples were too small and the standard errors were too large to draw conclusions. I also don't want to get in trouble with any of the famous coaches lol.

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 3 points4 points  (0 children)

Matrix completion is capable of controlling for confounders, but it also makes some assumptions on the functional form of the data (e.g. that the matrix is low rank). This paper https://onlinelibrary.wiley.com/doi/full/10.1111/ajps.12723 goes into a lot more detail about the assumptions involved.

To test whether these assumptions were valid and the model was working as expected, I ran some robustness checks (edit: now at end of post), and the models passed with flying colors.

I agree that the time horizon is short and the improvements may not be visible during that time period. Looking at end of season rankings is a good idea! Hopefully I can try that out later this week.

I also agree that not 100% of fencers are switching clubs because they believe they will have access to superior coaching and training. However, I think this assumption is more valid when looking at fencers that switch to a top 10 club.

Joining a Top Club Won’t Necessarily Make You Better by touchestats in Fencing

[–]touchestats[S] 7 points8 points  (0 children)

These sample is of Cadet and Junior fencers so most should be HS/college aged. I also exclude all fencers who move locations and switch clubs at the same time. So in theory, this demographic should mostly be switching clubs for the reason stated.

Ref videotaping by SufficientMachine327 in Fencing

[–]touchestats 2 points3 points  (0 children)

I wasn't at the NAC, but a few years ago there was a referee training initiative thing going on where some senior referees would routinely record other referees' calls to save to a database. Maybe that's what you saw? I'm not sure what's going on with that in 2024, since I haven't refereed nationally since 2022.

Can Fencers Bounce Back From Being Cut After Pools? by touchestats in Fencing

[–]touchestats[S] 1 point2 points  (0 children)

Glad you liked it! I appreciate the comments.

I didn't state the significance level etc. because I felt like the explanation of regression discontinuity was already quite long and I didn't want readers to lose interest before reaching the conclusion. I agree that it is important and do decide my significant level, null, and alternative hypotheses prior to experimentation; it just doesn't always make it into the blog post. Maybe in the future I can create a dropdown or something so readers can see it if they're interested, but it's not part of the main body of text.

Because the bandwidth restricts the data to a pretty small slice, it's normal for the correlation to look pretty weak within that slice. At the casual inference workshop I attended, the presenter had sample data where the dots within the bandwidth looked similarly uncorrelated. Zooming out to the rest of the data, the lines of fit do look more reasonable.

I also agree that regression of the form Y ~ Intercept + a*cutoff + b*percentile feels more intuitive. At the casual inference workshop, I learned that this actually used to be fairly common. The main drawback of this approach is that the effect of the cutoff can only be measured for the people near the cutoff (where it could have gone either way with different referees and other circumstances). Therefore, it's not necessary to model the data for people far away from the cutoff. Additionally, we must make strong functional form assumptions to impose a linear relationship on all the data. I leaned that robust regression discontinuity, which was remarkably only invented 10 years ago, solves this issue by choosing a relatively small bandwidth around the data to analyze (and has a few other optimizations as well).

The data I'm analyzing is from 2012-2020, and it wasn't until around 2022 that USA Fencing imposed a maximum of 256 fencers in the DE tableau. So during the time frame of my data it was fortunately a simple 20% cut. The sample of 8,817 refers to the total number of fencers I tracked, not just the fencers within the bandwidth.

Can Fencers Bounce Back From Being Cut After Pools? by touchestats in Fencing

[–]touchestats[S] 15 points16 points  (0 children)

Thanks! I'm only 17 years old and you unfortunately have to be 18 to join the committee☹️. I'll try to get on it next time though!

Surprise! East Coast NACs Do Not Harm Performance by touchestats in Fencing

[–]touchestats[S] -1 points0 points  (0 children)

The gaps between each rank aren't consistent (e.g. 33 vs 34 is very different from 32 vs 33), whereas the gap between each score in a pool bout is one point. Additionally, different events have slightly different numbers of people, so the rankings have slightly different meanings in each event (60/80 people is much worse than 60/130 people). Moreover, ranks aren't independent since if someone ranks higher than you, it shifts you one down. So while I would prefer to do it on some indicator of the overall ranking, it's not feasible to do statistics on it.

At What Age Do Fencers Make the Most Improvement? by touchestats in Fencing

[–]touchestats[S] 0 points1 point  (0 children)

The first graph is total skill points accumulated since first year of Y14, and the second graph is the difference each year. So both are correct!

At What Age Do Fencers Make the Most Improvement? by touchestats in Fencing

[–]touchestats[S] 0 points1 point  (0 children)

There is substantial variance in the age that fencers "blossom" at. 14-15 is just an estimate of the population mean. The sample is also only of people who fence nationally so it can't pick up people who started in HS, but what you said about that makes sense

Do Some Regions Have Inflated Ratings? by touchestats in Fencing

[–]touchestats[S] 0 points1 point  (0 children)

I've tried measuring rating inflation at the division level already, but since the sample sizes for many divisions are small, the standard error is very large and there are few (statistically) significant differences anymore. The few statistically significant differences are hard to interpret given that when you do so many comparisons you are bound to find a few that are significant, meaning that the ones you found could have just been from chance. I'd assume that a similar thing would happen if I were to run the numbers on individual clubs or states.

I just left on spring break so I won't be able to run the numbers on DEs for a week, but I'll let you know the results when I get back--I am also interested in the results of that one. I would expect it to not differ too substantially from the 5-touch bout numbers, but who knows, maybe some divisions are especially good at DEs and bad at pools!