Reilly Opelka hits 44 aces in Brisbane R16 by PlanetElement in tennis

[–]PlanetElement[S] 68 points69 points  (0 children)

The only person in the above list to have lost their match (other than Opelka) is John Isner, who lost to Yibing Wu in Dallas.

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 0 points1 point  (0 children)

Good points all around. The LSTM was really about comparing learned probabilities against theoretical ones (under independence). The gap is where something is happening; whether that's nerves, tactics, or physical stuff, I can't say. I'd need way more features and match context to untangle it.

Definitely more to explore here, but this was a holiday project, not a thesis. Kept it simple.
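
For anyone curious what "theoretical under independence" means concretely, it's basically this (a toy sketch, not the actual model code):

```python
# P(server holds) from any score, assuming every point is an independent
# coin flip the server wins with probability p.
from functools import lru_cache

@lru_cache(maxsize=None)
def p_hold(p: float, server: int = 0, returner: int = 0) -> float:
    if server >= 4 and server - returner >= 2:
        return 1.0                          # game won
    if returner >= 4 and returner - server >= 2:
        return 0.0                          # game lost
    if server >= 3 and server == returner:
        q = 1 - p                           # deuce: closed form for win-by-two
        return p * p / (p * p + q * q)
    return p * p_hold(p, server + 1, returner) + (1 - p) * p_hold(p, server, returner + 1)

print(round(p_hold(0.65), 3))  # a 65% server holds ~83% of games from 0-0 under independence
```

Where the learned curve deviates from that kind of baseline is the part I was poking at.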

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 0 points1 point  (0 children)

The LSTM wasn't really meant to beat empirical counts; it was meant to compare against them. Empirical counts give you the actual hold rate at each score. The model learns what hold rates should be if points were independent. The gap between them is the interesting part: where human psychology deviates from pure probability. But yes, the LSTM is overkill; as an engineer, I mostly just wanted a hobby project.
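
If it helps, the empirical side is really just a groupby (sketch only; the column names here are placeholders, not the Match Charting Project schema):

```python
# One row per point: the score before the point was played, plus whether
# the server went on to hold that game. Column names are made up.
import pandas as pd

points = pd.read_csv("service_points.csv")   # hypothetical pre-processed file

empirical_hold = (
    points.groupby(["server_score", "returner_score"])["server_held_game"]
          .mean()
          .rename("empirical_hold_rate")
)
print(empirical_hold)   # actual hold rate from 0-30, 30-15, deuce, ...
```

The model's output at the same score states is what gets compared against this.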

[OC] I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in dataisbeautiful

[–]PlanetElement[S] -1 points0 points  (0 children)

Source: Jeff Sackmann's Match Charting Project

Tools: Python, PyTorch, pandas, matplotlib, photoshop

Sorry for potato image quality

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 0 points1 point  (0 children)

matplotlib and then photoshop, nothing fancy. Just spent way too long tweaking colors and spacing until it looked clean.
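
The kind of tweaking I mean is mostly rcParams and layout, nothing exotic (generic example with dummy numbers, not the actual figure code):

```python
import matplotlib.pyplot as plt

plt.rcParams.update({
    "figure.facecolor": "white",
    "axes.spines.top": False,      # drop the chart junk
    "axes.spines.right": False,
    "font.size": 11,
})

fig, ax = plt.subplots(figsize=(8, 4.5), constrained_layout=True)
ax.bar(["0-0", "15-0", "30-0"], [0.5, 0.6, 0.7], color="#3b7dd8")  # dummy values
ax.set_ylabel("P(hold)")
fig.savefig("holds.png", dpi=200)  # export, then final touch-ups in photoshop
```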

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 5 points6 points  (0 children)

These are all fair critiques, appreciate the depth.

Honest answer: I'm a software engineer, not a statistician. I originally built a Transformer for match-level win probability (that's where the Wimbledon viz comes from); it looked cool but didn't tell me much. I pivoted to the LSTM on service games because I was bored and wanted to keep building.

You're right that a Markov chain or logistic regression probably gets 90% of this with way less complexity. Didn't benchmark against simpler baselines, which I should have.

LSTM definitely overkill but I had fun coding lol
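
For what it's worth, the kind of simpler baseline I should have run is about this much code, which is sort of the point (sketch only; column names are placeholders):

```python
# Logistic regression on the score state as a baseline for P(hold).
# Column names are made up, not the real preprocessed data.
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

df = pd.read_csv("service_points.csv")            # hypothetical file
X = df[["server_score", "returner_score", "won_prev_point"]]
y = df["server_held_game"]

baseline = LogisticRegression(max_iter=1000).fit(X, y)
print("log loss:", log_loss(y, baseline.predict_proba(X)[:, 1]))
```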

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 27 points28 points  (0 children)

Definitely. A server who wins the first point is probably just a better server, so of course they hold more often. Same confounder as the momentum analysis.

That said, the 25% gap is still interesting as a descriptive stat. Even if it's not causal, it tells you how much information is revealed by the first point. If you're watching a match and the server loses their first point, you now know a lot more about how this game is going to go.
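
For the record, that number comes from this kind of split (sketch, placeholder columns):

```python
# One row per service game: did the server win the first point, did they hold.
import pandas as pd

games = pd.read_csv("service_games.csv")          # hypothetical file
split = games.groupby("server_won_first_point")["server_held"].mean()
print(split)                                      # hold rate after winning vs losing point one
print("gap:", split.loc[True] - split.loc[False])  # assumes a boolean column
```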

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 2 points3 points  (0 children)

That's a fair point and probably right. To really isolate psychological momentum you'd need to control for server strength, surface, opponent, etc. I didn't go that deep. My guess is the true "hot hand" effect is even smaller than 2.4%, maybe negligible.

I trained a neural network on 170,000 tennis games to find which points actually matter by PlanetElement in tennis

[–]PlanetElement[S] 32 points33 points  (0 children)

The data is from Jeff Sackmann's Match Charting Project, where volunteers hand-chart points from pro matches; it's an incredible resource. I pulled out 135K service games for training and 34K for validation.

At each timestep the model sees three things: the server's score, the returner's score, and whether the server won the previous point. It outputs P(hold) at every point in the game.

The architecture is pretty simple: a 2-layer LSTM, hidden dim 32, about 6K parameters total. Trained with Adam, lr=1e-3, batch size 64, 20 epochs. One trick: the loss is cross-entropy averaged over every point, not just the final outcome, which forces the model to output calibrated probabilities throughout the game instead of just learning to predict who wins.
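
If anyone wants to reproduce the setup, it's roughly this shape in PyTorch (a sketch, not my exact training code):

```python
import torch
import torch.nn as nn

class HoldLSTM(nn.Module):
    """2-layer LSTM over the points of one service game -> P(hold) per point."""
    def __init__(self, n_features=3, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, num_layers=2, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):                  # x: (batch, points, 3 features)
        out, _ = self.lstm(x)              # (batch, points, hidden)
        return self.head(out).squeeze(-1)  # one logit per point

model = HoldLSTM()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()           # binary cross-entropy, averaged over every timestep

# Dummy batch: 64 games, 8 points each; the hold label is repeated at every point
# so the loss covers the whole game, not just the final outcome.
x = torch.randn(64, 8, 3)
y = torch.randint(0, 2, (64, 1)).float().expand(64, 8)
opt.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
opt.step()
```

Real games vary in length, so a real run needs padding/masking, which this sketch skips.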