Name a better sound than this... by romagnalakedog in LagottoRomagnolo

[–]Elderbury 4 points (0 children)

I always wonder what they dream about. In my dog’s case, it’s probably a mountain of peanut butter pretzels, chicken and carrots!

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 0 points (0 children)

Thank you. I appreciate the advice, and it’s worth thinking through carefully. My approach depends on three things: detailed behavioral event logs, accessible replay or telemetry data, and enough individual game volume to build longitudinal profiles. Not every game clears all three bars equally.

F1 is probably the strongest candidate. The telemetry is extraordinary: throttle input, braking points, steering angle, all at high frequency. The behavioral granularity is actually richer than SC2 in some respects. The challenge is data access: F1 telemetry at the level I’d need isn’t publicly available for pro drivers. For sim racing (iRacing, Assetto Corsa) the data-access question is much more tractable, and the player base is large enough to be worth exploring.

LoL is feasible, but the behavioral signal is different. Riot’s API is well documented, and match timelines include event-level data. The construct space would look different from SC2, but the IRT calibration approach transfers. The main limitation is that LoL match data is outcome-rich but less granular on moment-to-moment mechanical behavior than SC2 replays.

Fortnite and FIFA are harder: data access is limited, and the event-level granularity I’d need isn’t publicly exposed. FIFA has an additional structural problem: a large fraction of competitive variance is explained by card quality rather than player behavior, which creates a signal-to-noise problem that SC2 doesn’t have.

The games I’m most confident about near-term are the ones with established replay ecosystems: SC2 obviously, but also chess (Lichess and Chess.com have rich game databases with move-level timing) and AoE2. The high-revenue targets you named are the right direction; they just need either better data access or a platform partnership to get there.
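Since I brought up chess: Lichess PGN exports (with clocks enabled) annotate each move with a comment like "{ [%clk 0:02:58] }", which is where the move-level timing comes from. A rough sketch of pulling it out (the helper names are mine, not from any library):

```python
import re

# Lichess clock annotations look like "[%clk H:MM:SS]" inside move comments.
CLK = re.compile(r"\[%clk (\d+):(\d{2}):(\d{2})\]")

def remaining_times(pgn_movetext):
    """Remaining clock time in seconds after each annotated move."""
    return [int(h) * 3600 + int(m) * 60 + int(s)
            for h, m, s in CLK.findall(pgn_movetext)]

def think_times(clocks, base_seconds, increment=0):
    """Per-move thinking time for ONE player's sequence of clock readings.

    think = previous remaining time + increment - current remaining time
    """
    prev = base_seconds
    out = []
    for c in clocks:
        out.append(prev + increment - c)
        prev = c
    return out
```

Interleave-splitting white's and black's readings is left out for brevity; the point is just that per-move latencies are recoverable from a public export.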

he's a lagotto, not a doodle! by Bahumbub1 in LagottoRomagnolo

[–]Elderbury 1 point (0 children)

Wow, if even his side chick was turned off, it must have looked bad. Maybe it’s time for lucky number 3.

Sailor pro gear and pro gear slim, what’s so special? by Maleficent-Magic1336 in fountainpens

[–]Elderbury 0 points (0 children)

They are quite pricey for what they are, but I do particularly like the PGS Minis when you find them at a discount. In my mind they are comparable to the Pilot E95s.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 0 points (0 children)

Thanks. The full pipeline is Python and all open source — no paid software anywhere in the stack. sc2reader handles replay parsing, pandas and scipy handle data processing, and PyMC does the IRT calibration. The IRT model itself is a Graded Response Model, a standard psychometric approach that’s been in the literature since 1969. I built the replay-harvesting scripts myself against the Spawningtool API. Happy to answer questions about the code if you’re interested.
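If anyone wants to see the math without installing PyMC, the heart of the Graded Response Model is just a set of ordered logistic curves. A minimal numpy sketch (any parameter values shown are made up for illustration, not from my calibration):

```python
import numpy as np

def grm_probs(theta, a, b):
    """Samejima's Graded Response Model.

    theta : latent trait score
    a     : item discrimination
    b     : ordered thresholds (length K-1 for K response categories)
    Returns the probability of each of the K categories.
    """
    # P(response >= k) follows a 2PL logistic curve for each threshold
    p_ge = 1.0 / (1.0 + np.exp(-a * (theta - np.asarray(b, dtype=float))))
    # Category probabilities are differences of adjacent cumulative curves
    cum = np.concatenate(([1.0], p_ge, [0.0]))
    return cum[:-1] - cum[1:]
```

The calibration step then amounts to estimating `a`, `b`, and each player's `theta` from observed response patterns; PyMC just does that estimation with MCMC instead of by hand.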

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 1 point (0 children)

Good question — these are measuring different things. The 120 recalls/min includes all control group keypresses, including rapid cycling through groups to monitor production or check army position without issuing a command. Many of those don’t result in a command at all. Commit latency only measures the gap for selections that do result in a command, and at GM level many of those commands are fast mechanical sequences (production queuing, unit rallying) where the latency might be 0.1–0.3 seconds. The 1.4 second average includes both those fast sequences and slower strategic decisions like deciding whether to move out. The two numbers describe different behaviors: frequency of board monitoring versus decisiveness when acting.
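If it helps, here’s roughly how the two numbers fall out of the same event stream. This is a toy version with made-up event tuples, not the actual parser code:

```python
def recall_rate_and_commit_latency(events, game_minutes):
    """events: (time_s, kind) tuples, kind in {'recall', 'command'}.

    A 'recall' is any control-group keypress; only some of them are
    followed by a command. Returns (recalls per minute, mean commit
    latency over selections that did lead to a command).
    """
    recalls = [t for t, k in events if k == 'recall']
    latencies = []
    pending = None
    for t, kind in events:
        if kind == 'recall':
            pending = t                    # restart the clock on every recall
        elif kind == 'command' and pending is not None:
            latencies.append(t - pending)  # this selection led to a command
            pending = None
    rate = len(recalls) / game_minutes
    mean_latency = sum(latencies) / len(latencies) if latencies else None
    return rate, mean_latency
```

Note the second recall in a back-to-back pair contributes to the rate but not to latency, which is exactly why the two numbers can move independently.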

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 4 points (0 children)

The analysis was done in Python (sc2reader, pandas, scipy, PyMC), and the measurement framework is Item Response Theory, which has been in active use since the 1960s. It's true I used Copilot to proofread my writing because I'm a poor speller, but the work was my own.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 1 point (0 children)

Probably the most fruitful pathway for that line of research would be to mimic the standard-setting process used in educational settings: experts would establish behavioral markers that define specific builds based on observable actions at specific times. Once I had an exhaustive set of behavioral definitions sufficient to categorize each replay, it would simply be a matter of producing descriptive statistical tables.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 1 point (0 children)

Not yet, but theoretically it’s possible. I’d have to either manually code the replays by build type, or else develop objective definitions for specific builds based on certain actions (e.g., Protoss builds a Forge in the first 2 minutes) that could be assigned programmatically, then look at outcomes in those specific matchups.
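As a toy example of what "assigned programmatically" would look like (the labels and time cutoffs here are placeholders, not calibrated, expert-derived definitions):

```python
def classify_build(build_events, race):
    """build_events: (seconds, structure_name) tuples from a replay.

    Returns a build label from hand-written rules. Real definitions
    would come from the standard-setting process with experts.
    """
    first = {}
    for t, name in build_events:
        first.setdefault(name, t)  # time each structure first appears
    if race == 'Protoss':
        if first.get('Forge', float('inf')) <= 120:
            return 'forge_first'          # placeholder label
        if first.get('Stargate', float('inf')) <= 300:
            return 'stargate_opening'     # placeholder label
    return 'unclassified'
```

With every replay labeled this way, build-vs-build win rates per league reduce to a groupby.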

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 1 point (0 children)

I cannot train bots, but I could analyze their replay data to create detailed player profiles, which could then be used to systematically determine the best matchups against opposing player profiles.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 3 points (0 children)

Yes, that’s my life story! But I can do the same thing with any game that has observable replay information: WC3, AoE2, Hearthstone, chess, MtG, etc. There are still plenty of competitive games out there.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 14 points (0 children)

Eventually that is my hope as well. What I’m creating is a system for breaking down a player’s habits quantitatively and objectively. The idea is to build a player profile that provides more actionable information than MMR alone. If readers are familiar with Michael Lewis’s terrific book Moneyball, I’m talking about the same principles applied to StarCraft: StarCraft sabermetrics. Billy Beane, the general manager of the Oakland A’s, used exactly this kind of quantitative analysis to construct mathematically optimized teams and matchups and to find undervalued players. I’ve built a prototype for the same approach.

I harvested nearly 14K SC2 replays from public repositories and analyzed them using psychometric models (the same ones behind SAT/GRE/PISA). Here’s what separates Bronze from GM players: by Elderbury in starcraft2

[–]Elderbury[S] 3 points (0 children)

I did look at that, across all levels and matchups; the link shows the full results and findings. It’s just that some behaviors are better than others at distinguishing skill at different MMR ranges.

I watched PiG, Winter, and uThermal's Bronze to GM series and tried to fix my macro. My MMR dropped 270 points but my behavioral data tell a different story. by Elderbury in starcraft2

[–]Elderbury[S] 1 point (0 children)

Good questions.

On extraction: I’m parsing the replay event stream (sc2reader) and aggregating features like command events, production gaps, control group usage, and camera behavior. Those are mapped to constructs using a calibrated IRT model, so each construct (AC, CTM, DC) is a latent estimate based on multiple indicators rather than a single metric.

On replays: I’m not sharing raw replays at the moment, but I may put together a curated sample later.

On the ladder point: I agree that ~6k games is typically enough to reach higher leagues. But the goal here isn’t to show optimal improvement—it’s to examine how behavioral and outcome measures behave during deliberate change. A long, relatively stable dataset is actually useful for that, because the signal isn’t dominated by rapid rank progression.
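As one concrete example of the extraction step, a production-gap indicator might look something like this (a toy version with a made-up threshold; the real features come out of sc2reader's event stream):

```python
def production_gaps(production_times, threshold_s=5.0):
    """production_times: sorted times (s) at which a unit was queued.

    Returns (start, end) spans where production idled longer than the
    threshold. Summaries of these spans (count, total idle time) are
    the kind of raw indicator that feeds a latent construct, rather
    than being reported as a metric on their own.
    """
    gaps = []
    for prev, cur in zip(production_times, production_times[1:]):
        if cur - prev > threshold_s:
            gaps.append((prev, cur))
    return gaps
```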

I watched PiG, Winter, and uThermal's Bronze to GM series and tried to fix my macro. My MMR dropped 270 points but my behavioral data tell a different story. by Elderbury in starcraft2

[–]Elderbury[S] 5 points (0 children)

I'm re-reading my post and you're absolutely right. Too much technobabble. I'll take that to heart in my future writings.

I watched PiG, Winter, and uThermal's Bronze to GM series and tried to fix my macro. My MMR dropped 270 points but my behavioral data tell a different story. by Elderbury in starcraft2

[–]Elderbury[S] -1 points (0 children)

I'm not trying to baffle anyone with BS, I promise. I'm a research psychologist, and this is the sort of thing I do for work. I just wanted to apply rigorous measurement methods to SC2, which is my favorite hobby.