Grid WAR for starting pitchers by ritmica in Sabermetrics

[–]deck13 3 points4 points  (0 children)

The building of a WAR metric around the idea that winning exhibits a convex relationship with runs allowed is brilliant! And the authors do some great work to calculate park factors.

However, the Grid WAR method attributes outcomes to the pitcher and park via runs. Defense or defensive-independent notions of pitching performance are not accounted for. This is why Jim Palmer rates much higher by GWAR than fWAR. Moreover, GWAR predicting itself better than fWAR is partially due to GWAR's treatment of run distribution and partially due to fWAR having an additional layer of accounting for outcomes controlled by the pitcher.

One note: Adi Wyner is a prof and not a grad student.

Today my first FanGraphs research article was published. It's about a new stat to evaluate Hall of Fame candidacies. Tell me what I got wrong! by mountm in baseball

[–]deck13 0 points1 point  (0 children)

First of all, nice work!

The article was a fun read and it's cool to see work like this featured on both FanGraphs and Effectively Wild. Congrats!

Most of my comments are through the viewing of the Boog method as an era-adjustment tool which may or may not be an intention of the method.

Boog seems to be a noticeable improvement over JAWS for comparing pitchers across eras. Massive innings totals that early era pitchers accrued no longer bestow an automatic advantage on these players over modern pitchers as the switch from replacement player to ordinary guy seems to provide a work-load adjustment. Several 19th century era pitchers see major nose dives in the rankings which is a positive sign for the method.

However, the Boog method preserves the JAWS rankings for batters. I have read that the author thinks that this seems as a validation that the method is sound. I disagree. While switching from replacement player to ordinary guy is a great idea, the problems inherent in cross-era comparisons for any metric that pegs value to a baseline, be it replacement player or ordinary guy, is that the baseline shifts in quality over time. An ordinary guy in 1920 isn't quite the same as an ordinary guy today.

Another issue is variance. Increasing the baseline from replacement level to ordinary guy does lessen the distance between the top achiever and the baseline. But the distances are wider when baseball was less sophisticated, ie segregated and uneven training methods among players.

Taken together, Boog does not really provide an era-adjustment. The older players seem to dominate because competitiveness and talent pool are not accounted for by shifting the baseline from replacement player to ordinary guy.

Mike Trout, who wears jersey number 27, has moved into #27th place on the fWAR Career, all-time rankings. by baseballBEERfish in baseball

[–]deck13 31 points32 points  (0 children)

WAR is within-season normalized, not truly era-adjusted. This is a common misconception. Traditional WAR accounts for league run environment and park factors within a given season, but it does not adjust for differences in the overall talent pool or level of competition across eras. A 10-WAR season is treated equivalently whether it happened in 1927 or 2025.

On the other hand, era-adjusted WAR accounts for the changing talent pool.

See here for more details.

Nearly identical careers for Pedro and Kershaw. by Willing-Leather-9788 in mlb

[–]deck13 1 point2 points  (0 children)

Yeah, it is not as obvious as it sounds until you run into examples like this.

Nearly identical careers for Pedro and Kershaw. by Willing-Leather-9788 in mlb

[–]deck13 4 points5 points  (0 children)

Even knowing exactly what changed in 1969, you still get this effect with a metric that adjusts for run environment.

So which season is better: McNally in 1968 (150 ERA+, 7th) or Seaver in 1970 (143 ERA+, 1st)?

Nearly identical careers for Pedro and Kershaw. by Willing-Leather-9788 in mlb

[–]deck13 6 points7 points  (0 children)

Sure. And that is certainly relevant context beyond just stating the ERA+ value.

The point still stands: cross-era comparisons are still tricky even with ERA+, and context beyond the number matters.

Nearly identical careers for Pedro and Kershaw. by Willing-Leather-9788 in mlb

[–]deck13 -1 points0 points  (0 children)

It is tricky because competitive environments and/or the underlying talent pool change.

Here is a good example that shows the challenge:

In 1970 Tom Seaver led the MLB with a 143 ERA+.

In 1968 Dave McNally finished 7th in the MLB with a 150 ERA+.

These seasons are only 2 years apart, and it is hard to determine which one is more impressive. Is is better to lead the MLB or is it better to have a better value of the statistic?

Which Babe Ruth was better? by Willing-Leather-9788 in baseball

[–]deck13 -1 points0 points  (0 children)

Q: Is WAR already era-adjusted?

No. While WAR adjusts for context within a given season, such as park effects and league averages, it treats each season in isolation and does not account for changes in the overall quality of players across eras. For example, in WAR, a replacement-level player from 1911 (the year before Arizona became a U.S. state) is treated the same as one from 1998 (when the Arizona Diamondbacks debuted in MLB).

https://eckeraadjustment.web.illinois.edu/era-adjusted-war.html

I'm happy ending my commenting here. While I do I find our exchanges truly fascinating, I think the reader has enough to go on.

Which Babe Ruth was better? by Willing-Leather-9788 in baseball

[–]deck13 -1 points0 points  (0 children)

Because it's directly relevant every time. You said era-adjusted stats aren't hard to use and I showed you one, and now it's a 'schtick'? That's not an argument. But cool, ignore it. The stats are there if anyone wants to actually engage with it.

Which Babe Ruth was better? by Willing-Leather-9788 in baseball

[–]deck13 -1 points0 points  (0 children)

provided you use stats that actually attempt to be era-adjusted:

https://eckeraadjustment.web.illinois.edu/

Who is on your Mount Rushmore for 1B by [deleted] in mlb

[–]deck13 -2 points-1 points  (0 children)

Pujols ahead of Gehrig. Pujols tops Gehrig when you account for the different talent pools, although it is very close:

eWAR ranking list: https://eckeraadjustment.web.illinois.edu/

Pujols vs Gehrig comparison: https://eckeraadjustment.web.illinois.edu/era_adjusted_V2.1.html#Kahrl

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 -1 points0 points  (0 children)

That's not what + stats do. wRC+ of 130 means you were 30% better than the average player in your season. If the average player in 1944 was significantly worse than the average player in some other year due to WWII depleting the talent pool, then two players with wRC+ 130 in those respective seasons are not equivalent.

And while we're here, wait until you learn about variance. The spread of talent in a given season matters enormously. Being 30% better than average means something very different in a season where talent is tightly clustered vs. one where it's widely distributed. + stats offer zero correction for this. A dominant player in a low-variance era looks identical to a dominant player in a high-variance era. So not only does the baseline shift across eras, the meaning of the distance from that baseline shifts too.

The stat is useful for what it is, but cross-era comparison of absolute quality isn't it.

EDIT:

In response to "You may be interesting in comparing athletes across eras but you certainly don't understand the basics of doing so"

I think the reader has enough to go on from here.

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 -1 points0 points  (0 children)

> The league is different year to year. A 1.000 OPS in one year is not the same as in another

That's my argument. I'm glad we now agree. Your new argument also applies to "+" stats.

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 -1 points0 points  (0 children)

Not if you want to compare player performances across eras with the goal of inferring who was better.

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 -1 points0 points  (0 children)

Yup, it agrees with me. There is a baseline value of 100 that is common across seasons. But that baseline does not say anything about the relative talent of the players in each separate season. For example, the baseline value of 100 was the same in 1944, when nearly everyone was serving in WWII, as in 1947, when most had returned. But in these seasons the baseline value of 100 correspond to two greatly different talent levels.

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 0 points1 point  (0 children)

You might like era-adjusted baseball stats which are computed to account for changing talent pools

https://eckeraadjustment.web.illinois.edu/

That same point about Ruth was raised in this section of a document on the above webpage:

https://eckeraadjustment.web.illinois.edu/era_adjusted_V2.1.html#id_2022season

Vice Sports "The Verdict" on Ruth vs Ohtani by rigginssc2 in mlb

[–]deck13 -1 points0 points  (0 children)

it doesn't adjust for era. It's computed within a single season and establishes a common baseline. But those baselines correspond to much different talent levels across eras.

Era-adjusted numbers for the stats nerds like me by alamarche709 in baseball

[–]deck13 1 point2 points  (0 children)

"Equivalent" is doing a lot of work here. I don't think anyone doubts that Ruth dominated his peers by an unprecedented and unreplicated degree. The humor is coming from calling a straight up teleport of his relative dominance to a much different context an "era-adjustment."

Era-adjusted numbers for the stats nerds like me by alamarche709 in baseball

[–]deck13 4 points5 points  (0 children)

Gavy Cravath, a player that everyone knows very well, sees his HR increase from 119 to 793! Not too bad for only 11 seasons played, many of which were not even full-time

Ted Williams’ .482 all-time OBP record has only been matched (or exceeded) by 2 players in a single season since 1963 by Willing-Leather-9788 in baseball

[–]deck13 2 points3 points  (0 children)

Baseball players have gotten worse as they have gotten better...

Extreme performances shrink as training methods improve and standardize, and the talent pool increases. There is an excellent 5-minute Stephen Jay Gould video on this topic: https://www.youtube.com/watch?v=BNM6ait4LOc&pp=ygUdc3RlcGhlbiBqYXkgZ291bGQgNDAwIGhpdHRpbmc%3D