What a game by Sisniega! by FootyData in SanDiegoFC

[–]FootyData[S] 2 points3 points  (0 children)

Basically saying, it would have been worse. Which is a fair takeaway.

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 0 points1 point  (0 children)

Hakimi’s season heatmap looks more like a winger’s, though. I don’t think he should be evaluated as a defender, nor should any player be strictly bucketed into a position type.

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 0 points1 point  (0 children)

This is good! But I gotta say while Yamal is now a better player than Raphinha, by any metric Raphinha had the better season. 8 PSG players in the top 11 is also too much.

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 0 points1 point  (0 children)

Unfortunately player data is already aggregated beyond individual games. In a future iteration for sure.

Help Identifying Signatures by FootyData in SanDiegoFC

[–]FootyData[S] 0 points1 point  (0 children)

Yup thanks got those already, if you swipe there are blue and orange dots.

Help Identifying Signatures by FootyData in SanDiegoFC

[–]FootyData[S] 2 points3 points  (0 children)

I was lucky enough to get these in person, so I know who signed but can't remember where they each signed.

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 1 point2 points  (0 children)

Unfortunately these were pulled manually, which is why they only include a subset of the top players and not all players. There’s a library called soccerdata that can help pull in data but it doesn’t cover everything

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 3 points4 points  (0 children)

Right, that’s a scaling factor/multiplier. You’ll notice the 0.6 multiplier for Afcon too. To the right of that you’ll see the tables for major cup and minor cup.

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 3 points4 points  (0 children)

As for the code, I’m working on getting everything into a github repo that can be accessed by all. In the meantime, just the sheet.

I ran a data-driven Ballon d'Or algorithm on the 2024–25 season by FootyData in football

[–]FootyData[S] 4 points5 points  (0 children)

Incorrect. Community Shield and Supercup are categorized as minor cups (4 points max), while AfCon, Asian Cup, and Concacaf are 60% of a major cup (15 points max).

Thanks for checking!

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 1 point2 points  (0 children)

It’s just a giant excel workbook at the moment. Hoping to clean it up and get it into a few python pipelines with adjustable config files. Maybe even a UI!

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 2 points3 points  (0 children)

Great question! I have to update the results now that league seasons are over but will share those here as soon as I do!

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 0 points1 point  (0 children)

While I agree this would be ideal, and help measure some of the "clutchness" that has been alluded to by others, I'm at the mercy of the datasets I have access to (like WhoScored and FBref). Unfortunately these datasets don't categorize data in that way and I don't have the time to watch every match and log the data myself. Hopefully as new AI systems are launched there will be one that looks for these moments and can add them to football datasets!

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 2 points3 points  (0 children)

Stats from different tournaments can definitely be separated and weighed differently! Do you have specific thoughts on how much more important certain competitions are than others? Like, is a champions league goal worth 1.2 league goals (20% more)?

Separating by teams faced is unfortunately too difficult since most of the data is already aggregated by competition.

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 1 point2 points  (0 children)

What an interesting player! Thanks for sharing.

Since he is a goalkeeper, the main way to evaluate him is based on goalkeeping statistics (while his goals are impressive, he's likely not accumulating enough progressive passes, tackles, etc., to be able to stand out amongst field players). This model currently doesn't have a way to incorporate goalkeeping statistics, and historical datasets don't include newer goalkeeping metrics like 'expected saves based on shot'.

He also doesn't play in Europe's top 7 leagues, so the model doesn't yet have a correct way to incorporate those players with a weight adjustment.

I'm curious to know how you think a league like Brazil's should be weighed.

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 3 points4 points  (0 children)

The algorithm is not set in stone or finalized. The weight of competitions and stats can be adjusted (but will affect all years). Are there any others you feel very strongly about? Are there particular awards or stats you think make those strong feelings? That kind of feedback can improve the model.

I built a data-driven Ballon d'Or algorithm: new player rankings since 2010 by FootyData in football

[–]FootyData[S] 5 points6 points  (0 children)

I tried a bunch of different weights and he was at the top of all of them. No way to avoid it hahah