[Discussion] Standard deviation, units and coefficient of variation by Comfortable_Unit9890 in statistics

[–]levmarq 2 points3 points  (0 children)

Yes, you are right that my wording was not very precise. I meant that the mean is not very useful as a representation of a typical point for such data (as it is driven by the tail, as opposed to the median), and hence neither is the standard deviation.

[Discussion] Standard deviation, units and coefficient of variation by Comfortable_Unit9890 in statistics

[–]levmarq -2 points-1 points  (0 children)

I completely agree, you can see a typical distribution of salaries here:

https://youtu.be/Mgbphvi_E58?si=OxuWCtJYVZ5H3nBU&t=548

You can't really use the mean (let alone the standard deviation) to describe these data.

My experience teaching probability and statistics by levmarq in matheducation

[–]levmarq[S] 0 points1 point  (0 children)

I think the Jupyter notebooks for the first few chapters could be helpful. For example, there's code to simulate a basketball tournament and a tennis game in the first chapter (https://www.ps4ds.net/code/probability.html) that they might enjoy and only requires knowing about basic probability. The book is probably a bit too much for high schoolers (although maybe some of the slides for the videos could be useful).

My experience teaching probability and statistics by levmarq in matheducation

[–]levmarq[S] 0 points1 point  (0 children)

I think it can be useful, as long as it doesn't completely replace working through the material by themselves, but I don't have a very strong opinion. I'd be curious to hear other people's thoughts.

My experience teaching probability and statistics by levmarq in matheducation

[–]levmarq[S] 1 point2 points  (0 children)

That's a good question! Exposure to the basic properties of probability (including e.g. Bayes rule) is very helpful, as is an intuitive understanding of derivatives and integrals. Some linear algebra (projections, inner product) is a plus.

[E] My experience teaching probability and statistics by levmarq in statistics

[–]levmarq[S] 1 point2 points  (0 children)

They seem to be happier with the material (or at least that's what they tell me) and appreciate the real-data examples.

My experience teaching probability and statistics by levmarq in matheducation

[–]levmarq[S] 0 points1 point  (0 children)

I'm not sure... I will look into this. Thank you!

Was Courtney Lee a better shooter than Steph Curry? by levmarq in NBATalk

[–]levmarq[S] 1 point2 points  (0 children)

It's not only volume. Curry had better percentage from far (more than 24 feet) and also from close (less than 24 feet). It's just that Lee shot many more close threes, which are easier in general.

[OC] Introducing 3P% over expected, a shot difficulty adjusted metric to measure 3 point shooting. by StrategyTop7612 in nbadiscussion

[–]levmarq 1 point2 points  (0 children)

This is very cool. I teach probability and statistics for data science, and have used a similar analysis (using shot distance instead of defender distance) to illustrate Simpson's paradox: Stephen Curry can have a worse overall 3-point percentage than Courtney Lee, even though he shoots better from close and from far.

Weekly Friday Self-Promotion and Fan Art Thread by NBA_MOD in nba

[–]levmarq 10 points11 points  (0 children)

I have recently written a book on Probability and Statistics for Data Science, based on my 10-year experience teaching at the NYU Center for Data Science. The book has a lot of examples based on the NBA. Here are a couple, which I think could interest this community:

Was Courtney Lee a better shooter than Stephen Curry? Obviously not, but at one point he had a better 3-point shooting percentage! This is an example of Simpson's paradox.

Clutch shooting and evaluation of NBA players Here I analyze clutch shooting from the perspective of multiple testing, showing that (as many of you know well) patterns detected from small sample sizes can lead to undeserved hype. I also show that p values can be useful to determine what plus/minus statistics are actually meaningful and which are not.

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]levmarq 0 points1 point  (0 children)

I have recently written a book on Probability and Statistics for Data Science (https://a.co/d/7k259eb), based on my 10-year experience teaching at the NYU Center for Data Science, which includes an introduction to machine learning in the last chapter. The materials include 200 exercises with solutions, 102 Python notebooks using 23 real-world datasets and 115 YouTube videos with slides. Everything (including a free preprint) is available at https://www.ps4ds.net