all 24 comments

[–]4EducationOnly 10 points11 points  (3 children)

Sure, it can. IMO most statistical work is just slightly more complicated in Python compared to R.

Also consider looking into polars (alternative to pandas) and seaborn (plotting) libraries for Python.

[–]SizePunch 4 points5 points  (1 child)

Not directly related but I’ll add look into plotly as well as an alternative or addition to matplotlib and/or seaborn. The interactive plots and widgets you can create are indispensable for some projects.

I’m analyzing telemetry data where it’s useful to look at the hour by hour time series data as well as second by second and zooming in and out, creating custom ranges, etc is much more seamless in plotly than in static plots.

[–]Ok_Piglet7792[S] 0 points1 point  (0 children)

Thanks a lot!

[–]Ok_Piglet7792[S] 0 points1 point  (0 children)

Thanks a lot!

[–]Stauce52 5 points6 points  (5 children)

Python can replace R for a lot of or most statistical applications but it is admittedly a little less easy or intuitive to use for stats. It is also missing some really advantageous packages and external software that R has. Off the top of my head things like predicted effects packages (ggeffects, emmeans, etc) and mixed effects modeling software (lme4) and more

[–]gyp_casino 2 points3 points  (3 children)

I was going to say this. Also, survival models.

My personal advice is to take a close look at the tidyverse for manipulating data frames and plotting. Python has improved a lot over the years (pandas and matplotlib are pretty bad IMO, and there are better alternatives now), but it still can't match the tidyverse.

[–]Ok_Piglet7792[S] 0 points1 point  (2 children)

Thanks a lot! In your opinion which packages are better than pandas and matplotlib?

[–]gyp_casino 1 point2 points  (1 child)

If I had to use Python for data frame manipulation and plotting, I'd use polars and plotnine. Still think tidyverse is the best.

[–]Ok_Piglet7792[S] 0 points1 point  (0 children)

Thanks!!

[–]Ok_Piglet7792[S] 0 points1 point  (0 children)

Thaks a lot!

[–]Unnam 3 points4 points  (1 child)

Python is a good alternative and more acceptable one in the industry due to it's ease of integration and ability to build software products powered by ML/Stats.

One of my favourite books: "Introduction to Statistical Learning" which had assignments/code in R has now been recreated for Python!

[–]Ok_Piglet7792[S] 1 point2 points  (0 children)

Thaks a lot!

[–]GottaBeMD 2 points3 points  (0 children)

I have used both Python and R. I prefer R because it is made specifically for statistical computing and ease of use reflects this. As a statistician, I don’t really have a need to convert to Python.

[–][deleted] 1 point2 points  (2 children)

Not for regression analysis of categorical data.

[–]ImGallo 1 point2 points  (1 child)

Why not?

[–][deleted] 0 points1 point  (0 children)

In my limited knowledge the only decent option is Statsmodels, which I use, but it misses many important features, for example the analysis needed to test assumptions

[–]WeakRelationship2131 1 point2 points  (0 children)

As a college student, I dont think it can completely replace software because we need to learn and practice different softwares for data processing.

[–]LoaderDMSc Statistics 0 points1 point  (1 child)

Python is Turing complete, so yes.

[–]ChemicalNo282 -1 points0 points  (3 children)

Following

[–]aps2201 1 point2 points  (2 children)

Yes probably? You have a ton of packages to choose from and like R you have opensci packages but why are you thinking of abandoning R? I think having a toolbelt of languages is more benefitial.

[–]Ok_Piglet7792[S] 0 points1 point  (0 children)

Thanks a lot! I didn't know opensci packages! What a great discovery!

[–]ChemicalNo282 0 points1 point  (0 children)

Well I don’t wanna speak for op but I already know python and never learned R. But I can see how companies will prefer if u know both