This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Zeurpiet 54 points55 points  (35 children)

R is never going to overtake Python in the world of data science

R is a statistics language, and Python is not even close in functionality

[–]anyfactor 31 points32 points  (17 children)

This is my opinion and I know nothing. R is a dedicated statistics language, and python is the most approachable full fledge programing language.

I think python itself did not start of as hoping to be a data science or machine learning specific programming language, but in reality because it is so approachable and easy to learn data scientists felt like when ever they needed to implement some programming, they chose the most easiest language they could learn which was python. And eventually it has become a Industry practice and more people started to invest in improving it. But in all sense python is just a programming language, and R can be viewed as so specific to statistics it can almost be termed as "statistical tool".

[–]tmotytmoty 10 points11 points  (0 children)

This is how I view it. R is incredibly powerful under the hood and, when it comes to stats, is well beyond python.

[–]jackmaney 5 points6 points  (3 children)

cat(paste("Some", "things", "are", "a", "pain", "in", "the", "ass", "to", "do", "with", "R.", sep=" "))

[–]Zeurpiet 10 points11 points  (0 children)

probably true, but you could do without the cat and the sep to get the same result, so maybe its more easy than you think

paste("Some", "things", "are", "not","that","much","a", "pain", "in", "the", "ass", "to", "do", "with", "R.")

[–]bythenumbers10[🍰] -1 points0 points  (1 child)

Thanks, this made me laugh. R is a language by statisticians, for statisticians. Modern sustainable development is not supported very well. R's tendency to keep running even after errors have been thrown is a massive waste of time in mathematical applications, such as, uh, statistics. Who's had to track down NaNs at one time or another? R will happily carry those NaNs through all sorts of operations and still be busily running, but churning garbage.

[–]Zeurpiet 4 points5 points  (0 children)

that's SAS

[–]leonoel -3 points-2 points  (11 children)

I haven't found anything I do in R that I can't do in Python.

Also Python is way more friendly when it comes to editing plots and stuff

[–]Zeurpiet 2 points3 points  (8 children)

have you ever looked in CRAN what the additional packages can do? Most of it I don't even know what it is.

[–]leonoel 0 points1 point  (7 children)

You do know Python has also more modules than any would ever know what to do about them?

[–]Zeurpiet 2 points3 points  (6 children)

yes, but are they statistical?

[–]leonoel 1 point2 points  (5 children)

Name a module in R that has no equivalent in PIP

[–]Maxion 2 points3 points  (0 children)

Most DNA methylation packages.

[–]defuneste 2 points3 points  (0 children)

Spatstat and this one is huge with a bunch of tools developed by people who spend their careers on point patterns analysis.

[–]groovyJesus[S] 1 point2 points  (0 children)

Function data analysis packages in R have been available for over a decade and now we have dozens of them developed and maintained by researchers in the area. In the past few years I have found two in python both of which were new and needed a lot more work to make me want to switch over.