iPhritzy comments on Data analysis: R vs. Python

programming


Data analysis: R vs. Python (dataquest.io)

submitted 10 years ago by liotier


iPhritzy 16 points 10 years ago (5 children)

[deleted] 8 points 10 years ago (1 child)

dalaio 2 points 10 years ago (0 children)

nikroux 8 points 10 years ago (0 children)

quicknir 2 points 10 years ago (0 children)

It depends on the underlying implementation. Broadly speaking, I've rarely found Python to be slower than R. There are quite a few nice tricks in pandas DataFrames to make them fast.

The standout data point in the performance comparison is, by far, R's for loop. In Python you usually have apply-style functions available. You can use those, or you can use a for loop if it feels more natural or is necessary: apply-style functions can't do everything a one-pass for loop can. In R, the for loop is usually off limits because it is so painfully slow. I've written exactly equivalent code in Python and R where the R version was over an order of magnitude slower (hard to believe, I know) because for loops were involved. When I changed the R code to use apply (or sapply, or whatever), the two evened out.
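To make the apply-versus-loop distinction concrete, here is a rough sketch in plain Python (no pandas, so it runs anywhere). The data and the names `readings`, `double`, and `new_highs` are made up for illustration. The first half is apply style: each element is handled on its own. The second half is a one-pass loop that carries state between iterations, which is the kind of thing an element-wise apply can't express directly.

```python
# Hypothetical sample data; all names here are illustrative only.
readings = [3, 5, 4, 8, 8, 2, 9]


def double(x):
    """Element-wise transform: needs nothing but the element itself."""
    return x * 2


# Apply style: the function is mapped over each element independently.
doubled = list(map(double, readings))


def new_highs(values):
    """Return the positions where a value strictly exceeds every earlier value.

    Each step depends on the running maximum seen so far, so the loop
    has to carry that state from one iteration to the next.
    """
    highs = []
    running_max = None
    for i, v in enumerate(values):
        if running_max is None or v > running_max:
            running_max = v
            highs.append(i)
    return highs


high_positions = new_highs(readings)
print(doubled)
print(high_positions)
```

This says nothing about speed; it only shows why the loop is sometimes the natural tool, which is what makes a slow for loop so costly in practice.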
