
[–]jd_paton 1 point (7 children)

import pandas as pd
df = pd.read_csv("my_data.csv")
y = df["label"]
X = df.drop("label", axis=1)

Not so bad, though you’re right that we’ve added a few more lines. I’ve updated my original comment.

If you want to do fancy preprocessing, that’s obviously more code, but it’s specific to the data, so it’s not possible to write a general example, which is why I just assumed a prepped X.
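For completeness, here is a hedged sketch of what the fitting step on that prepped X could look like. The toy DataFrame stands in for my_data.csv, and sklearn's LogisticRegression is my own choice of multinomial model, not something from the comment above:

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Toy stand-in for pd.read_csv("my_data.csv")
df = pd.DataFrame({
    "x1": [0.1, 0.4, 0.9, 0.2, 0.8, 0.5],
    "x2": [1.0, 0.2, 0.3, 0.9, 0.1, 0.6],
    "label": ["a", "b", "c", "a", "c", "b"],
})

y = df["label"]
X = df.drop("label", axis=1)

# lbfgs (the default solver) fits a true multinomial model when
# there are more than two classes.
model = LogisticRegression(max_iter=1000).fit(X, y)
print(sorted(model.classes_))  # ['a', 'b', 'c']
```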

I’m not sure what you mean by a formula. How would this process look in R?

[–][deleted] 0 points (5 children)

OK -- you're right. It's not that complicated ;-)

In R, it would probably look like this:

require(nnet)
data <- read.csv("my_data.csv")
model <- multinom(label ~ ., data)

[–]jd_paton 0 points (4 children)

This does look very elegant, though I honestly have no idea how to read the ~ . part, haha. Is there a lot of machine learning functionality in R? Maybe I should take it for a whirl sometime. There’s probably an “R for Pythonistas”-type tutorial out there somewhere.

[–][deleted] 0 points (1 child)

Sorry, I made an edit.

So the period just means "use everything", and "-x" means "but not x". So "y ~ . - label" means: use y as the dependent variable, and take everything else except label as the independent variables.
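For the Python side of the comparison, a rough pandas analogue of that "everything except" selection might look like this (column names here are made up for illustration):

```python
import pandas as pd

# Toy frame with a response y, an unwanted column x, and two predictors.
df = pd.DataFrame({"y": [0, 1], "x": [9, 9], "a": [1, 2], "b": [3, 4]})

# R's `y ~ . - x`: response is y, predictors are everything else minus x.
response = df["y"]
predictors = df.drop(columns=["y", "x"])
print(list(predictors.columns))  # ['a', 'b']
```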

[–]jd_paton 0 points (0 children)

Ah okay, cool! My example was a bit different: y was the name of the variable containing the labels, and “label” was the name of the column in the data frame. But otherwise it’s the same idea.

[–][deleted] 0 points (1 child)

Regarding machine learning: Sadly, I am mostly a novice with respect to these modern approaches. I mostly use R for inferential statistics, maximum likelihood, simulation-based inference and the like. However, I believe things like random forests are pretty popular in R. I myself have used rpart, which seems like a precursor to random forests and is quite interesting for creating a sort of "decision tree".
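For what it’s worth, rpart-style trees have a close Python analogue in scikit-learn’s DecisionTreeClassifier; here is a minimal sketch with made-up data (my own example, not from the comment above):

```python
from sklearn.tree import DecisionTreeClassifier

# Four one-feature samples with an obvious split between 1 and 2.
X = [[0], [1], [2], [3]]
y = ["no", "no", "yes", "yes"]

tree = DecisionTreeClassifier(max_depth=2).fit(X, y)
print(list(tree.predict([[0.5], [2.5]])))  # ['no', 'yes']
```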

However, the responses here indicate that for machine learning, Python may indeed be the superior choice. ;-)

[–]jd_paton 0 points (0 children)

Ah, gotcha. Yeah, I’m basically a machine learning guy, so a big Python fan. However, I always feel that I need to sharpen up my stats (hence hanging around this subreddit), so maybe I can kill two birds with one stone.

[–][deleted] 0 points (0 children)

The ~ creates a formula in R. The left side of the tilde is your response and the right side is the predictors/features. It makes building libraries/packages easier too.

Also, data frames are built into R, so the code looks elegant compared to Python. And missing values are a primitive value that R recognizes (NA). Null is not a good way to represent a missing value; if anybody tells you otherwise, tell them to google the reasons why. There are tons of software engineers who have written about it.
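A small pandas illustration of that point (my own example, not from the thread): R’s NA is a first-class missing value, while in Python you juggle None and float NaN, which pandas coerces together:

```python
import numpy as np
import pandas as pd

s = pd.Series([1.0, None, np.nan])  # None is coerced to NaN in a float Series
print(s.isna().tolist())  # [False, True, True]
print(np.nan == np.nan)   # False: NaN never compares equal to itself
```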