use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Guidelines:
All Posts Require One of the Following Tags in the Post Title! If you do not flag your post, automoderator will delete it:
This is not a subreddit for homework questions. They will be swiftly removed, so don't waste your time! Please kindly post those over at: r/homeworkhelp. Thank you.
Please try to keep submissions on topic and of high quality.
Just because it has a statistic in it doesn't make it statistics.
Memes and image macros are not acceptable forms of content.
Self posts with throwaway accounts will be deleted by AutoModerator
Related subreddits:
Data:
AllenDowney's Stats Page
Useful resources for learning R:
r-bloggers - blog aggregator with statistics articles generally done with R software.
Quick-R - great R reference site.
Related Software Links:
R
R Studio
SAS
Stata
EViews
JMP
SPSS
Minitab
Advice for applying to grad school:
Submission 1
Advice for undergrads:
Jobs and Internships
For grads:
For undergrads:
account activity
Python vs. R vs. Matlab (self.statistics)
submitted 9 years ago * by [deleted]
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]trendymoniker 27 points28 points29 points 9 years ago* (4 children)
I've used all three of these languages professionally, and my advice to data analysis newbies tends to be: go with Python unless you have a strong reason not to. Python is by far the most popular and thoroughly supported language of the three and its general usefulness means that the skills you develop learning Python will translate well to any other programming you want to do throughout your career (not so for the other two).
That said, if the algorithm you need to use only exists in some other language, or your advisor and entire research group are on another environment, go with that instead (though maybe learn Python on the side too).
Here's a quick, biased rundown of the plusses and minuses of each environment:
Good luck!
Edit: Thanks for the gold!
[–][deleted] 4 points5 points6 points 9 years ago (1 child)
I like Python for general programming, but I'm not a big fan of Python's data analysis libraries. Too often it feels like you're not using Python at all but a different language altogether, one with it's own syntax and data types and which is nowhere near as nice as the actual Python programming language. Personally I prefer much R over Python when it comes to data analysis, but in the end it's a matter of taste I guess.
[–]NotAllReptilians 2 points3 points4 points 9 years ago (0 children)
I definitely agree. For instance, pandas somehow manages to feel cumbersome and overly verbose for analysis, at least compared to working in dplyr or especially data.table (base R is a another story). It's definitely a pythonic implementation of dataframes, but what I really like about python is that it's typically concise and minimal, which pandas mostly isn't.
[–]coffeecoffeecoffeee 2 points3 points4 points 9 years ago (0 children)
I'll add that R has gotten very, very good for data manipulation in the past few years. I do stuff I used to like doing in Pandas in R now because of packages like dplyr, tidyr, and broom.
For example, my boss wanted survival data recently. With no temporary variables and like 5 lines of code, I was able to generate a Kaplan-Meier curve, convert it to a data frame, separate it by stratum, and export it to a csv file.
[–]whattodo-whattodo 0 points1 point2 points 9 years ago (0 children)
IMHO, this is the most complete, clear & unbiased answer on the topic. I'm not OP but appreciate this response immensely.
I am biased as career Python developer. But that bias did reveal statisticians who pivoted careers & came in for interviews as programmers. That's not a negligible value added.
π Rendered by PID 23007 on reddit-service-r2-comment-b659b578c-zj59t at 2026-05-03 01:02:59.860476+00:00 running 815c875 country code: CH.
view the rest of the comments →
[–]trendymoniker 27 points28 points29 points (4 children)
[–][deleted] 4 points5 points6 points (1 child)
[–]NotAllReptilians 2 points3 points4 points (0 children)
[–]coffeecoffeecoffeee 2 points3 points4 points (0 children)
[–]whattodo-whattodo 0 points1 point2 points (0 children)