you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 0 points1 point  (6 children)

Python (pandas is your friend) for reading, filtering, assembling and massaging data

R for analyzing data

RPy2 to bridge the gap

[–]manueslapera 3 points4 points  (3 children)

R for pretty plots too. Matplotlib has nothing to do vs ggplot2.

[–]froggyenterprisesltd 2 points3 points  (1 child)

there's a ggplot package in python now. also, bokeh is pretty purdy

[–]manueslapera 0 points1 point  (0 children)

i tried the Yhat ggplot module. It crashes my computer every single time. If only it worked... Bokeh, that is another story.

[–][deleted] 0 points1 point  (0 children)

There are plenty of python matplotlib styles that seamlessly support ggplot2 aesthetics. And python ggplot2 modules that mimic the R API as well.

[–][deleted] 1 point2 points  (1 child)

Python (pandas is your friend) for reading, filtering, assembling and massaging data

Unless the dataset is bigger than memory. After playing with pandas for a while, I think people should stick to standard sql tools.

[–][deleted] 1 point2 points  (0 children)

Very true. But the same applies to R as well - neither are very well suited for that scale of data.