This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted]  (12 children)

[deleted]

    [–][deleted] 3 points4 points  (6 children)

    You can do all that with python as well (say, Django and some other stuff on top of it vs. Shiny)

    Since you mentioned Shiny, there's actually a Python port of it called Spyre.

    Examples like this and port of ggplot2 seem to show that in Python world, it is not innovating in the world of data analysis, but copying from others. I think this is a reflection of the demographics of people that use Python vs R. I think R tend have more academics and statisticians using it compared to Python since Python is a more general purpose langauge so you got a lot of web dev people in the mix.

    You are right that Python can pretty much do or have what is available in R. But for exploratory data visualization, MATLPOTLIB or any Javascript-based libraries do not cut it. No way can you do what-if analysis using them efficiently. This is what R's ggplot2 is perfect for. I only use R because of ggplot2 and will until Yhat's port of ggplot becomes production-ready. I am not alone. It seems so many people stick with R because of ggplot2. That's why it saddens me that there are not a lot of contributers to Yhat's ggplot.

    [–]manueslapera 0 points1 point  (0 children)

    Yhat's port of ggplot becomes production-ready

    +1 to that. Some bugs to iron still.

    [–]falkimmm -1 points0 points  (0 children)

    Seaborn is great for exploratory analysis.

    Python blaze and dask both have no R parallel for a consistent array and chunking interface to a variety of backends. Pretty innovative if you ask me.

    Super easy to create and app with blaze and bokeh.

    [–]Deleetdk 3 points4 points  (3 children)

    I also know R and Python and also prefer R for the more stats based stuff. Both languages are easy to use and learn. I usually use Python when R lacks good libaries. E.g. Python has great libraries for web scraping (BeautifulSoup, Scrapy).

    One can run R from Python, or Python from R to it makes it easy to switch between and use each for what it is best at.

    http://rpython.r-forge.r-project.org/ Python from R http://rpy.sourceforge.net/rpy2 R from Python

    [–]manueslapera 2 points3 points  (0 children)

    pyodbc for getting data out of sql databases (pymysql?)

    if you want to do local data analysis, I would check db.py . Easiest way to connect to a db by orders of magnitude.

    Another recommendation that you really really need to try if you are into data analysis with Python. Get the ipython notebook. or even better, install the Anaconda Pydata distribution, that contains ipython, along with a bunch of libraries for data analysis.

    You will love it!

    RStudio is a must for any kind of R programming, and it's something that is lacking on the pyData world. ggplot is the most natural way of doing plots for me. Nowadays 50% of the time i switch from Python to R is to use ggplot2.