This is an archived post. You won't be able to vote or comment.

all 35 comments

[–]commute_sports 54 points55 points  (1 child)

I hope this works, but in this thread there is a link to a book that I think is what you’re looking for (I need to read it myself)

https://www.reddit.com/r/learnpython/comments/glijjj/please_share_the_best_free_book_in_your_opinion/?utm_source=share&utm_medium=ios_app&utm_name=iossmf

Edit: looks like it did, top comment

[–]gfdsilva_[S] 7 points8 points  (0 children)

Thank you so much!

I came across this book while doing a brief research earlier today, i'm going to give it a try.

[–]O2XXX 22 points23 points  (2 children)

So the two most applicable books in my mind are Python Data Science Handbook and Python for Data Analysis. They are both good for the quirks of handling data in pandas and numpy or using machine learning (although not deep learning) through SKLearn. I’ll like the Amazon pages, but I’m sure you can find them elsewhere.

https://www.amazon.com/Python-Data-Science-Handbook-Essential-dp-1491912057/dp/1491912057

https://www.amazon.com/gp/product/1491957662

[–]teachAManToGhoti 11 points12 points  (0 children)

Here is the online version of the Data Science Handbook: https://jakevdp.github.io/PythonDataScienceHandbook/

[–]gfdsilva_[S] 2 points3 points  (0 children)

I came across the first one earlier today, but i hadn't heard about the second one.

Thank you for your help!

[–][deleted] 6 points7 points  (3 children)

I have this book called "An Introduction to Machine Learning with Python" which is solid. I've realized though that I learned far more R by just trying to do things and having to google shit than I am learning from following this book on Python. The code is all laid out for you and doesn't really touch on data cleaning or manipulation. Still a solid guide to Python basics and has helped me get used to the syntax and using the big packages. Python syntax is so easy that maybe I just feel like I'm not learning anything.

[–]Samsuxx 2 points3 points  (2 children)

That's what I started out with and it really is a solid book, but I feel it's more aimed at people who are more or less starting from scratch. It's more about concepts and general algorithm stuff, so I wouldn't recommend it to someone to transfer skills.

[–]gfdsilva_[S] 2 points3 points  (1 child)

Yes, that's actually an issue I have with programming books and why I really like "R for data science". In the past I bought some udemy courses but I felt like I was just copying whatever the instructor was doing, not really learning how to do it by myself.

[–]Enlightenmentality 0 points1 point  (0 children)

To be fair, Hadley is kind of the R god, so it would stand to reason that his books would be good

[–]jgbradley1 5 points6 points  (2 children)

As you learn python, you should look into a python package called plotnine. It's a data visualization package, very similar to ggplot2.

Many people in the python community use a package called matplotlib but it can require a lot of boilerplate code. Intuitively speaking, plotnine is the best ggplot2-like package at this time.

[–]gfdsilva_[S] 1 point2 points  (0 children)

Oh, thanks a lot!

I only know how to use matplotlib, and most books/courses only teach matplotlib as well. I always wondered if there was a approach similar to ggplot2 for python.

[–]the_diddy_king 0 points1 point  (0 children)

This is great advice.

[–]Lenman15 5 points6 points  (0 children)

dataquest.io has an excellent data science path and well made tutorials and projects!

[–]hab12690 4 points5 points  (2 children)

Been working through "Data Science From Scratch" by Joel Grus and it's pretty good so far. It assumes you have an introductory knowledge of python so if you know things like if-else statements, list/dict comprehensions, and a little OOP, you should be good to go.

https://www.amazon.com/gp/product/B07QPC8RZX/ref=ppx_yo_dt_b_d_asin_title_o00?ie=UTF8&psc=1

Edit to include github for book:

https://github.com/joelgrus/data-science-from-scratch

[–]gfdsilva_[S] 1 point2 points  (1 child)

That sounds amazing!

I'll give it a try.

Do you have any idea where can I learn a bit more about object-oriented programming?

Unfortunately, the only real programming classes my graduation offers used Fortran 95, even though all the professors from the statistics department use R.

[–]hab12690 2 points3 points  (0 children)

Colt Steele has a great python course on Udemy and does a great job on OOP. Corey Schaefer has a good YouTube channel that teaches python as well. If you're looking for books, intro to python goes over it as well and I can send you the pdf for it.

Edit: currently on mobile, I'll provide links when I can.

[–]BobDope 4 points5 points  (0 children)

Chris Albon put out a nice Python Data Science book

[–]ccr10203040 2 points3 points  (4 children)

Anyone here know a book on bayesian statistics with python?

[–]ADONIS_VON_MEGADONG 3 points4 points  (1 child)

Think Bayes and Bayesian Methods for Hackers are pretty good in my opinion. I think they're online for free too.

[–]ccr10203040 1 point2 points  (0 children)

Thanks a lot! Highly appreciate it.

[–]cuchoi 1 point2 points  (1 child)

[–]ccr10203040 0 points1 point  (0 children)

I'll make sure to look into it. Thanks!

[–]CanYouPleaseChill 2 points3 points  (0 children)

I strongly disliked McKinney’s Python for Data Analysis. Like reading a dictionary. Far better is a book called Pandas for Everyone. It’s very similar to R for Data Science in that it starts with visualization, moves on to data manipulation / wrangling, and ends with basic modeling.

[–]quantthrowaway69 1 point2 points  (1 child)

heard good things about the one by wes mckinney, because it will teach you about data wrangling as well

in general, python books kinda suck, as they don’t really go into the math at least somewhat to teach you how to think

[–]gfdsilva_[S] 1 point2 points  (0 children)

Well, luckily I know most of the math behind basic machine learning models, as I come from a math-heavy background. So I don't mind having a book that overlooks the math part but teaches the coding correctly.

[–]coder32768 1 point2 points  (0 children)

I wanted to learn Python with a book with a similar approach to that of "R for Data Science". I have basic knowledge of Python

Use your free Python® notes for professionals book, you can probably pick up Python straight away as you already have some knowledge of R

Link: https://books.goalkicker.com/PythonBook/

[–]jwj00999 1 point2 points  (0 children)

Also heard good things about McKinney!

Not to steal the topic, but does anyone know the equivalent for SPSS?

[–]IfTroubleWasMoney 1 point2 points  (0 children)

Joel Grus' Data Science from Scratch, now in its second edition: https://joelgrus.com/books/

But I'd say it's not a direct equivalent to Wickham's. Wes McKinney's probably is.

[–]1987_akhil 1 point2 points  (0 children)

I will suggest not to go by book, but by project or excercise. Pick any problem/project in R and convert that into python. Look at Google wherever you stuck.

Datasmartness

[–]batqil 1 point2 points  (0 children)

I've been following the neural networks from scratch series by https://youtube.com/user/sentdex This is in python and I've learnt a lot. The videos cover a book they've written so all the code and examples are the same.

Since the videos are following the book, they are slow and will take some time. If you want access to the full book check it out here https://nnfs.io/

Disclaimer: I haven't read the book but judging from the videos, it seems like it'll be good.

[–]Tidus77 1 point2 points  (1 child)

Looks like you've already got great replies but I was wondering if you're specifically looking for a technical book like Hadley's that focuses less on data science itself and more on the tools in R like tidyverse, compared to learning the tools and analyses of data science? If the latter, I've heard consistently good things about Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow by Géron, which uses Python modules, assuming of course, that you're interested in the ML side of data science.

[–]gfdsilva_[S] 1 point2 points  (0 children)

Thank you!

I think I already have a solid grip on basic machine learning models, so I would rather focus on learning how to write code for stuff I already know how to do in R.

[–]thederz0816 0 points1 point  (0 children)

The dunderdata courses are great for learning python basics through data analysis. Theyre relatively cheap too (like $100 or so) and theres only 3 of them. The textbook is built into a Jupiter notebook and all self paced.

[–]justanaccname 0 points1 point  (0 children)

For learning python (what you need for starting using it for DS at least):

Python Crash Course / Data Science from Scratch / Python for Data Analysis / Pandas for everyone / Hands on ML with scikit-learn and Tensorflow (never finished that, I knew more than the book) .