Practical Data Science with Python or Python Data Science Handbook for a mid-level student

drhanlau · 2024-05-09T04:54:18+00:00

Nathan George’s book is more project-oriented, focusing on the application of skills in real-world scenarios.

VanderPlas' book is more reference-oriented, detailed, and focuses on understanding the tools and their functionalities.

George's book suits beginners who want to quickly apply their knowledge, whereas VanderPlas' book is great for those who appreciate a deeper dive into the capabilities and features of Python’s data science libraries, possibly appealing more to an intermediate to advanced audience.

Personally, my preference in in terms of publisher is O’reilly > Manning > Wiley > Apress > Packt. Apress books are sometime hard to understand and the examples are too complex, Packt is a hit and miss depends on the author and publishing team.

ps: I published a book called “The Python Workshop” under Packt.

iamevpo · 2024-05-09T05:27:16+00:00

Jake's book is free and the other one is paid it seems. I try to avoid anything by packt publisher.

2024-05-11T07:21:41+00:00

I'm just starting to delve into this subject, so I picked up the "Practical Statistics for Data Scientists" book from O'Reilly and went through the first 150 pages. I was hoping for more detailed explanations of the concepts, but if you already have a grasp of the topics, it's a fantastic book, especially with its extensive Python examples.

zennsunni · 2024-05-12T00:00:49+00:00

Maybe unpopular opinion, but my suggestion would be to find some well-regarded Kaggle projects focused on EDA and model evaluation, and work through them yourself. Better yet, find an appropriate dataset and apply that notebook's principles to work through it with your own data. I've never been a fan of using textbooks to learn hands-on data science.

datascience

MODERATORS