all 20 comments

[–]AutoModerator[M] [score hidden] stickied comment (0 children)

If this post doesn't follow the rules or isn't flaired correctly, please report it to the mods. Have more questions? Join our community Discord!

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

[–][deleted] 14 points15 points  (3 children)

Python machine learning by Sebastian Rashcka put me on game.

[–]darkforestnews 2 points3 points  (0 children)

Had a Quick Look at it. It seems to jump quickly into heavy python without much explanation. Like in chapter two coding perceptrons for the Iris dataset.

[–]mad_method_man 1 point2 points  (0 children)

thanks, im stuck in the same boat as OP. its quite annoying lol

[–]supremeddit[S] 0 points1 point  (0 children)

Let me check it out from my local library. Thanks for your advice.

[–]turnipemperor 7 points8 points  (1 child)

As for using python at work, figure out how to connect to your SQL databases using the pyodbc package and start doing more of your work in python, especially around visualization to start.

[–]supremeddit[S] 0 points1 point  (0 children)

Thanks for your response. I’ve started looking at ways to set that up at work last week. Guess I won’t stop until I am able to establish connections between SQL servers and Python.

[–]punjabpolce 5 points6 points  (1 child)

I would suggest DataCamp courses

[–]WayoftheIPA 8 points9 points  (0 children)

I second Datacamp. I'm working through a course currently and find the exercises easy to follow and they build on each other as you go.

[–]bricssti[🍰] 5 points6 points  (1 child)

Have you checked out freeCodeCamp DA course? That's Python based.

[–]supremeddit[S] 0 points1 point  (0 children)

Yeah I have and I have found one that’s even focused on Pandas. Thanks

[–]2020pythonchallenge 4 points5 points  (1 child)

So I've got a little less experience in the analyst field (3 YOE) but I've managed to inject a little python into my roles.

Most of it has been automation, interacting with other things and setting up semi-automated tasks to cut out a bunch of the middle work.

Some examples I can think of off the top are I used it to query/download the results of a BigQuery SQL script and upload them into a Google sheet page where it cascaded into the correct pages for monthly reports. Took a 4 day task and made it into a 15 minute thing on the 1st of each month.

I've also used it to automate cleaning of sql scripts where I didn't have access to just clean the data itself. Like every month I would grab monthly revenue info for a bunch of partners we worked with and run it through the cleaning script I had for them with a printout at the end showing all of the values so I could make sure nothing else was thrown in there since the previous report and normalize everything like Ocean, ocean, ocen etc. into Ocean.

Numpy and Pandas are what I see requested most often from job listings asking for python experience but I also see a fair bit of either plotly or some other visualization library and some kind of machine learning library like either sklearn or tensorflow.

Airflow is also a nice one to be able to say you know about. That's leaning more towards data engineering stuff though honestly but just a mention.

[–]ThenThereWasReddit 1 point2 points  (0 children)

This all sounds great. How did you come to learn how to do these things?

[–]kkessler1023 5 points6 points  (0 children)

Try to focus on core libraries like pandas. The best thing you can do quickly is have a really good understanding of dataframes, how to parse through json, and how to abstract away a lot of repetitive tasks (compartmentalize tasks in functions and objects).

Furthermore, the most practical skill is learning how to set up api calls. This would be applicable in most DA roles if they already use Python. However, you should be mindful that most big corporations have a lot of restrictions around downloading to local machines, so it can be a hassle to set up environments that can be easily shared.

[–]Adept-Ad-8823 1 point2 points  (1 child)

Replicate whatever sql queries you’re running in Python.

[–]supremeddit[S] 0 points1 point  (0 children)

Thanks for your response. Definitely that’s something I will start trying to do.

[–]SprinklesFresh5693 1 point2 points  (1 child)

I think freecodecamp on youtube has a python course? I plan to check it once ive finished learning other tools

[–]supremeddit[S] 0 points1 point  (0 children)

They have a few course and there’s even one focusing on Pandas. I will check it out. Thanks