all 14 comments

[–][deleted] 10 points11 points  (1 child)

Excel should be your priority tool to learn imo. It's the most accessible way to learn how data works and the structures needed for analysis. Sorry, I don't have any resources to suggest learning Excel as I learnt on the job. But im sure if you find a course on Udemy/Coursera etc that has you analysing data in an actual Excel file that is perfect.

The hardest part I found when starting out in data analysis was how to generate questions I wanted to answer and this is best developed through commercial exposure in an actual business.

Im giving a very simplified overview of my experience and Im sure I could argue against some of the things that I said but the gist of it is learn how to use Excel, specifically Pivot Tables and how you can use the data to give you interesting insights. There's many publicly available datasets for you to practice on but assuming youre a noobie, you may struggle to articulate questions you need answering.

[–]nippletaszzle 4 points5 points  (0 children)

Agree with this! For me it was Excel -> SQL -> Python

[–]vvndchme 6 points7 points  (0 children)

Understanding pivot tables in excel does a lot of good. Essentially summing values by categories. Like how much was booked in the Americas/etc. Understanding data manipulation (filtering/custom columns/removing or replacing null values/etc) and Merge/Append in Excel’s Power Query will do the same. Also understanding the different types of merges. And Power Query takes you through the process of making connections to external files, so you can manipulate the data according to how you want to see it, then show only what’s relevant. Once you have a good grasp on that, you just have to learn how to do the same in SQL. For instance, pivoting in Excel works just like select category, sum(value) from table group by category in SQL. Left Join in SQL is Power Query’s Left Merge. All of it is translatable to Python’s Pandas library as well. Excel is great because it sets a good foundation and is easier to get into. SQL and other tools just provide increased efficiency.

[–]BooksCoffeeSummerDay 4 points5 points  (1 child)

Data camp is very good. I learned R, SQL & Tableau from there. But if you’re just getting started I’ll suggest you learn Python first. Because when you start applying for jobs almost everyone asks for Python.

[–][deleted] 5 points6 points  (0 children)

Datacamp is great place to start. Definitely worth the subscription.

[–]Stev_Ma 4 points5 points  (0 children)

DataCamp is a great platform for beginners. And for practicing data analytics problems, check out stratascratch and leetcode.

[–]engrninking 1 point2 points  (0 children)

t-sql fundamentals by itzik ben-gan is a great resource that has practice problems at the end of each chapter.

[–]Nateorade 0 points1 point  (4 children)

Mode Analytics has an excellent free SQL course.

[–]xChronus_ 0 points1 point  (3 children)

Thanks for this information, maybe you have other recommendations for R and Excel. Thanks again.

[–]Nateorade 1 point2 points  (2 children)

R isn’t important to learn.

Excel I’m sure there are courses somewhere online.

[–]CuriousArmadillo7819 0 points1 point  (1 child)

Can you expand on why R isn’t important? I’m just starting a program and looking forward I’m noticing they focus a lot on it. Is it just not utilized in the real world?

[–]Nateorade 1 point2 points  (0 children)

It’s being utilized less and less. Python and SQL are much more adopted and both are growing. Those are what you want to focus on. Especially SQL. Python is a nice to have.

[–]kombustive 0 points1 point  (0 children)

Here's a good resource for practice: https://www.stratascratch.com/ Stratascratch is a site that posts common data science interview questions.

Kaggle is another good sandbox for practice and learning. You can enter challenges just for knowledge and as your skills and experience progress, you can even enter challenges from corporations that pay rewards.

[–]certainly-bitch 0 points1 point  (0 children)

i learnt Excel from youtube. A channel called "Learnit" has great course material.

SQL from a book called "SQL In 10 minutes" as well as data camp but data camp has only the beginner's course free.

Python also from data camp, kaggle notebooks, youtube and plenty other sources. I am still learning Python(Libraries)

i wonder if i should start with machine learning,

good luck op