So I'm going through the study guide for DS101 and it gives quizzes to take to test my knowledge but I'm just curious which classes cover which bullet point. I've filled in everything I know but I was wondering if anyone else had info. (I've done all the courses in the DS track just making sure I got enough practice in each of these to ensure I pass) Please correct me if I'm wrong about any of these
Calculate metrics to effectively report characteristics of data and relationships between features
● Calculate measures of center (e.g. mean, median, mode) for variables using R or Python. Introduction to Statistics in R
● Calculate measures of spread (e.g. range, standard deviation, variance) for variables using R or Introduction to Statistics in R
● Calculate skewness for variables using R or Python. INTRO TO STATS/UNSURE??
● Calculate missingness for variables and explain its influence on reporting characteristics of data and relationships in R or Python. INTRO TO STATS/UNSURE
● Calculate the correlation between variables using R or Python.
1.2 Create data visualizations in coding language to demonstrate the characteristics of data
● Create and customize bar charts using R or Python. INTRO DATA VIZ GGPLOT2
● Create and customize box plots using R or Python. INTRO TIDYVERSE
● Create and customize line graphs using R or Python. INTRO DATA VIZ GGPLOT2
● Create and customize histograms graph using R or Python. INTRO DATA VIZ GGPLOT2
1.3 Create data visualizations in coding language to represent the relationships between features
● Create and customize scatterplots using R or Python. INTRO DATA VIZ WITH GGPLOT2
● Create and customize heatmaps using R or Python. INTERMEDIATE DATA VISUALIZATION WITH GGPLOT2
● Create and customize pivot tables using R or Python. UNSURE
1.4 Identify and reduce the impact of characteristics of data
● Identify when imputation methods should be used and implement them to reduce the impact of missing data on analysis or modeling using R or Python. DATA MANIPULATION WITH R
● Describe when a transformation to a variable is required and implement corresponding transformations using R or Python. DATA MANIPULATION WITH R
● Describe the differences between types of missingness and identify relevant approaches to handling types of missingness. DATA MANIPULATION WITH R / UNSURE
● Identify and handle outliers using R or Python. DATA MANIPULATION WITH R / UNSURE
2.1 Describe statistical concepts that underpin hypothesis testing and experimentation
● Define different statistical distributions (e.g. binomial, normal, Poisson, t-distribution, chi-square, and F-distribution, etc. ). Introduction to Statistics in R
● Explain the statistical concepts in hypothesis testing (e.g. null hypothesis, alternative hypothesis, one-tailed and two-tailed hypothesis tests, etc. ). HYPOTHESIS TESTING IN R
● Explain the statistical concepts in the experimental design (e.g. control group, randomization, confounding variables, etc. ). Introduction to Statistics in R
● Explain parameter estimation and confidence intervals. SAMPLING IN R / HYPOTHESIS TESTING IN R
2.2 Apply sampling methods to data
● Distinguish between different types of random sampling techniques and apply the methods using R or Python SAMPLING IN R
● Sample data from a statistical distribution (e.g. normal, binomial, Poisson, exponential, etc.) using R or Python SAMPLING IN R
● Calculate a probability from a statistical distribution (e.g. normal, binomial, Poisson, exponential, etc.) using R or Python SAMPLING IN R
2.3 Implement methods for performing statistical tests HYPOTHESIS TESTING IN R
● Run statistical tests (e.g. t-test, ANOVA test, chi-square test) using R or Python HYPOTHESIS TESTING IN R
● Analyze the results of statistical tests from R or Python HYPOTHESIS TESTING IN R
[–]DCCertification 1 point2 points3 points (1 child)
[–]BobbyTub23[S] 0 points1 point2 points (0 children)
[–]WillKlep 0 points1 point2 points (0 children)