Chi-Square Test of Independence
A Chi-Square Test of Independence compares frequencies of one categorical variable for different values of a second categorical variable. The null hypothesis is that the relative proportions of one variable are independent of the second variable. The alternate hypothesis is that the relative proportions of one variable are associated with the second variable.
read moreHypothesis Testing and ANOVA
Analysis of variance assesses whether the means of two or more groups are statistically different from each other. This analysis is appropriate whenever we want to compare the means (quantitative variables) of groups (categorical variables). The null hypothesis is that there is no difference in the mean of the quantitative variable across groups (categorical variable), while the alternative is that there is a difference.
read moreData Management & Visualization: Creating Graphs for Our Data
After implementing useful data management decisions, it is time to create visual representations of our data that help us better display our findings by graphing the variables we study. According to this week's assignment I use visual tools to display the variables and the relationships between them.
read moreData Management & Visualization: Making Data Management Decisions
The following Python code is an example of how to make and implement data management decisions. In the previous assignment I already created categories for the relevant variables, as the dataset I study mostly have continuous data. I revise and alter some of the categories to be more similar to those used by World Bank and WHO.
read moreData Management & Visualization: Frequency Distributions
This week I will call in a dataset and run frequency distributions for my chosen variables. I will also demonstrate how to create a subset of the data rows (i.e. observations). During the course I use Gapminder dataset, which contains quantitative variables.
read more