Chi-Square Test of Independence

k 02 augusztus 2016 by Ernő Gólya

A Chi-Square Test of Independence compares frequencies of one categorical variable for different values of a second categorical variable. The null hypothesis is that the relative proportions of one variable are independent of the second variable. The alternate hypothesis is that the relative proportions of one variable are associated with the second variable.

read more

Hypothesis Testing and ANOVA

h 01 augusztus 2016 by Ernő Gólya

Analysis of variance assesses whether the means of two or more groups are statistically different from each other. This analysis is appropriate whenever we want to compare the means (quantitative variables) of groups (categorical variables). The null hypothesis is that there is no difference in the mean of the quantitative variable across groups (categorical variable), while the alternative is that there is a difference.

read more

Data Management & Visualization: Creating Graphs for Our Data

cs 28 július 2016 by Ernő Gólya

After implementing useful data management decisions, it is time to create visual representations of our data that help us better display our findings by graphing the variables we study. According to this week's assignment I use visual tools to display the variables and the relationships between them.

read more

Data Management & Visualization: Making Data Management Decisions

k 26 július 2016 by Ernő Gólya

The following Python code is an example of how to make and implement data management decisions. In the previous assignment I already created categories for the relevant variables, as the dataset I study mostly have continuous data. I revise and alter some of the categories to be more similar to those used by World Bank and WHO.

read more

Data Management & Visualization: Frequency Distributions

cs 21 július 2016 by Ernő Gólya

This week I will call in a dataset and run frequency distributions for my chosen variables. I will also demonstrate how to create a subset of the data rows (i.e. observations). During the course I use Gapminder dataset, which contains quantitative variables.

read more