In this course, appart from simulating own data for some examples, we employ several datasets.

Countries dataset

Data on European countries, primarily from Eurostat. It can be downloaded in several formats

The data contains the following variables:

Jméno proměnné Popis Zdroj
country Country name
code Two digit coutry code
gdp GDP in the milions of EUR (2018) Eurostat
population Country population on the 1.1.2018 Eurostat
area Area of the country CIA factbook
eu_member Is the country a EU member? (2019) European Union
postsoviet Was the country a member of the Eastern block Wikipedia
life_exp Life expectancy at birth (2017) UN
uni_prc % of people with university education between 15-64 years (2018) Eurostat
poverty_risk % of people in risk of poverty (2017) Eurostat
material_dep % of people in material deprivation, 3 or less items (2017) Eurostat
hdi Human development index (2018) UN
foundation_date Date of founding Wikipedia
maj_belief Biggest religious group (2018) Pew Researcher Center

Czech 2017 parliamentary elections voter turnout

Data on the voter turnout in the Czech 2017 parliamentary elections from the 8th round of the European Social Survey.

Comma seperated values format is available here (right click and choose Save link as…).

We recommend to download the data and save them ina folder called data together with other materials for the course.

United Nations data from John Fox

This is United Nations country level data from 1998, originally used by John Fox in his book Applied Regression Analysis and Generalized Linear Models

The data can be downloaded in .txt format here (right click and choose Save link as…). The description of the data here.