MVEX01-16-18 Statistical Analysis of Data

Datasets are the results of experiments or surveys, measured or observed on certain individuals. They can consist of
  • Numbers: quantitative variables (e.g. salary)
  • Codes: qualitative variables (e.g. sex)
Data analysis consists of searching for relations between individuals, or observations, in these datasets.

In other words, there is no statistical modelling: data analysis is an advanced tool of descriptive statistics. Three types of methods can be distinguished:

  1. Classical descriptive statistics: the study of one or two observed variables.
  2. Analysis of a scatter plot in higher dimensions: principal component analysis, etc.
  3. Classification: grouping individuals into homogeneous categories according to a certain criterion.
The goal of this project is to study a dataset with these three methods using the software R. Several datasets are available; students are also welcome to choose their own dataset.
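
As a rough illustration of what the three types of methods could look like in R, here is a minimal sketch. It is not part of the project material: the built-in iris dataset merely stands in for a project dataset, and the choice of hierarchical clustering with Ward's criterion and of three groups is an assumption made for the example.

```r
# Illustrative sketch only: iris stands in for one of the project datasets.
data(iris)
quant   <- iris[, 1:4]     # quantitative variables (measurements in cm)
species <- iris$Species    # qualitative variable (stored as a factor)

# 1. Classical descriptive statistics: one or two observed variables
summary(quant$Sepal.Length)                  # location and spread of one variable
table(species)                               # counts per category
cor(quant$Sepal.Length, quant$Petal.Length)  # linear relation between two variables

# 2. Principal component analysis on centred and scaled variables
pca <- prcomp(quant, center = TRUE, scale. = TRUE)
summary(pca)               # proportion of variance carried by each component
scores <- pca$x[, 1:2]     # coordinates of the individuals on the first two components

# 3. Classification: hierarchical clustering, tree cut into k groups
#    (Ward's criterion and k = 3 are assumptions for this example)
hc     <- hclust(dist(scale(quant)), method = "ward.D2")
groups <- cutree(hc, k = 3)
table(groups, species)     # compare the groups found with the known species
```

Calling plot(scores, col = groups) afterwards would display the individuals in the plane of the first two principal components, coloured by the group they were assigned to.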
Note: for GU students, the project counts as a project in Mathematical Statistics (MSG900/MSG910).
Project code: MVEX01-16-17
Group size: 3-6
Special prerequisites: notions of descriptive statistics and of R
Supervisor: Maud Thomas, 031-7728296
Examiner: Maria Roginskaya
Department: Mathematical Sciences

Published: Fri 30 Oct 2015.