Departments' graduate courses

Course start and periodicity may vary. Please see details for each course for up-to-date information. The courses are managed and administered by the respective departments. For more information about the courses, how to sign up, and other practical issues, please contact the examiner or course contact to be found in the course information. 

Exploratory Data Analysis

  • Course code: FACE001
  • Course higher education credits: 2.5
  • Graduate school: Civil and Environmental Engineering
  • Course start: 2021-04-22
  • Course end: 2021-05-26
  • Course is normally given: The course schedule has not been established yet.
  • Language: The course will be given in English
The course will review basic concepts in data science and introduce some advanced methods for exploring multivariate datasets. We will work with data organization and planning for multivariate data analyses. Participants will receive training in concepts and gain experience in applying some popular unsupervised data mining techniques for exploring datasets with the aid of online learning platforms. Additionally, students will work with their own multivariate datasets. Some questions to be answered include, “how closely are my samples related to each other, and are any of my measured variables correlated with others?” The focus will be on exploring continuous variables (e.g. density, age, length, weight) while using discrete or categorical variables (e.g. batch, model, gender) as visualization aids. The main methods to be practiced are Principal Components Analysis and Cluster Analysis, which will be placed in a general context of machine learning techniques. Students are encouraged to bring their own datasets and/or can use one of the datasets provided. The examples will mainly use Matlab and R, although other software (e.g. Python) will be accommodated when possible.

Topics to be covered

  • Data organisation and planning
  • Data types and distributions
  • Principal Component Analysis
  • Classification / Clustering
  • Support vector machine
  • Visualization

Obtaining course credit

To pass the course, you will need to complete homework assignments, give two in class presentations and hand in a final report.


This is an online course with in person discussions on zoom. The class typically meets on Thursdays
We will mainly use online resources, with details to be announced.
Additionally, this book is highly recommended:
Tufte, E.R. The visual display of quantitative information. ISBN 0961392142
Associate Professor Kathleen Murphy
More information
Enquiries: Kate Murphy,

Page manager Published: Wed 10 Feb 2021.