The following topics will be discussed during the course:
- Unix shell
- Python refresher
- Structured data analysis
- Research data management
- Data visualization
- Research code management
- Using Python Notebooks for communication
- Introduction to machine learning & AI
- High performance computing
- Ethics of AI
Course leader and examiner
Leon Boschman, course leader
Sverker Holmgren, examiner
Running schedule
The course consists of a weekly full-day workshop, involving lectures, group/individual tasks, and discussions, for a total of 10 days (1 day per week, for 10 weeks). Most of the tasks involve actual data, and you are encouraged to bring your own data to the course.
Language
English
Literature
This course builds on the material in the sources below. All the reference material is freely avialable online. If you have any trouble acquiring the material, please reach out to the course organisers. The books, sites, and articles with reference material are:
- The Linux Command Line, A complete Introduction by William Shotts (available at https://www.dbooks.org/the-linux-command-line-1593279523/)
- Python Data Science Handbook by Jake VanderPlas (available at https://jakevdp.github.io/PythonDataScienceHandbook/)
- https://github.com/c3se/gts-course
- An Overview of Artificial Intelligence Ethics, IEEE Transactions on Artificial Intelligence, vol. 4, no. 4, August 2023
- Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3:160018 doi: 10.1038/sdata.2016.18 (2016)
- Barker, M., Chue Hong, N.P., Katz, D.S. et al. Introducing the FAIR Principles for research software. Sci Data 9, 622 (2022). https://doi.org/10.1038/s41597-022-01710-x
Assessment
Full attendance at at least 8 sessions, and active participation in hands-on exercises and discussions during these sessions. The practice-oriented nature of this course requires you to be present on Campus, so please make sure that you are available for all the assigned dates. Grading U, G.