Foundations of data science
This course is listed
- in Aachen RWTHonline as Foundations of data science,
- in Bonn Basis as MA-INF 4228 Foundations of data science.
Direct SIGN UP to eCampus. (For visiting see below.) Such a sign up simply gives access to the course materials, it is no formal registration for the course or its exam, that has to be done with BASIS or RWTHonline, see below.
Contents
Data science aims at making sense of big data. To that end various tools have to be understood for helping in analyzing the arising structures.
Often data comes as a collection of vectors with a large number of components. To understand their common structure is the first main objective of understanding the data. The geometry and the linear algebra behind them become relevant and enlightening. Yet, the intuition from low-dimensional space turns out to be often misleading. We need to be aware of the particular properties of high-dimensional spaces when working with such data. Fruitful methods for the analysis include singular vector decomposition from linear algebra and supervised and unsupervised machine learning. If time permits we also consider random graphs which are the second most used model for real world phenomena.
Lecture
Time & Place
- Consultation time: Monday, 1200 c.t.-1400, online lecture room.
- Plenum: Wednesday, 1200 c.t.-1400, online lecture room.
- Tutorial 1: Monday, 1400 c.t.-1600, tutorial room (via eCampus).
- Tutorial 2: Monday, 1600 c.t.-1800, tutorial room (via eCampus).
First meeting: Monday, 4 April 2022, 1215-1600 with a break: online lecture room.
To ease your communication you can at any time appoint with each other in this free room.
Exam tools and materials
You must be part of the group '| 2022 SoSe | MA-INF 4228 - Foundations of Data Science - Technical test' within the eCampus course to get access. This will be the basis for the exam access list, so: If you are registered for the exam but you do not have access, contact me now!
Needed tools for the exam:
- Working camera and connection to BigBlueButton.
- Access to eCampus.
- Show (or print) a PDF.
- Digitize several pages into a PDF and upload PDF to eCampus. (Scanner or camera...)
- Paper, pens, ...
Allowed material:
- all teaching materials from the lecture and the tutorial,
- as well as all of your own materials and notes,
- a calculator, a python or sage session,
- cheat sheet (handwritten!, A4, double sided).
Exam
Technical setup test and Pre-exam meeting: Wednesday, 10 August 2022, 1000-1200, online lecture room.
Exam1: Friday, 12 August 2022, 1000-1300, online (eCampus, BBB).
Post-exam1: individual exam reviews.
Exam2: Friday, 30 September 2022, 1300-1600, online (eCampus, BBB).
Post-exam2: individual exam reviews.
Literature
- Avrim Blum, John Hopcroft, and Ravindran Kannan (2020). Foundations of Data Science. Cambridge University Press, ISBN 9781108485067, eISBN 9781108620321.
Drafts are on Hopcroft's page: PDF. - Olivier Bousquet, Stéphane Boucheron & Gábor Lugosi (2004). Introduction to Statistical Learning Theory. In Bousquet, v. Luxburg & G. Rätsch (editors), Advanced Lectures in Machine Learning, Springer, pp. 169--207, 2004. Webpage, PDF.
- Marc Peter Deisenroth, A Aldo Faisal, and Cheng Soon Ong (2019). Mathematics for Machine Learning. Webpage with PDF.
Allocation
4+2 SWS.
- Master in Media Informatics:
8 ECTS credits.
Students have to register this course with RWTHonline. - Master in Computer Science at University of Bonn: MA-INF 4228.
9 CP.
Students have to register for the exam to this course with POS/BASIS (see here).