Foundations of data science
This course is listed in Aachen RWTHonline as Foundations of data science, in Bonn Basis as MA-INF 4228 Foundations of data science.
Contents
Data science aims at making sense of big data. To that end various tools have to be understood for helping in analyzing the arising structures.
Often data comes as a collection of vectors with a large number of components. To understand their common structure is the first main objective of understanding the data. The geometry and the linear algebra behind them become relevant and enlightening. Yet, the intuition from low-dimensional space turns out to be often misleading. We need to be aware of the particular properties of high-dimensional spaces when working with such data. Fruitful methods for the analysis include singular vector decomposition from linear algebra and supervised and unsupervised machine learning. If time permits we also consider random graphs which are the second most used model for real world phenomena.
Lecture
Time & Place
- Monday, 1215 c.t.-1400, BigBlueButton/moodle lecture room [b-it bitmax (0.109)].
- Thursday, 1200 c.t.-1400, BigBlueButton/moodle lecture room [b-it bitmax (0.109)].
- Tutorial: Monday, 1400 c.t.-1600, BigBlueButton/moodle tutorial room [b-it bitmax (0.109)].
First meeting: Monday, 20 April 2020, 1215-1600 with a break. This is a double session!
You must be enrolled in the moodle to enter the lecture room.
Exam
Pre-exam meeting: Thursday, 13 August 2020, 1000-1200, BigBlueButton/moodle lecture room.
Exam: Monday, 17 August 2020, 1000-1300, room Wolfgang-Paul-Hörsaal, Kreuzbergweg 28, 53115 Bonn.
Post-exam meeting: individual exam reviews.
Exam2: Thursday, 24 September 2020, 1300-1500, Meckenheimer Allee 176, Hörsäle 2 und 4.
Post-exam2: individual exam reviews,
Notes & Exercises
You will find notes and exercises at sciebo until March 2021.
Literature
- Avrim Blum, John Hopcroft, and Ravindran Kannan (2020). Foundations of Data Science. Cambridge University Press, ISBN 9781108485067, eISBN 9781108620321.
Drafts are on Hopcroft's page: PDF. - Olivier Bousquet, Stéphane Boucheron & Gábor Lugosi (2004). Introduction to Statistical Learning Theory. In Bousquet, v. Luxburg & G. Rätsch (editors), Advanced Lectures in Machine Learning, Springer, pp. 169--207, 2004. Webpage, PDF.
- Marc Peter Deisenroth, A Aldo Faisal, and Cheng Soon Ong (2019). Mathematics for Machine Learning. Webpage with PDF.
Allocation
4+2 SWS.
- Master in Media Informatics:
8 ECTS credits.
Students have to register this course with RWTHonline. - Master in Computer Science at University of Bonn: MA-INF 4228.
9 CP.
Students have to register for the exam to this course with POS/BASIS (see here).
The lecture's mailing list
Students are encouraged to ask and answer any questions related to the course on the mailing list:
20ss-fds-at-lists.bit.uni-bonn.de
You can subscribe to and unsubscribe from the mailing list using the information given on the list's Info page.