This seminar is about random objects in very high-dimensional Euclidean spaces, as they appear for instance in data science. One intriguing property of the associated theory is that our low-dimensional intuition often fails spectacularly, as the following examples show.
We will explore the mathematical background of these phenomena and review, in particular, inequalities concerning concentration of probability. Depending on number of participants and interest we might even learn about their consequences for random graphs and matrix completion.
Time & location: Wed 16:15 - 17:45, Room: A3/HS001 (Hörsaal), Exception: A6/SR 025/026 on June 15
Péter Koltai (peter.koltai@fu-berlin.de)
A rigorous course in probability theory, further undergraduate linear algebra and calculus.
[Ver] R. Vershynin: High-Dimensional Probability
https://www.math.uci.edu/~rvershyn/papers/HDP-book/HDP-book.pdf
[BHK] A. Blum, J. Hopcroft, and R. Kannan: Foundations of Data Science.
https://home.ttic.edu/~avrim/book.pdf
The seminar will consist of weekly student talks (~60 min) and following discussion. Each talk will be moderated by another student participant of the seminar. Every speaker should present to P. Koltai a detailed concept of their talk at least two weeks prior to the talk. Please make an appointment via peter.koltai@fu-berlin.de.
The final grade will be composed from the results of the own talk(s).
Topics will be assigned during the first class on Wednesday, Apr 20. These include (page numbers refer to the online version of the book [Ver]):
0. Appetizer: Probabilistic proof and approximate version of Caratheodory
1. Basics on random variables (pp. 6-12)
2. Hoeffding (pp. 13-19)
3. Chernoff + degrees of random graphs (pp. 19-23)
4. Sub-gaussian distributions: Definition and examples (pp. 24-29)
5. General Hoeffding’s and Khintchine’s inequalities, Centering, sub-exponential distributions (pp. 29-35)
6. Bernstein’s inequality & outlook (pp. 37-40)
7. Random vectors in high dimensions: norm & PCA (pp. 42-47)
8. TBA
8+. Johnson-Lindenstrauss lemma & others
For the schedule, see this link.
Course No | Course Type | Hours |
---|---|---|
19246111 | Seminar | 2 |
Time Span | 20.04.2022 - 20.07.2022 |
---|---|
Instructors |
Péter Koltai
|
0089c_MA120 | 2014, MSc Informatik (Mono), 120 LPs |
0280c_MA120 | 2018, MSc Mathematik (Mono), 120 LP |
0590b_MA120 | 2021, MSc Data Science, 120 LP |
Day | Time | Location | Details |
---|---|---|---|
Wednesday | 16-17:30 | A3/Hs 001 Hörsaal | 2022-04-20 - 2022-07-20 |