Prof Dr Christoph Schommer, University Luxembourg, Dept of Computer Science, Campus Belval. Home: illas.uni.lu/members/christoph_schommer Email: christoph.schommer@{uni.lu, fu-berlin.de}
Whats it all about?
The processing and the understanding of natural language is one of the most important aspects of Artificial Intelligence in general. This is not only due to, e.g., the simulation of natural conversations among humans as a cognitive process, but also due to daily text-related applications that underpin the importance of AI as a supporting instrument ("AI is for humans"). Concrete examples are, e.g., chatbot technology, the generation of texts, the application of machine learning in text-intensive environments, and the retrieval of the right information by search engines. Just to note that the aspect of ambiguity is still an issue that even deepl.com and translate.google.com are unable to solve (October 2020).
This course will take place in Winter Term 2020/21 from 5 November 2020 until 25 February 2021 (15 weeks) on Thursdays, 10h15 - 11h45 (Lecture) and 12h15 - 13h45 (Tutorial) via Webex. In terms of content, this is not a lingustic course, but rather a course that includes aspects of language processing, machine learning and calculable methods. It should also be pointed out that the course is not intended to be a monologue of the professor, but that we will work together on the underlying topic.
Our weekly meeting link:
https://unilu.webex.com/unilu/j.php?MTID=mbf9408197d3ac6f555770c1db0411717
5 November 2020 - 25 February 2021
{ Meeting number: 163 940 6354
Password: yQ6tVkmN5M6 // Host key: 698099 }
The course is organized as Lecture and a Tutorial that is performed by the course participants ("Presentation groups" 1-11).
____________
IMPORTANT (4 November 2020)
I have organised a DOODLE-Poll, where you can select the day of presentation (since we are around 45 participants, 11 days are presented):
Please note that a) each participant can cross only one slot and b) the number of presenters on the specific day is limited to 2.
In the lecture part of the course, we discuss selected aspects of selected chapters of the book by David Jurafsky, James Martin: "Speech and Language Processing". Source: see https://web.stanford.edu/~jurafsky/slp3/
In the Tutorial, you work with the book "Natural Language Processing with Python" by Steven Bird, Ewan Klein, and Edward Loper. Source: https://www.nltk.org/book/
Additional references:
Contact
For questions or comments of any kind, do not hesitate to send me a letter: christoph.schommer@ { uni.lu, fu-berlin.de}
Please note that the course takes place only if the number of participants is >= 5.
Evaluation
The evaluation is as follows: 40% Final assignment + 60% Tutorial. Both Final Assignment and Tutorial must be followed, otherwise there will be no mark given. Please note that the final assignment takes place on 4 March 2021 from 10-12
Lecture
In the lecture, I will address selected topics of Natural Language Processing as given in the book by Jurafsky as well as below.
Tutorial
Depending on the # of participants, the number of participants per presentation group fluctuates from 1 to 4. The default value is 2. The documents are available via pdf under Resources. You may also find the documents as html-page by clicking on the corresponding link below:
Please note that there are no exercises but that the Tutorial is mainly a self-study.
What you will learn in the Tutorial part (see book by Bird et al.):
Please note:
Evaluation Scheme - Lecture + Tutorial
Lecture
We will have a final assignment at the end of the course. The final assignment takes up to 100 minutes work (net; respecting the time for receiving and sending, the whole time is 120 minutes), where you have to work on the question sheet that I will send to you by email. You are then invited to send me your answer sheet by email, too. The questions are less about facts but more about an understanding of the the overall subject.
Please note: we will have a test assignment under real conditions on the 18th of February 2021. The purpose of this assignment/exam is a) to learn how I ask and what answers I expect and b) to have a summary/conclusions of the course. We will then discuss the test assignment/exam a week later (25 February). Please note that your answers are not evaluated and that they do not count. It is a dress rehearsal.
Tutorial
Please note: the presenters must have sitched ON their video. The programming environment and/or slides have to be shared with all.
Preliminary Schedule
05 November
10-12 Course Overview; Introduction
12-14 Tutorial Overview
12 November
10-12 Regular expressions, Word repair
12-14 { Self-study }
19 November
10-12 Language Models: n-grams, Smoothing
12-14 { Self-study }
26 November
10-12 Vector Space Model; tf.idf; Word embeddings
12-14 { Self-study }
03 December
10-12 Part-of-Speech; HMM; Viterbi
12-14 Group 01: Stachnik, Vindimian
10 December
10-12 Naive Bayes; Precision, Recall, and F-measure
12-14 Group 02: Bich, Kaibel
17 December
10-12 Practical NLP aspects - my talk at the European Language Resource Consortium (ELRC)
12-14 Group 03: Akperov, Golghalyani
07 January
10-12 Connectionism for a Word-Sense Disambiguation
12-14 Group 04: Kothari, Schäfer
14 January
10-12 Context-free Grammars, CYK Chart Parser
12-14 Group 05: Bockhorn, Chisaru
21 January
10-12 Information Extraction, Associative Memories based on converstations.
12-14 Group 06: Mies, Pinto
28 January
10-12 Text classification
12-14 Group 07: Kirchner, Szwedowicz
04 February
10-12 Test assignment as a Summary of the course + Preparation
12-14 Group 08: Akkus, Baral
11 February
10-12 We discuss the test exam/assignment /Dress rehearsal
12-14 Group 09: Hristov, Rademann
18 February
10-12 Group 10: PS, Comak
12-14 Group 11: Cornelius, Lin
25 February
10-12 self-study + preparation for the final assignment (no course, no tutorial today)
12-14 self-study + preparation for the final assignment (no course, no tutorial today)
FINAL EXAM/ASSIGNMENT : 4 March 2021, 10-12.
Rules and conditions:
Course No | Course Type | Hours |
---|---|---|
19331501 | Vorlesung | 2 |
19331502 | Übung | 2 |
Time Span | 05.11.2020 - 04.03.2021 |
---|---|
Instructors |
Christoph Schommer
|
0086c_k150 | 2014, BSc Informatik (Mono), 150 LPs |
0086d_k135 | 2014, BSc Informatik (Mono), 135 LPs |
0087d_k90 | 2015, BSc Informatik (Kombi), 90 LPs |
0088d_m60 | 2015, MSc Informatik (Kombi), 60 LPs |
0089b_MA120 | 2008, MSc Informatik (Mono), 120 LPs |
0089c_MA120 | 2014, MSc Informatik (Mono), 120 LPs |
0207b_m37 | 2015, MSc Informatik (Lehramt), 37 LPs |
0208b_m42 | 2015, MSc Informatik (Lehramt), 42 LPs |
0458a_m37 | 2015, MSc Informatik (Lehramt), 37 LPs |
0471a_m42 | 2015, MSc Informatik (Lehramt), 42 LPs |
0556a_m37 | 2018, M-Ed Fach 1 Informatik (Lehramt an Integrierten Sekundarschulen und Gymnasien), 37 LPs |
0557a_m42 | 2018, M-Ed Fach 2 Informatik (Lehramt an Integrierten Sekundarschulen und Gymnasien), 42 LPs |
Day | Time | Location | Details |
---|---|---|---|
Thursday | 10-12 | Online | 2020-11-05 - 2021-02-25 |
Day | Time | Location | Details |
---|---|---|---|
Thursday | 12-14 | Online | Übung 01 |