This Spring of 2023 I will be teaching a class at the Department of Linguistics in Helsinki under the fairly general and demanding header “Approaches to Natural Language Understanding” with Timothee Mickus. We will mostly talk about approaches to knowledge representation of language and have several labs for students to experiment with.
L0: https://docs.google.com/presentation/d/1EOPs-q1JvXHo_jbOPVmgWTN_yl_-yCpYOvECsBvrYBw/edit?usp=sharing
L1: https://docs.google.com/presentation/d/1YO374ZDfBt6GCd4WaDo6DjaWcpRwxdgIk9S-VmaIN1o/edit?usp=sharing
Reading material:
- Magnus Sahlgren: The Distributional Hypothesis
- Timothee Mickus: Chapter 5: Limits of the distributional hypothesis
- Gemma Boleda: Distributional Semantics and Linguistic Theory
- Alessandro Lenci: Distributional Models of Word Meaning
- Marianna Apidianaki: From word types to tokens and back
- Hinrich Schütze: Word Space
L2: Evaluation and Benchmarks and Shared Tasks
L9: Podcast dataset
- Clifton et al: 100,000 Podcasts: A Spoken English Document Corpus
- Jones et al: Current Challenges and Future Directions in Podcast Information Access
- Jones et al: Podcast Track 2020
- Karlgren et al: Podcast Track 2021
L10: Audio features
- Alexander et al: Audio Features for Podcast Retrieval
L13: Research directions
Jussi Karlgren and Pentti Kanerva: Semantics in High-dimensional Space