Part II - NATURAL LANGUAGE PROCESSING

DATE
TOPIC
MATERIALS
30/9
1/10
COMPUTATIONAL PHONETICS and
SPEECH PROCESSING:
  • Speech samples: properties and acoustic measures
  • Analysis in the frequency domain, Spectrograms
  • Applications in the acoustic phonetic field.
  • Speech recognition with Deep Neural Networks
  • Spoken Dialogue Systems (ChatBots)
- Slides (II.1), Link
- RJ - Chapter 2
- Slides (II.2), (II.3), (II.3b)
- Tutorial [Fabien, Doshi]
- [FT] - Chapter 8
- [SLP3] - Chapter 14
6/10
Tokenisation and Sentence splitting - Slides (II.4),
- [Schmid, 2008]
6/10
COMPUTATIONAL MORPHOLOGY:
  • Morphological operations
  • Static lexica, Two-level morphology using FSA
- Slides (II.5)
- Beesley & Karttunen [2000] Tutorial, Chap. 1
- [FT] - Section 6.2
7/10
8/10
COMPUTATIONAL SYNTAX:
  • Part-of-speech tagging and Lemmatisation.
  • Grammars for natural language:
    • Phrase structure grammars
    • Dependency Grammars
    • Treebanks
- [FT] - Sections 6.3, 6.4, 7.1
- [SLP3] - Chapters 17 and 18
- Slides (II.5) (II.6)
- Paper [Carlberger, Krann 1999]
- Paper [Tamburini 2016]
8/10
13/10
  • Formal and Natural languages:
    • The Chomsky hierarchy.
    • Natural language complexity.
  • Modern formalisms for parsing natural languages (LTAG).
  • Natural language Dependency Parsing.
- Slides (II.6)
- Paper [Branco, 2018]
- Slide [Horacek et al. 2011]
- SLP3 - Chapter 19 (and 17)
- Slide [Stymne, 2014]]
- Paper [Traxler, 2010]
- Paper [Vacareanu et al. 2020]
14/10
15/10
COMPUTATIONAL SEMANTICS:
  • Lexical semantic resources:
    WordNet and FrameNet.
  • Word Sense Disambiguation.
  • Distributional Semantics & Word-Space models.
  • Word, Sentence and Document embeddings.
  • Distributional approaches to sentence/text semantics.
- [FT] - Chapter 4, Sections 6.6, 6.7
- [SLP3] - Chapter 5
- Slides (II.7), (II.8) (II.9)
- [Miller et al. 93] (only the 1st and 2nd papers)
- [Miller Fellbaum 2007]
- FrameNet site
Paper [Lenci, 2008]
Papers [Mikolov et al. 2013; Le, Mikolov 2014]
Chap. 1, 2 and 4 from [Liu, 2020]

REFERENCES

[FT]
Tamburini, F. (2022). Neural Models for the Automatic Processing of Italian, Bologna: Pàtron.

[SLP3]
D. Jurafsky and J.H. Martin (in press). Speech and Language Processing, Prentice Hall. (3rd edition DRAFT)