Special Topics in Computer Science The Art of Information Retrieval Chapter 2: Modeling Alexander Gelbukh .
Special Topics in Computer Science Advanced Topics in Information Retrieval Chapter 2: Modeling Alexander Gelbukh .
Vocabulary size and term distribution: tokenization, text normalization and stemming Lecture 2.