Processing of large document collections Part 2 (Text categorization, term selection) Helena Ahonen-Myka Spring 2005.