Processing of large document collections Part 8 (Information extraction) Helena Ahonen-Myka Spring 2005.