Classification and clustering methods development and implementation for unstructured documents collections