Dacena

4
Dipartimento di Informatica Sistemistica e Comunicazione Università degli studi di Milano - Bicocca DACENA PROJECT DATA CONTEXT EXTRACTION FOR NEWS ARTICLES In collaboration with Density Design Design Department - Politecnico di Milano

Transcript of Dacena

Dipartimento di Informatica Sistemistica e ComunicazioneUniversità degli studi di Milano - Bicocca

DACENA PROJECT

DATA CONTEXT EXTRACTION FOR NEWSARTICLES

In collaboration with Density DesignDesign Department - Politecnico di Milano

 

DaCENA is an application that combines the factual context in a newspaper article. The factual information is structured data which make up relevant facts about main entities in the article. These facts represent relationships between the well-defined semantics entities. The approach is based on a process characterized by three basic steps:

• Article scraping and semantic annotation of the document base• Extraction of RDF graph, which represents the set of all the semantic associations• Rating extracts paths (from the graph) and selection of the reduced graph, through the ranking of

the paths that allows identification of the most relevant ones

DaCENA is a project about Data Journalism and concerns issues raised by emerging technologies related to information quality, reliability and accuracy of the news in the web 2.0 and in the infobesity age. Many news are acquired through social networks; this implies speed, randomness and lack of attention in the acquisition of information. DaCENA provides addictional context information, relating to the subjects in the article that are not immediately explored. In this way, DaCENA allows the user to interpret in more detail the content of the news. These analysis increase the accuracy of the news, enrich the information, allow a more detailed judgment about it.

In this perspective DaCENA enables verification of partial and decontextualized information.

 

 

TWO types of smantic path

visualization/interaction

+ Ability to customize the interaction 2

1

GRAPHFigure 1

GLOBAL VISUALIZATION Figure

2