How to become a web developer that stands out from the crowd
Dacena
Transcript of Dacena
Dipartimento di Informatica Sistemistica e ComunicazioneUniversità degli studi di Milano - Bicocca
DACENA PROJECT
DATA CONTEXT EXTRACTION FOR NEWSARTICLES
In collaboration with Density DesignDesign Department - Politecnico di Milano
DaCENA is an application that combines the factual context in a newspaper article. The factual information is structured data which make up relevant facts about main entities in the article. These facts represent relationships between the well-defined semantics entities. The approach is based on a process characterized by three basic steps:
• Article scraping and semantic annotation of the document base• Extraction of RDF graph, which represents the set of all the semantic associations• Rating extracts paths (from the graph) and selection of the reduced graph, through the ranking of
the paths that allows identification of the most relevant ones
DaCENA is a project about Data Journalism and concerns issues raised by emerging technologies related to information quality, reliability and accuracy of the news in the web 2.0 and in the infobesity age. Many news are acquired through social networks; this implies speed, randomness and lack of attention in the acquisition of information. DaCENA provides addictional context information, relating to the subjects in the article that are not immediately explored. In this way, DaCENA allows the user to interpret in more detail the content of the news. These analysis increase the accuracy of the news, enrich the information, allow a more detailed judgment about it.
In this perspective DaCENA enables verification of partial and decontextualized information.