Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for...

33
Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*, Diego Reforgiato, Alberto Salvati*, Eva Blomqvist, Enrico Daga*, Francesco Draicchio, Paolo Ciancarini°, Sergio Consoli, Silvio Peroni°, Daria Spampinato* CNR Semantic Technology Lab, ISTC-CNR, Rome/Catania [email protected] ; *[email protected] ; °[email protected] http://stlab.istc.cnr.it http://wit.istc.cnr.it/stlab-tools http://data.cnr.it

Transcript of Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for...

Page 1: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic Technologies for Linked Open Data at the STLab

Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*, Diego Reforgiato, Alberto Salvati*,

Eva Blomqvist, Enrico Daga*, Francesco Draicchio, Paolo Ciancarini°, Sergio Consoli, Silvio Peroni°, Daria Spampinato*

CNR Semantic Technology Lab, ISTC-CNR, Rome/Catania [email protected] ; *[email protected] ; °[email protected]

http://stlab.istc.cnr.it http://wit.istc.cnr.it/stlab-tools

http://data.cnr.it

Page 2: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

People• STLab@ISTC

Aldo Gangemi Valentina Presutti Daria Spampinato Andrea Nuzzolese Diego Reforgiato Stefania Capotosti Sergio Consoli Alessio Iabichella

• STLab@SI Alberto Salvati

Gianluca Troiani

• STLab Associates

Paolo Ciancarini (UniBo)

Malvina Nissim

(UniBo)

Massi Ciaram

ita (Google)

Alfio Gliozzo (IBM)

Eva Blomqvist

(Un. Linköping)

Enrico Daga (Open University

)

Alessandro Adamou (Open Un.)

Francesco Draicchio (UniBo)

Francesco Antinucci (CNR)

• STLab (Semantic Technology Lab) è un laboratorio dell’ISTC (Istituto di Scienze e Tecnologie Cognitive) del CNR, con sedi a Roma e Catania, attivo anche a Bologna e Parigi

�2

Page 3: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Outline

•The Linked Open Data (LOD) of CNR !

•The Semantic Scout !

•Machine reading for the Semantic Web !

•Knowledge pattern discovery and usage

�3

Page 4: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

A practical experience: data.cnr.it and the Semantic Scout

Joint work by STLab and the Information Systems unit of CNR Thanks to

Alberto Salvati, Enrico Daga, Gianluca Troiani, Andrea Pompili, Angelo Olivieri

Past collaboration with Claudio Baldassarre (UN-FAO) and Alfio Gliozzo (now IBM-Watson)

Linked Open Data in Public Administrations

�4

Page 5: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Objective and results• Objectives

• Publishing CNR data as LOD

• Matching the research demand to the research supply in the largest research institution (CNR) in Italy

!

• Results • data.cnr.it

• The CNR ontology network and data available as LOD

• Semantic interoperability between heterogeneous data sources

• The Semantic Scout - http://bit.ly/semanticscout • Expert finding based on competence

• Monitoring funding and evolution of different research areas and units

• Browsing and reporting capabilities

�5

Page 6: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

data.cnr.it

�6

Page 7: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

data.cnr.it

�6

Page 8: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

data.cnr.it

�6

Page 9: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

�7

Methods for data conversion, extraction, inference, integration, linking, publishing, and searching

Page 10: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic scoutSemantic search

�8 http://bit.ly/semanticscout

Page 11: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic scoutBrowsing

�8 http://bit.ly/semanticscout

Page 12: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic scoutRelation explorer

�8 http://bit.ly/semanticscout

Page 13: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic scoutExporting exploration results

�8 http://bit.ly/semanticscout

Page 14: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Semantic scoutAutomated reporting

�8 http://bit.ly/semanticscout

Page 15: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Machine reading for the Semantic Web

Page 16: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Apache Stanbol• A set of reusable components for semantic content

management • To extend traditional content management systems with semantic services

accessible as HTTP REST services

• Stanbol is the main software result of the EU IP IKS !

• Our contribution: the Knowledge Representation and Reasoning layer of Stanbol • Services used to define and manipulate semantic data models in CMS, i.e.,

Ontology Network Manager component

• Services able to retrieve additional semantic information about content, i.e., Reaoners and Rules components

�10

Page 17: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Stanbol in a nutshell

�11

NER and linking to LOD datasets

Page 18: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

• The Black Hand might not have decided to barbarously assassinate Franz Ferdinand after he arrived in Sarajevo on June 28th, 1914

qualities

modality

negation

type induction WSD taxonomy induction

semantic roles

NER

events

dates

The <span xmlns:dbo="http://dbpedia.org/ontology/" xmlns:dbr="http://dbpedia.org/resource/" about="dbr:Black_Hand_(Serbia)" typeof=”dbo:Agent">Black Hand</span> might not have decided to barbarously assassinate <span xmlns:schemaorg="http://schema.org/" xmlns:dbr="http://dbpedia.org/resource/" about="dbr:Archduke_Franz_Ferdinand_of_Austria" typeof=”schemaorg:Person”>Franz Ferdinand</span> after he arrived in <span xmlns:schemaorg="http://schema.org/" xmlns:dbr="http://dbpedia.org/resource/" about="dbr:Sarajevo” typeof=”schemaorg:City”>Sarajevo</span> on June 28th, 1914

sample RDFa annotation

FRED http://wit.istc.cnr.it/stlab-tools/fred/

co-reference tense

�12�12

RESTful

Page 19: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Tìpalo

• Motivation

• It is difficult to automatically generate enterprise taxonomies from data available as plain documents

!

• Objective

• To enable automatic generate taxonomies by exploiting the richness of natural language text

�13

Page 20: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Typing DBpedia entities with Tìpalo

Typing

NER

Taxonomy induction

WSD Alignment to WordNet supersenses

Alignment to Dolce

http://wit.istc.cnr.it/stlab-tools/tipalo/ �14

“Pakito is the alias of french electronic dance music artist Julien Ranouil” (cf. wikipedia.org)

RESTful

Page 21: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Sentilo

• Sentilo is a new method of Sentic Computing

• i.e., Semantic Sentiment Analysis, which is a new research area

!

• Motivations

• Sentiment Analysis does not take into account semantic features when computing opinion scores

• Semantics can give a lot of information for Sentiment Analysis methods

!

• Objectives

• To provide Sentiment Analysis methods with Semantic information

• To identify more easily and also using semantic information the opinion

�15

Page 22: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Sentilo

�16

Topic

Sub topics

Opinion holder

Opinions

Sentiment scores

“Robert is happy because Silvio Berlusconi finally was

condemned by judges”

http://wit.istc.cnr.it/stlab-tools/sentilo /

RESTful

Page 23: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Knowledge pattern discovery

Page 24: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Bottom-up: schema extraction

�18

Page 25: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Encyclopedic Knowledge Patterns

• 184 Encyclopaedic Knowledge Patterns (EKPs) were discovered by identifying invariances in the structure of Wikipedia page links !

• EKPs are represented as OWL2 ontologies !

• They capture concepts that are typically used by Wikipedia users for describing things of a certain type

�19

Page 26: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

An EKP for OfficeHolder

http://ontologydesignpatterns.org/ekp/�20

Page 27: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

An EKP for OfficeHolder

Formal represenation

http://ontologydesignpatterns.org/ekp/�20

Page 28: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

An EKP for OfficeHolder

Access to data

http://ontologydesignpatterns.org/ekp/�20

Page 29: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

An EKP for OfficeHolder

Textual grounding

From wikipedia.org

http://ontologydesignpatterns.org/ekp/�20

Page 30: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Aemoo

• Aemoo exploits EKPs for • Entity summarisation and Exploratory search

• Distinguishing between core and peculiar knowledge

!

• The data sources are Wikipedia, DBpedia, Twitter, and GoogleNews !

• Aemoo is a KP-aware application • Benefits from KPs for addressing knowledge interaction tasks

• Uses KPs as the basic unit of mean for representing, exchanging, as well as reasoning with knolwedge

�21

Page 31: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Aemoo UI

http://aemoo.org

�22

Page 32: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

Conclusions

�23

• We have provided a practical overview about how to build Linked Open Data !

• We have provided case studies and scenarios for exploiting Linked Data !

• We have shown Linked Data-compliant algorithms and tools

Page 33: Semantic Technologies for Linked Open Data at the STLab · 2019-06-14 · Semantic Technologies for Linked Open Data at the STLab Aldo Gangemi*, Andrea Nuzzolese, Valentina Presutti*,

�24

Thank you!