Automatically Acquiring a Semantic Network of Related Concepts


1

AUTOMATICALLY ACQUIRING A SEMANTIC NETWORK OF RELATED CONCEPTS

Date: 2011/11/14
Source: Sean Szumlanski et al. (CIKM'10)
Advisor: Jia-ling Koh
Speaker: Jiun-Jia Chiou

2

OUTLINE
Introduction
Relational strength
Categorical relatedness
Disambiguate nouns
Evaluation
Conclusion

3

INTRODUCTION

Relationships between noun senses (concepts) in the WordNet ontology constitute a rich taxonomy of semantic similarity.

To understand the role of semantic relatedness, consider, for example, the following sentences:

(1) The astronomer photographed the star.
(2) The paparazzi photographed the star.

4

INTRODUCTION

The semantic network relates not just words, but concepts.

This network could presumably be used as a kernel to infer quantitative relatedness scores, in the same way that WordNet has been used to derive semantic similarity scores between concepts.

5

INTRODUCTION

Motivation: Automatically disambiguate nouns to their appropriate senses (i.e., concepts).

Relatedness between nouns is discovered automatically from co-occurrence in Wikipedia texts.

Goal: Construct a semantic network in which nouns from Wikipedia are linked to their semantically related concepts in the WordNet noun ontology.

Automatically disambiguate nouns in Wikipedia to their corresponding noun senses in WordNet, using:
sense similarity
clustering
high degrees of inter-relatedness

6

THE SEMANTIC NETWORK UNFOLDS IN THREE STAGES:

1. Measure the relational strength between nouns co-occurring in Wikipedia.

2. Use this quantitative measure to make categorical assertions about relatedness between nouns.

3. Disambiguate related nouns automatically, giving rise to a semantic network of related concepts.

7

TERMINOLOGY

Target: Any noun for which we would like to extract relatedness data. Ex: park

Co-Target: Nouns co-occurring with a target. Ex: tree, grass, soil

8

FROM CO-OCCURRENCE TO RELATIONAL STRENGTH

Relational strength:

Srel(t, c) = P(c|t) · log(P(c|t) / P(c)) / DKL(P(·|t) || P(·))

P(c) is the relative frequency of c's occurrence in the corpus.

P(c|t) is the probability of encountering c in a sentence containing t.

9

FROM CO-OCCURRENCE TO RELATIONAL STRENGTH

DKL is the Kullback-Leibler divergence:

DKL(P(·|t) || P(·)) = Σc P(c|t) · log(P(c|t) / P(c))

The ratio P(c|t) / P(c) indicates the direction of association:
> 1: positive correlation
= 1: independent
< 1: negative correlation

10

Corpus (total nouns: 100): c1: 5, c2: 8, c3: 2, c4: 5, c5: 6

P(c1) = 5/100 = 0.05
P(c2) = 8/100 = 0.08
P(c3) = 2/100 = 0.02
P(c4) = 5/100 = 0.05
P(c5) = 6/100 = 0.06

11

Co-targets of the target t in sentences containing t (25 sentences): c1: 2, c2: 4, c3: 1, c4: 3, c5: 5

P(c|t) = (sentences containing both t and c) / (sentences containing t)

P(c1|t) = 2/25 = 0.08
P(c2|t) = 4/25 = 0.16
P(c3|t) = 1/25 = 0.04
P(c4|t) = 3/25 = 0.12
P(c5|t) = 5/25 = 0.2

DKL = 0.08·log(0.08/0.05) + 0.16·log(0.16/0.08) + 0.04·log(0.04/0.02) + 0.12·log(0.12/0.05) + 0.2·log(0.2/0.06)
    = 0.0163 + 0.0482 + 0.012 + 0.0456 + 0.1046 = 0.2267
(logarithms base 10)

12

Srel(t, c1) = 0.0163 / 0.2267 = 0.072
Srel(t, c2) = 0.0482 / 0.2267 = 0.2126
Srel(t, c3) = 0.012 / 0.2267 = 0.053
Srel(t, c4) = 0.0456 / 0.2267 = 0.2011
Srel(t, c5) = 0.1046 / 0.2267 = 0.4614
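The following is a minimal Python sketch (not the authors' code) that reproduces the toy numbers above. The corpus counts and co-target counts come from the slides; the denominator of 25 sentences containing t and the use of base-10 logarithms are assumptions inferred from the printed probabilities.

```python
# Minimal sketch reproducing the toy S_rel example from the slides.
import math

corpus_counts = {"c1": 5, "c2": 8, "c3": 2, "c4": 5, "c5": 6}   # occurrences in the 100-noun corpus
co_counts     = {"c1": 2, "c2": 4, "c3": 1, "c4": 3, "c5": 5}   # occurrences in sentences with target t
TOTAL_NOUNS = 100
SENTENCES_WITH_T = 25   # assumed denominator; it reproduces the P(c|t) values on the slides

p         = {c: n / TOTAL_NOUNS for c, n in corpus_counts.items()}        # P(c)
p_given_t = {c: n / SENTENCES_WITH_T for c, n in co_counts.items()}       # P(c|t)

# D_KL(P(.|t) || P(.)), base-10 logs to match the slides' arithmetic
d_kl = sum(p_given_t[c] * math.log10(p_given_t[c] / p[c]) for c in corpus_counts)

# S_rel(t, c) = P(c|t) * log(P(c|t) / P(c)) / D_KL
s_rel = {c: p_given_t[c] * math.log10(p_given_t[c] / p[c]) / d_kl for c in corpus_counts}

print(round(d_kl, 4))                                  # 0.2267
print({c: round(v, 3) for c, v in s_rel.items()})      # ~0.072, 0.212, 0.053, 0.201, 0.461
# (the slides' last digits differ slightly because they divide rounded intermediate terms)
```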

13

Target's co-targets and their relational strengths:

Target:  c1      c2      c3      c4      c5
Srel:    0.072   0.2126  0.053   0.2011  0.4614

Dividing by DKL normalizes the Srel scores.

14

FROM CO-OCCURRENCE TO RELATIONAL STRENGTH

We are primarily interested in using Srel(t, c) to measure the relatedness of t to c relative to all other co-targets of t, rather than measuring relational strength in a global fashion. For a fixed target t, DKL is a constant across co-targets, so it can be discarded:

Srel(t, c) ∝ P(c|t) · log(P(c|t) / P(c))
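Because DKL(P(·|t) || P(·)) is the same for every co-target of a fixed t, dropping it rescales all of t's scores by one constant and leaves their ranking unchanged. A small illustration with the toy probabilities from the earlier slides (again assuming base-10 logs):

```python
# Dropping D_KL preserves the ranking of the target's co-targets.
import math

p         = {"c1": 0.05, "c2": 0.08, "c3": 0.02, "c4": 0.05, "c5": 0.06}   # P(c)
p_given_t = {"c1": 0.08, "c2": 0.16, "c3": 0.04, "c4": 0.12, "c5": 0.20}   # P(c|t)

# Simplified score with D_KL discarded: P(c|t) * log(P(c|t) / P(c))
score = {c: p_given_t[c] * math.log10(p_given_t[c] / p[c]) for c in p}

# Same order as the normalized S_rel values on the earlier slides
print(sorted(score, key=score.get, reverse=True))   # ['c5', 'c2', 'c4', 'c1', 'c3']
```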

15

FROM CO-OCCURRENCE TO RELATIONAL STRENGTH

This is particularly useful in suppressing words like “article,” which tend to appear frequently with nouns that serve as titles of Wikipedia articles, despite the fact that those nouns are not generally semantically related to “article” at all.

16

FROM RELATIONAL STRENGTH TO CATEGORICAL RELATEDNESS

To find related nouns: the notion of mutual relatedness.

Definition of mx(t), the set of all nouns mutually related to t within x%: if c is in the top x% of t's most strongly related co-targets (sorted by Srel), and t is in the top x% of c's most strongly related co-targets, we say that t and c are mutually related within x%.

17

FROM RELATIONAL STRENGTH TO CATEGORICAL RELATEDNESS

Process (find related nouns):

1) To find the nouns categorically related to a target, t, we let x = 20 and find the initial set, mx(t).

2) Then expand this set by incrementing x until 5 iterations pass without t being related to any additional co-targets (a sketch of this procedure follows below).
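Below is a minimal sketch, not the authors' implementation, of the mutual-relatedness procedure just described; srel is a hypothetical nested dictionary mapping each noun to its co-targets' relational-strength scores.

```python
# Minimal sketch of mutual relatedness within x% and the expansion procedure.
# srel[noun][co_target] -> relational strength S_rel(noun, co_target).

def top_x_percent(srel, noun, x):
    """Co-targets in the top x% of noun's co-targets, ranked by S_rel (descending)."""
    scores = srel.get(noun, {})
    ranked = sorted(scores, key=scores.get, reverse=True)
    k = max(1, int(len(ranked) * x / 100))
    return set(ranked[:k])

def mutually_related(srel, t, x):
    """m_x(t): co-targets c with c in t's top x% and t in c's top x%."""
    return {c for c in top_x_percent(srel, t, x) if t in top_x_percent(srel, c, x)}

def related_nouns(srel, t, x_start=20, patience=5):
    """Start with m_20(t), then increment x until `patience` steps add no new co-targets."""
    related = mutually_related(srel, t, x_start)
    x, stale = x_start, 0
    while stale < patience:
        x += 1
        new = mutually_related(srel, t, x) - related
        if new:
            related |= new
            stale = 0
        else:
            stale += 1
    return related
```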

18

19

THE METHOD EXHIBITS IMPORTANT PROPERTIES:

This gradation makes it impossible even for human judges to find a clear cutoff.

The stringent (mutual) requirement causes us to miss some related noun pairs.

Ex: "penguin" and "iceberg"; "penguin" and "ice" (the relation must hold both from "penguin" to "ice" and from "ice" to "penguin")

20

FROM NOUNS TO CONCEPTS

Disambiguate the nouns (3 methods):
1. Subsumption Method
2. Gloss Method
3. Selectional Preference Method, using the selectional association A(t, c):

C is the set of concepts in WordNet denoted by the monosemous nouns that are related to t.
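The formula for A(t, c) did not survive in this transcript. The sketch below assumes a Resnik-style selectional association, in which each concept's share of P(c|t) · log(P(c|t) / P(c)) is normalized by the total over all concepts in C; this is an assumption, not necessarily the paper's exact formulation.

```python
# Hedged sketch: Resnik-style selectional association A(t, c); assumed, not the paper's exact definition.
import math

def selectional_association(p_c_given_t, p_c):
    """A(t, c) = P(c|t) * log(P(c|t) / P(c)), normalized to sum to 1 over the concepts in C.

    p_c_given_t: dict mapping each concept c in C to P(c|t), estimated from the
                 monosemous nouns related to t (hypothetical input)
    p_c:         dict mapping each concept c to its prior probability P(c) over WordNet
    """
    raw = {c: p_c_given_t[c] * math.log(p_c_given_t[c] / p_c[c]) for c in p_c_given_t}
    total = sum(raw.values())
    return {c: v / total for c, v in raw.items()}
```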

21

22

Summary of Statistics for the Semantic Network of Related Nouns

Judges' Evaluations of Accuracy on Related and Unrelated Noun Pairs

23

Summary of Statistics for the Semantic Network of Related Concepts

The judges were asked to grade the relation of each sense to its monosemous target, using the following scale:

(4) Primary intended sense or one of its synonyms.
(3) Strongly related sense, but not the primary intended meaning.
(2) Weakly related sense; could reasonably be included in or excluded from relation to the target.
(1) Unrelated sense.

24

DISCUSSION

25

26

CONCLUSION

There are several potential applications for this resource, including semantic interpretation and noun sense disambiguation in multimedia content delivery systems.

In future work, they expect to continue expanding and refining the semantic network.

They also intend to examine the feasibility of applying their algorithm to these targets and using the existing semantic network to guide the process; the current approach is more error prone with nouns that occur infrequently in the corpus and does not yet resolve the ambiguity of polysemous-to-polysemous noun relations.

27

Thank you for listening!