A Survey on Unsupervised Graph-based Word Sense Disambiguation

16

Click here to load reader

description

Presents comparative evaluations of graph based word sense disambiguation techniques using several measures of word semantic similarity and several ranking algorithms. Unsupervised word sense disambiguation has received a lot of attention lately because of it's fast execution time and it's ability to make the most of a small input corpus. Recent state of the art graph based systems have tried to close the gap between the supervised and the unsupervised approaches.

Transcript of A Survey on Unsupervised Graph-based Word Sense Disambiguation

Page 1: A Survey on Unsupervised Graph-based Word Sense Disambiguation

A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana [email protected]

UAIC, Iasi

Page 2: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 2

Plan

1.Introduction2.State of the Art3.Experiments and Results4.Conclusions5.References

Page 3: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 3

Introduction● WSD = assign automatically the most

appropriate meaning to a polysemous word within a given context (Sinha et al, 2007)

● Use Cases:● Machine translation● Speech processing● Boosting the performance of tasks like text retrieval, document

classification and document clustering

Page 4: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 4

State of the Art● Supervised WSD vs Unsupervised WSD● GWSD and Semantic Graph Construction● SAN Method● Page-Rank Method● HITS Method● P-Rank Method

Page 5: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 5

Supervised WSD vs Unsupervised WSD ● Most approaches transform

the sense of the word into a feature vector

● Low execution time ● Accuracy of 60%-70%● Major disadvantage:

knowledge aquisition bottleneck (accuracy connected to the amount of manually anotated data)

● Identify the best sense candidate for a model of the word sense dependency in text

● Ranking algorithm to choose their most likely combination

● Window, graph based representation of the model

● Fast execution time● Accuracy of 40%-60%

Page 6: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 6

Graph-based WSD● GWSD = graph representation used to model

word sense dependencies in text (WSD with graphs, not just word window)

● Goal: identify the most probable sense (label) for each word

● Advantage: takes into account information drawn from the entire graph

Page 7: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 7

Semantic Graph Construction (I)● Example (Sinha et al, 2007)

Page 8: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 8

Semantic Graph Construction (II)● Example (Tsatsaronis et al, 2010)

Page 9: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 9

The Page-Rank Method (Brin and Page, 1998)

● Ranking algorithm based on the idea of voting: when one node links to another it offers a vote to that other node

● The higher the number of votes for a note, the higher the importance of the node

● Recursively score the candidate nodes for a weighted undirected graph

Page 10: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 10

The P-Rank Method (Zao et al, 2009)

● Check the structural similarity of nodes in an information network

● Based on the idea that two nodes are similar if they reference and also reference similar nodes

● Represents a generalization of other state of the art measures like CoCitation, Coupling, Amsler, SimLink

Page 11: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 11

The HITS Method (Kleinberg,1999)● Identify authorities = the most important nodes

in the graph● Identify hubs = the nodes which point to

authorities ● The sense with the highest authority is chosen

as the most likely one for each word● Major disadvantage: densely connected nodes

can attract the highest score (clique attack)

Page 12: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 12

Experiments and Results (I)● Senseval 2 and 3 data sets often used for testing● Occurencies for Senseval 2 using WordNet 2

● Occurencies for Senseval 3 using WordNet 2

Page 13: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 13

Experiments and Results (II)● Accuracies on the Senseval 2 and 3 English All

Words Task data sets (Tsatsaronis et al)

Page 14: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 14

Conclusions● Recent systems minimise the gap between supervised

and unsupervised approaches.● The graph-based methods make the most of the rich

semantic model they employ.● Unsupervised approaches seek the optimal value for

the parameters using as little training data as possible and testing on as large a dataset as possible.

● Future work: implement P-Rank using a different representation, for example Sinha et al.

Page 15: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 15

References1. Tsatsaronis, G., Varlamis, I., Norvag, K. : An Experimental

Study on Unsupervised Graph-based Word Sense Disambiguation. In Proc. of CICLing (2010).

2. Sinha, R., Mihalcea, R. :Unsupervised graph-based word sense disambiguation using measures of semantic similarity. In Proc. of ICSC (2007).

3. Mihalcea, R., Csomai, A. : Senselearner: Word sense disambiguation for all words in unrestricted text. In Proc. of ACL, pages 53-56 (2005).

4. Tsatsaronis, G., Vazirgiannis, M., Androutsopoulos, I. :Word Sense Disambiguation with Spreading Activation Networks Generated from Thesauri. In Proc. of IJCAI (2007).

Page 16: A Survey on Unsupervised Graph-based Word Sense Disambiguation

Elena-Oana Tabaranu 16

Questions?