Computing text semantic relatedness using the contents and links of a hypertext encyclopedia

Computing text semantic relatedness using the contents and links of a

hypertext encyclopedia

Presenter : Bo-Sheng Wang 　Authors : Majid Yazdania,b,*, Andrei Popescu-Belisa

AI, 2013

Outlines

• Motivation• Objectives• Methodology• Empirical analyses• Experiments• Conclusions• Comments

Motivation

• Existing measures of semantic relatedness based on lexical overlap, though widely used, are of little help when text similarity is not based on identical words.

Objectives• Therefore, they will computing text semantic

relatedness based on concepts and their relations, which have linguistic as well as extra-linguistic dimensions, remains a challenge especially in the general domain and/or over noisy

Methodology-build concept network

• Concept– They removed all Wikipedia articles.• (Talk,File, Image, Template, Category, Portal, and List,)

– Disambiguation pages were removed.– They set a cut-off limit of 100 non-stop words.– They extracted the corresponding anchor text

and considered it as another possible secondary title for the linked article.

Methodology

Methodology-build concept network• Relatoins– They focus in the present study on the hyperlinks

and links computed from similarity of content, of category.

– we computed the lexical similarity between articles as the cosine similarity between the vectors derived from the articles’ texts, after stopword removal and stemming using Snowball.

Methodology

Methodology-VP

Methodology-VP to weighted sets of concepts and to texts

Methodology-Approximation

Methodology-Approximation• T–truncated

• ε-truncated

Methodology-Learning embedding

Empirical analyses• Convergence of the T-truncated

Empirical analyses

• Convergence of ε-truncated

Empirical analyses

Experiments

• Average training error

Experiments

• Average training error

Experiments

• Word Similarity

Experiments

• Word Similarity

Experiments

• Document similarity

Experiments

• Document clustering

Experiments

• Comparison of VP and cosine similarity

Experiments

• Text classification

Experiments

Conclusions

Comments

• Advantages

• Disadvantage

• Applications– Text categorization

Computing text semantic relatedness using the contents and links of a hypertext encyclopedia

Documents

Transcript of Computing text semantic relatedness using the contents and links of a hypertext encyclopedia

form-independent meaning representation for eventualitieshomepages.inf.ed.ac.uk/steedman/papers/temporality/21-RobertXTru… · represents relatedness in meaning (although relatedness

World Wide Web (WWW) Hypertext Transfer Protocol (HTTP) E ...dfitzpat/content/CA106/ComponentsOfInter… · Hypertext and HTML • Hypertext – method of presenting information.

Running head: Nature relatedness - curve.carleton.ca

Designing Smart Specialization Policy: relatedness ...

Hypertext system

Making relatedness a treatment goal

Modes of Relatedness in Psychotherapy

Roots and Event Structure I - Linguistic Society of America · 2019-09-24 · Subject-verb relatedness 2.3 2.2 Verb-object relatedness 2.9 3.0 Subject-object relatedness 2.7 2.7 McKoon

Computing Relationships and Relatedness Between ...

php (Hypertext Preprocessor)

A Hypertext-Based Annotation System For Electronic Scholarshipbasseq.com/thesis/finalbinder.pdf · HTML HyperText Markup Language HTTP Hypertext Transfer Protocol ... HTML, CSS, and

Institutional relatedness behind product …mikepeng/documents/Peng17_APJM_SunTan.pdfInstitutional relatedness behind product diversification and international diversification Sunny

Hypertext task

Hypertext Transport Protocolszhou/568/HTTP.pdf · 2010. 4. 2. · Hypertext Transport Protocol . HTTP • Hypertext Transport Protocol ... • URL – Uniform Resource Locator •

Relatedness part 2 - Familias

Hypertext System Hwssn

Indexes as Hypertext

Multinational enterprises, industrial relatedness and ...

Hypertext/ hypermedia

Hernia and Work Relatedness