Traversing Documents by Using Semantic Relationships

34
Traversing Documents by Using Semantic Relationships Presenter: Bilal Gonen

description

Traversing Documents by Using Semantic Relationships. Presenter: Bilal Gonen. Semantic Browser. Semantic browser is a tool that supports browsing and navigation of a document space by utilizing the semantic relationships. Physical Links vs. Virtual Links. affects. co_occurs_with. - PowerPoint PPT Presentation

Transcript of Traversing Documents by Using Semantic Relationships

Page 1: Traversing Documents by Using Semantic Relationships

Traversing Documents by Using Semantic Relationships

Presenter: Bilal Gonen

Page 2: Traversing Documents by Using Semantic Relationships

Semantic Browser

• Semantic browser is a tool that supports browsing and navigation of a document space by utilizing the semantic relationships.

Page 3: Traversing Documents by Using Semantic Relationships

Physical Links vs. Virtual Links

co_occurs_with

analyzesaffects

is_result_of

href

href

href

href

affects

affects

affects

co_occurs_withco_occurs_with

co_occurs_with

co_occurs_with

co_occurs_withis_result_of

is_result_of

is_result_of

is_result_of

Page 4: Traversing Documents by Using Semantic Relationships

A Real Example

How are these articles related?

How do we find other documents related with “melanoma”?

One common option is to use statistical techniques.

Page 5: Traversing Documents by Using Semantic Relationships

Recommendation Systems

Amazon, the best-known commercial recommender system, recommends books to customers based on the statistical similarity between customers' previous purchases.

The product: Digital Camera

Customers who bought this item also often bought

Digital Memory Card

Statistical proximity

Page 6: Traversing Documents by Using Semantic Relationships

A Real Example

Such a statistical technique may return these terms.

sun's harmful raysskinskin cancerlegankleskin pigmentmelaninaneuploidy

There are no named relationships.

Page 7: Traversing Documents by Using Semantic Relationships

Ontology

Pathologic Function

Body Substance

Substance

Diagnostic Procedure

Amino Acid Peptide or

ProteinDisease or Syndrome

Neoplastic Process

affects

Instance level

Schema level

Ureteral Calculi

Kidney Neoplasms

InstanceOf InstanceOf InstanceOf

analyzes

rdfs:subClassOfrdfs:subClassOf

rdfs:subClassOf

Ontology is at the heart of Semantic Web.

Page 8: Traversing Documents by Using Semantic Relationships

Relationships In Ontology

breast cancer bone cancer

non-melanoma melanoma

blood cancerskin cancer

cancers

aneuploidy euploidymonoploidy

chromosomal disorder

is_result_of

Page 9: Traversing Documents by Using Semantic Relationships

A Real Example

Our approach is to offer several relationships to the user.

aneuploidyallelic imbalancechromosome aberrations

This is what user is interested in.

affectsco_occurs_with

occurs_in

is_result_of

Return files which includes “aneuploidy”

Chromosomal AneuploidiesIdentification of AneuploidyDefinition of AneuploidyAneuploidy and Deletions

Name of files in which “aneuploidy” occurs.

Page 10: Traversing Documents by Using Semantic Relationships

JSP(Java Server Page)

Java Script

AJAXuser

Lucene Index for documentsPubMed

dataset

Ontology SemDis API

Lucene indexing is used to index the documents with the 21,945 MESH terms when they occur in the documents.

User Interface(HTML page)

The advantage of the AJAX technology is to send and receive only needed information between the client and server.

request

Built in LSDIS Lab. This API is used to process the triples in the ontology.

Contains 135 classes and 49 relationships in schema level. And 21,945 entity instances in the instance level

Contains 48,252 documents

Because we also used the synonyms of the 21,945 MESH terms, therefore we used ~104,000 terms to index the documents.

Page 11: Traversing Documents by Using Semantic Relationships

AJAX (Asynchronous JavaScript And XML)

Page 12: Traversing Documents by Using Semantic Relationships

AJAX (Asynchronous JavaScript And XML)

Only this part is loaded to the client side.

Page 13: Traversing Documents by Using Semantic Relationships

User Interface(HTML page)

The MESH term is sent to server as a request to get its types from the ontology by using the SemDis API.

Java Script

AJAX

JSP(Java Server Page)

user

Lucene Index for documentsPubMed

dataset

Ontology SemDis API

SemDis API gets the types of the instance term from the ontology.

request

response

keyword

related documents

List of the documents are returned from the Lucene index.

Page 14: Traversing Documents by Using Semantic Relationships
Page 15: Traversing Documents by Using Semantic Relationships
Page 16: Traversing Documents by Using Semantic Relationships
Page 17: Traversing Documents by Using Semantic Relationships
Page 18: Traversing Documents by Using Semantic Relationships
Page 19: Traversing Documents by Using Semantic Relationships
Page 20: Traversing Documents by Using Semantic Relationships
Page 21: Traversing Documents by Using Semantic Relationships
Page 22: Traversing Documents by Using Semantic Relationships
Page 23: Traversing Documents by Using Semantic Relationships
Page 24: Traversing Documents by Using Semantic Relationships
Page 25: Traversing Documents by Using Semantic Relationships
Page 26: Traversing Documents by Using Semantic Relationships
Page 27: Traversing Documents by Using Semantic Relationships
Page 28: Traversing Documents by Using Semantic Relationships
Page 29: Traversing Documents by Using Semantic Relationships
Page 30: Traversing Documents by Using Semantic Relationships
Page 31: Traversing Documents by Using Semantic Relationships
Page 32: Traversing Documents by Using Semantic Relationships
Page 33: Traversing Documents by Using Semantic Relationships
Page 34: Traversing Documents by Using Semantic Relationships

Questions, Comments

Thank you…

Email: [email protected]: www.bilalgonen.com