30.10.06 Krzysztof Janowicz: SIM-DL - Towards a Semantic Similarity Measurement Theory for the Description Logic ALCNR in Geographic Information Retrieval. Krzysztof Janowicz, Institute for Geoinformatics, University of Münster

30.10.06 Krzysztof Janowicz

SIM-DL - Towards a Semantic Similarity Measurement Theory for the Description Logic ALCNR in Geographic Information Retrieval

Krzysztof Janowicz
Institute for Geoinformatics, University of Münster

Outline

• Motivation: Yet Another Similarity Theory?

• Similarity & Subsumption based IR

• Matching Scenario

• SIM-DL Framework

• Human Subject Testing

• Results, Conclusions & Outlook

Yet Another Similarity Theory?

Available ontologies (DL!)

Available theories

Measured between (re)representation!

http://flickr.com/photos/genista/25390358/

Similarity & Subsumption based Retrieval

Similarity vs. Subsumption

• Subsumption-based Retrieval
  (+) Results fit user’s requirements (subconcepts!)
  (-) Too generic / too specific result set
  (-) Artificial search concept

• Similarity-based Retrieval
  (+) Search concept = searched concept
  (-) Results do not necessarily fit user’s requirements

Matching-Scenario

• Accommodation web portal

• External services (SOA)

• Use shared base vocabulary

• Local interface and terminology

• Hotel, Houseboat, Youth Hostel, Botel, …

Task: Integrate Amsterdam-Accommodation Service

Where to put botels?

Houseboats, Hotel & Botel

Some Impressions

Pictures received by email or taken from Wikipedia and http://www.hotels.nl/amsterdam/botel/

SIM-DL: Representation (ALCNR)

SIM-DL: Framework

1. Specify search concept and context

2. Rephrase concepts to canonical NF

3. Generate alignment matrix

4. Apply sim-functions for selected combinations

5. Derive normalized overall similarity

SIM-DL: Search Concept & Context

(-) Results do not necessarily fit user’s requirements

Define Context

C_lcs ≡ Housing

C_s ≡ Botel
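
One possible reading of the context definition: the context concept (Housing) restricts the comparison to the concepts it subsumes, and the search concept (Botel) is then compared with each of them. The sketch below illustrates that reading only. The taxonomy, the `PARENT` mapping and both function names are hypothetical and not taken from the talk.

```python
from collections import defaultdict

# Hypothetical toy taxonomy: child -> parent (illustrative only).
PARENT = {
    "Housing": "Thing",
    "Vehicle": "Thing",
    "Hotel": "Housing",
    "Houseboat": "Housing",
    "YouthHostel": "Housing",
    "Botel": "Housing",
    "Boat": "Vehicle",
}


def subconcepts(context):
    """Return all concepts subsumed by `context`, excluding the context itself."""
    children = defaultdict(list)
    for child, parent in PARENT.items():
        children[parent].append(child)
    found, stack = [], [context]
    while stack:
        for child in children[stack.pop()]:
            found.append(child)
            stack.append(child)
    return sorted(found)


def comparison_targets(search_concept, context):
    """Concepts inside the context that the search concept is compared with."""
    return [c for c in subconcepts(context) if c != search_concept]


targets = comparison_targets("Botel", "Housing")
print(targets)
```

Concepts outside the context, such as `Boat` in this toy data, never reach the similarity computation, which is how the context addresses the "results do not fit the requirements" drawback under this reading.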

Rephrase Concepts to Canonical NF

• ALCNR Normal Form:

+ Rewriting rules (e.g. ∀R.⊥ ≡ (≤ 0 R))

+ Minimal set of descriptions (concepts)

Canonical Normal Form
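
A rewriting step of this kind can be sketched on a small concept tree. The example assumes the rule on the slide reads ∀R.⊥ ≡ (≤ 0 R), which holds in ALCNR because a value restriction to the empty concept allows no R-successors. The tuple encoding and the function name `rewrite` are hypothetical, and dropping duplicate conjuncts is only a simplified stand-in for the "minimal set of descriptions".

```python
# Concepts as nested tuples (a hypothetical encoding for illustration):
#   ("atom", "Hotel"), ("bottom",), ("and", c1, c2, ...),
#   ("forall", role, concept), ("atmost", n, role)


def rewrite(concept):
    """Recursively rewrite (forall R bottom) into (atmost 0 R)."""
    tag = concept[0]
    if tag == "forall":
        _, role, filler = concept
        filler = rewrite(filler)
        if filler == ("bottom",):
            return ("atmost", 0, role)
        return ("forall", role, filler)
    if tag == "and":
        # Rewrite each conjunct, then drop duplicates (simplified minimality).
        parts = []
        for part in concept[1:]:
            part = rewrite(part)
            if part not in parts:
                parts.append(part)
        return ("and", *parts)
    return concept


description = (
    "and",
    ("atom", "Housing"),
    ("forall", "hasEngine", ("bottom",)),
    ("atmost", 0, "hasEngine"),
)
normalized = rewrite(description)
print(normalized)
```

After rewriting, the two syntactically different but equivalent conjuncts collapse into one, so later comparison steps see a single description for the same restriction.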

Generate Alignment Matrix

• Cartesian Product C_s × C_t

        C_si   C_sj   C_sk
C_ti    N      …      …
C_tj    …      …      H
C_tk    …      CO     …
C_tl    …      …      …

H = Hierarchies, N = Neighborhoods, CO = Co-Occurrence

H > N > CO
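
The matrix can be sketched as a labelled Cartesian product. The cell labels below mirror the slide. The selection rule (per source part, keep the partner with the highest-priority label under H > N > CO) is an assumption, since the slide gives only the ordering, and the function names are hypothetical.

```python
from itertools import product

PRIORITY = {"H": 3, "N": 2, "CO": 1}  # H > N > CO, as on the slide


def alignment_matrix(source_parts, target_parts, relation):
    """Label every (source, target) pair of the Cartesian product.

    `relation` maps a pair to "H", "N" or "CO"; missing pairs get None.
    """
    return {pair: relation.get(pair) for pair in product(source_parts, target_parts)}


def select_pairs(matrix):
    """For each source part keep the target with the highest-priority label."""
    best = {}
    for (src, tgt), label in matrix.items():
        if label is None:
            continue
        if src not in best or PRIORITY[label] > PRIORITY[best[src][1]]:
            best[src] = (tgt, label)
    return best


source = ["C_si", "C_sj", "C_sk"]
target = ["C_ti", "C_tj", "C_tk", "C_tl"]
known = {("C_si", "C_ti"): "N", ("C_sk", "C_tj"): "H", ("C_sj", "C_tk"): "CO"}

matrix = alignment_matrix(source, target, known)
selected = select_pairs(matrix)
print(selected)
```

Only the selected combinations are passed on to the similarity functions, which keeps the number of comparisons well below the full size of the product.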

Apply Similarity-Functions (for selected combinations)

• Individual similarity functions for each DL language constructor:
  {union, intersection, role-intersection, existential quantification, value restriction, cardinality}

• For Hierarchies, Neighborhoods, Co-Occurrence:
  edge_distance / max_distance
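
The ratio of edge distance to maximum distance can be turned into a similarity value. The sketch below assumes the form 1 - edge_distance / max_distance, which is a guess because the slide shows only the ratio. It also assumes that edge_distance is the shortest path between two nodes and max_distance the longest shortest path in the graph. The graph `EDGES` and all function names are hypothetical.

```python
from collections import deque

# Hypothetical hierarchy as undirected edges (illustrative only).
EDGES = [("Housing", "Hotel"), ("Housing", "Houseboat"),
         ("Hotel", "Botel"), ("Hotel", "YouthHostel")]


def neighbours(edges):
    """Build an adjacency map from a list of undirected edges."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def distances_from(graph, start):
    """Breadth-first search: number of edges from `start` to every reachable node."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def network_similarity(edges, a, b):
    """Assumed form: 1 - edge_distance / max_distance (connected graph required)."""
    graph = neighbours(edges)
    max_distance = max(max(distances_from(graph, n).values()) for n in graph)
    edge_distance = distances_from(graph, a)[b]
    return 1 - edge_distance / max_distance


print(network_similarity(EDGES, "Botel", "Hotel"))
print(network_similarity(EDGES, "Botel", "Houseboat"))
```

The same function works for a role hierarchy or a conceptual neighborhood graph, as long as the graph is connected and has at least one edge.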

Amalgamated & Normalized Overall Similarity

• Union-Constructor:
  Weighted sum of similarities on CNF union level
  Weightings derived from A-Box, T-Box or A&T-Box

• Intersection-Constructor:
  Sum of similarities on CNF intersection level
  Normalization to [0,1]
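
The two aggregation levels can be sketched as follows. Dividing the intersection-level sum by the number of summands is an assumed way to normalize to [0,1], and the weights are hypothetical (for instance, instance counts taken from an A-Box). Neither detail is stated on the slide.

```python
def intersection_similarity(part_similarities):
    """Sum of part similarities, normalized to [0, 1] by the number of parts (assumed)."""
    if not part_similarities:
        return 0.0
    return sum(part_similarities) / len(part_similarities)


def union_similarity(branch_similarities, weights):
    """Weighted sum over union branches; weights are rescaled to sum to 1."""
    if len(branch_similarities) != len(weights):
        raise ValueError("one weight per branch is required")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return sum(s * w / total for s, w in zip(branch_similarities, weights))


# Two union branches, each an intersection of compared parts (hypothetical values).
branch_a = intersection_similarity([1.0, 0.5, 0.0])
branch_b = intersection_similarity([1.0, 1.0])
overall = union_similarity([branch_a, branch_b], weights=[3, 1])
print(overall)
```

Because every part similarity lies in [0,1] and the rescaled weights sum to 1, the overall value also stays in [0,1].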

Human Subject Testing: Roles & Fillers

Auto. weighted average (>)

Multiplicative approach (<)

User input

Disjoint from watercourse

Meets river
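
The two ways of combining a role similarity with a filler similarity can be compared arithmetically. The sketch below does not reproduce the test results. It only shows that, for values in [0,1], the product never exceeds a weighted average of the same two values. The input values and function names are hypothetical.

```python
def weighted_average(role_sim, filler_sim, role_weight=0.5):
    """Convex combination of role and filler similarity."""
    return role_weight * role_sim + (1 - role_weight) * filler_sim


def multiplicative(role_sim, filler_sim):
    """Product of role and filler similarity."""
    return role_sim * filler_sim


# Hypothetical values, e.g. a role pair such as "disjoint from" vs. "meets"
# and a filler pair such as "watercourse" vs. "river".
role_sim, filler_sim = 0.4, 0.8
avg = weighted_average(role_sim, filler_sim)
prod = multiplicative(role_sim, filler_sim)
print(avg, prod)
```

The gap between the two results grows as either similarity moves away from 1, so the choice of approach matters most for pairs that are only moderately similar.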

Results, Conclusion & Outlook

• SIM-DL combines subsumption and similarity

• Adapts results from psychology & computer science → Cognitive Engineering ;-)

• Only basic model of Alignment and Context

• More Human Subject Tests needed

• More expressive DL

• Usability?

ALCRP(D)   ALCNR   Near?

Questions? Thanks for your attention!

Visit www.similarity-blog.de for related literature.

From: http://www.jobblog.ch/sommer-250