Post on 14-Apr-2018
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
1/27
Rutherford Appleton Laboratory
SKOSEcoterm 2006
Alistair MilesCCLRC Rutherford Appleton Laboratory
Semantic Web Best Practices and Deployment
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
2/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 2
Reminder: what is it?
Simple Knowledge Organisation System Formal language for representing
controlled structured vocabularies(thesauri, classification schemes, ?)
Subject metadata &informationretrieval this document is about romantic love.
this document is about the cure of tuberculosis by x-
ray in India in the 1950s.
Application ofRDF
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
3/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 3
Since Ecoterm 2005
SKOS Core Guide &SKOS CoreVocabulary Specification First Working Draft May 2005
Second Working Draft October 2005
Minor changes
Quick Guide to Publishing aThesaurus on the Semantic Web First Working Draft May 2005
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
4/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 4
What comes next ?
Life after SWBPD-WG ? Plans for next phase of W3C
Semantic Web Activity
New WG? SKOS W3C Recommendation by end
2007?
N.B. Not yet approved!
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
5/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 5
If Rec then
What is the scope? What is thefundamental design goal?
First part of SKOS Rec would be
requirements specification. Between now and Sept/Oct 2006
define scope and requirements.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
6/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 6
What Id like to do here
Talk about some of the assumptionsbehind SKOS.
Sketch some ideas on how to define
scope and requirements for SKOS. Get your feedback.
public-esw-thes@w3.org
SKOS: Requirements for Standardization
isegserv.itd.rl.ac.uk/public/skos/press/dc2006/paper.pdf
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
7/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 7
Brief history of scope
2003-04: SWAD-Europe ISO 2788 thesauri Non-standard thesauri via extensibility e.g.
GeMET
Classification scheme (PACS)
Multilingual thesauri
Semantic mapping
2004: W3C Glossaries
2005: Discussion re terminologies Subject headings? Gazeteers?
Folksonomies? Taxonomies?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
8/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 8
Assumptions: purpose
Formal representation of controlledstructured vocabularies intended foruse in information retrievalapplications.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
9/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 9
Assumptions: workflow
a) Build a vocabularyb) Build an index
c) Retrieve
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
10/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 10
Assumptions: components
Vocabulary Development Application Something to help build a vocabulary
Indexing Application Something to help build an index
Retrieval Application Something to help retrieve things
SKOS ultimately designed to supportinteroperation of these three keycomponents.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
11/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 11
Proposed scope
SKOS is a formal language forrepresenting controlled structuredvocabularies intended for use withininformation retrieval applications.
SKOS is required to support theinteroperation of these three keycomponents.
I.e. define the requirements for SKOS bydescribing a set offunctionalities thatmust be enabled.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
12/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 12
Other components
Vocabulary mapping ? Metadata registries ?
?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
13/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 13
Component specs
first discuss social andtechnological context, then return tocomponent specs
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
14/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 14
Context
What is the social and technologicalcontext in which controlledstructured vocabs are used?
Assume two basic needs Locate something I already know about. Discover something new.
N.B. a good location service is not
necessarily a good discoveryservice.
Cf. Google and del.icio.us
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
15/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 15
Strategies
Basic strategies for implementingretrieval services
1. Statistical text analysis
2. Analysis of user behaviour3. Index with controlled vocab
Other strategies 1. kos-assisted text analysis?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
16/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 16
Cost problem
Given that applying controlled structuredvocab for retrieval involves significantinitial and ongoing investment
Given that other strategies are cheaper
Huge pressure to drive down cost andincrease utility.
Requirement for seamless integration. I.e. controlled vocab is seldom used in isolation, most
applications will combine strategies.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
17/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 17
Use case
Search portal Use combined strategies.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
18/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 18
Component specs
Important factors
Minimise cost. Decentralisation. Assistance.
Maximise utility. Query expansion. Smart ranking. Maximize lifetime.
Use the Semantic Web! Situation A. search across many collections, where
indexers use same controlled vocab. Situation B. search across many collections, where
indexes use different controlled vocabs.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
19/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 19
Focus areas
Decentralisation requires differentmodels of collaboration and change.
Representing change a key factor tokeeping a vocab applicable.
Ranking and scoring well understoodfor text, less so for controlled index.
Theory of query expansion? Fieldtrials of query expansion?
Strategies for providing assistance?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
20/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 20
Change and collaboration
Continuum of collaboration models:centralized decentralised
Continuum of change managementmodels: continuous discrete
Decentralization can reduce cost ofdevelopment and maintenance
Change management can ensure
continued utility maximize ROI Support for declarative representation of
change a requirement for SKOS.
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
21/27http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 21
Semantic Web architecture
Exploit Semantic Web facility todistribute and merge data.
However, publication of data in theSemantic Web, best practices needwork.
See Best Practice Recipes forPublishing RDF Vocabularies W3CWorking Draft (Google publishingRDF).
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
22/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 22
Semantic Web architecture
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
23/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 23
Direct interaction
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
24/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 24
Information retrieval
Indexing and query evaluation wellunderstood for text content.
Less well understood for controlledmetadata.
Query types?
Query evaluation strategies, e.g.
query expansion? Ranking?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
25/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 25
Assistance for indexers
Provide suggestions Comparison of labels and annotations Machine learning
Exploit lexical resources
?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
26/27
http://www.w3.org/2004/02/skosAlistair Miles, Ecoterm 2006, slide 26
Assistance for mappers
Provide suggestions Analysis of labels and annotations Exploit lexical resources
?
7/29/2019 Simple Knowledge Organisation System-Ecoterm-2006
27/27
htt // 3 /2004/02/ kAli t i Mil E t 2006 lid 27
Summary
SKOS: fundamental requirement to
support information retrieval usingcontrolled structured vocabularies.
Define requirements by describinginformation retrieval functionalities.
Divide functionalities into: Presentation styles Query types e.g. compound queries, coordination Query evaluation strategies
Assumptions: Key components Semantic Web interaction Context pressure to make vocabularies profitable Issues: change, assistance, theory