1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.
-
Upload
phillip-fletcher -
Category
Documents
-
view
220 -
download
1
Transcript of 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.
![Page 1: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/1.jpg)
1
Vocabulary and Terminology Standards
Marjorie M.K.Hlava
President
Access Innovations, Inc.
![Page 2: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/2.jpg)
2
What are they for?
• Accuracy
• Disambiguation
• Recall
• Precision
• Relevance
• Removing Noise
• Increasing Hits
![Page 3: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/3.jpg)
3
Vocabulary Control
• Define Scope of term (meaning)• Equivalence between synonyms
– And Quasi Synonyms– Single concept– Cars / Automobiles
• Distinguish homographs– Mercury – – Planet, metal, myth, car
![Page 4: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/4.jpg)
4
Nice to have
• Equivalence – synonyms - Rogets
• Associative – related terms
• Hierarchies – taxonomy view
• History – development of the term
• Notes – help the user
• Notation – different way to sort
• Display – how to show the terms
![Page 5: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/5.jpg)
5
Where do they come from?
• ANSI / NISO– National Information Standards Organization
• ISO– International Standards Organization
• TC 46• TC 37
• W3C– World Wide Web Consortium
• US OMB• IFLA
![Page 6: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/6.jpg)
6
Effectiveness of Indexing
• Indexing as a means for identifying and retrieving content objects depends upon a well constructed indexing language
• Improves precision by defining the scope of the terms
• Improves recall by allowing different terms for the same concept
![Page 7: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/7.jpg)
7
Vocabulary Control
• Eliminate ambiguity
• Control synonyms – insure a term has only one meaning
• Establish relationships
• Test and validate terms
• Gather the linguistic relationships in one place
![Page 8: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/8.jpg)
8
Facets
• Bottom up approach to organization• Find the parts of the knowledge• Useful in
– Developing areas– Interdisciplinary fields– Multiple hierarchies– Where location of CO is not important– Where several groups have ownership of the
whole…
![Page 9: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/9.jpg)
9
Facets
• Multiple attributes for a single object– Topic
• By composition• By use
– Format– Audience– Intellectual level
• Could be fields, • Could be facets
![Page 10: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/10.jpg)
10
Interoperability
• Same content – different domains• Different vocabulary – same domain• Degree of specificity / granularity• Handling of synonyms• Search methodology
– Z39.50– Pre coordinated– Post coordinated
• Literary warrant established in vocab used• Intended purpose (academic vs lay public)
![Page 11: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/11.jpg)
11
Interoperability
• Merging vocabularies
• Merging databases
• Indexing using single vocab
• Federated searching
• Mapping or crosswalks– Decide on master vocab
• Authority records
![Page 12: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/12.jpg)
12
Z39.19 -2006 – Controlled Vocabularies - What’s new?
• Interoperability• Synonym Rings• Facet Analysis• Taxonomy• Semantic Relationships• Glossary• Web Format• Changes in terminology
![Page 13: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/13.jpg)
13
Other methods - Zeng• Derivation Modeling• Translation and
Adaptation• Satellite controlled
vocabularies• Node or leaf linking
S1
Tree structure of thesaurus
= leaf node
S1S1
Tree structure of thesaurus
= leaf node
S1
S
(Source)
S
S
New
New
<=== N e w ===>S
S
(Source)
S
S
New
New
<=== N e w ===>S
Superstructure
NewNew
Superstructure
NewNew
![Page 14: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/14.jpg)
14
Other methods Section 10.8
• Switching• Linking Through a
Temporary Union List • Linking Through
Controlled Vocabulary Servers
Lin
kin
g s
yste
m
Linked display
Th
es
au
rus
re
po
sit
ory
Te
mp
ora
ry u
nio
n l
istS
(Source)
S
S
normalized
Lin
kin
g s
yste
m
Linked display
Th
es
au
rus
re
po
sit
ory
Te
mp
ora
ry u
nio
n l
istS
(Source)
S
S
normalized
Switch
S(Source)
S
SS
Switch
S(Source)
S
SS
ProtocolServer
SProtocolServer
Application
S
SS
ApplicationProtocolServer
SProtocolServer
Application
S
SS
Application
![Page 15: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/15.jpg)
15
Semantic Network
• Use to cluster terms from many sources• Create an underlying structure for all to map to• Group terms to conceptual scheme• Define types of concepts• Define types of relationships• Generate synonym rings for retrieval
![Page 16: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/16.jpg)
16
Semantic network for clustered terms from a combined
vocabulary – UMLS portion
![Page 17: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/17.jpg)
17
Semantic network showing types of relationships among concepts - UMLS
![Page 18: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/18.jpg)
18
Lexical Database
• Terms from many vocabularies
• Create clusters of concepts
• Allow many kinds of relationships– IsA, HasA, synonyms, hierarchical, etc
![Page 19: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/19.jpg)
19
Multiple senses for the same noun in a lexical database
• bridge, span -- (a structure that allows people or vehicles to cross an obstacle such as a river or canal or railway etc.)
• bridge, bridge circuit -- (a circuit consisting of two branches (4 arms arranged in a diamond configuration) across which a meter is connected)
• bridge -- (something resembling a bridge in form or function; "his letters provided a bridge across the centuries")
• bridge -- (the hard ridge that forms the upper part of the nose; "her glasses left marks on the bridge of her nose")
• bridge -- (any of various card games based on whist for four players)• bridge -- (a wooden support that holds the strings up)• bridge, bridgework -- (a denture anchored to teeth on either side of
missing teeth)• bridge, nosepiece -- (the link between two lenses; rests on nose)• bridge, bridge deck -- (an upper deck where a ship is steered and the
captain stands)
![Page 20: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/20.jpg)
20
ISO TC 46 – SC 6 or 9
• Controlled vocabulary and other information standards• ISO 5127 – Information and Documentation –
Vocabulary• ISO 2788-1986 Guidelines for the establishment and
development of monolingual thesauri = BS 5723:1987• ISO 5964-1985 Guidelines for the establishment and
development of multilingual thesauri = BS 6723:1985• NEW - BSA 8723 - Parts 1 – 4 Stella Dexter Clarke
![Page 21: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/21.jpg)
21
British Standards - BS 8723
• Structured vocabularies for information retrieval – Guide
• Part 1: General• Part 2: Thesauri• Part 3: Vocabularies other than thesauri• Part 4: Interoperability between
vocabularies• Part 5: Interoperability with applications
![Page 22: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/22.jpg)
22
ISO TC 37
Scope of ISO TC 37: Standardization of principles, methods and applications relating to terminology and other language resources.
• TC 37/SC 1 - Principles and methods• TC 37/SC 2 - Terminography and lexicography• TC 37/SC 3 - Computer applications for
terminology• TC 37/SC 4 - Language resource management
![Page 23: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/23.jpg)
23
Sample StandardsPrinciples of concept-oriented terminology and data categories:
ISO 704:2000 Terminology work - Principles and methodsISO 860:1996 Terminology work - Harmonization of concepts and terms ISO 1087-1:2000 Terminology work - Vocabulary - Part 1: Theory and applicationISO 1087-2:2000 Terminology work - Vocabulary - Part 2: Computer applications ISO 10241:1992 Preparation and layout of international terminology standards ISO 12200:1999 Computer applications in terminology - Machine-readable terminology interchange format (MARTIF) - Negotiated interchangeISO 12616:2002 Translation-oriented terminographyISO/TR 12618:1994 Computer aids in terminology - Creation and use of terminological databases and text corpora ISO 12620:1999 Computer applications in terminology - Data categoriesused to create glossaries
![Page 24: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/24.jpg)
24
W3C
• OWL – Web Ontology Language• RDF – Resource Description Format• Topic Maps• SKOS - Simple Knowledge Organization
Systems
• Which community to serve?• Build on the current standard• Might make this link next
![Page 25: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/25.jpg)
25
Other things to watch
• Other W3C and ISO areas
• Support groups– Blogs– Communities of Practice
• SIMILE
• Web 2.0 activities
• WSDL – Web Services Digital Library
![Page 26: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/26.jpg)
26
Other Relevant ISO & W3C Standards
For translation, terminology and applied linguists go to:
http://appling.kent.edu/ResourcePages/LTStandards/Chart/standards.chart.htm#Ontology
•Markup Languages •Metadata Resources •Character Coding •Access Protocols and Interoperability•Content Creation, Manipulation, and Maintenance •Authoring Standards •Text and Content Markup •Translation Standards •Terminology and Lexicography Standards •ISO TC 37 Standards •Terminology Interchange Standards •Controlled Language Standards •Taxonomy and Ontology Standards •Corpus Management Standards •Locale-Related Standards
![Page 27: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/27.jpg)
27
SIMILE
SSemantic emantic IInteroperability of nteroperability of MMetadata and etadata and IInformation in unnformation in unLLike ike EEnvironmentsnvironments
Forming a data reference for open source Forming a data reference for open source taxonomiestaxonomies
![Page 28: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/28.jpg)
28
Web Services – Web 2.0
• Acts as middleware • Allows migration from legacy systems • Uses HTTP remote procedure calls• Allows interaction between distributed, loose coupled
and reusable software components• Openness:
– Machine readable description available – WSDL – Web Services Digital Library– accessible via public (UDDI-) Registries
• Will help integrate “taxonomies into web presentations of data
![Page 29: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/29.jpg)
29
Summary
• Standards for Vocabulary now come from many places. There are 19 main ones to watch
• If you have a current standard compliant thesaurus you can probably implement it in the Web 2.0 arena
• Vocabulary control improves search– Precision and accuracy– Recall and relevance
![Page 30: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/30.jpg)
30
Thank you for your time
Marjorie M.K. HlavaPresident
Access Innovations / Data Harmony505-998-0800
![Page 31: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/31.jpg)
31
: xrefer Research Mapping
: xrefer Research Mapping
![Page 32: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/32.jpg)
32
Data Harmony View - VxInsight
![Page 33: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/33.jpg)
33
Scenario: Web Services based Infrastructure for Digital Libraries
UDDI-Registry
Existing web services
NewsserviceScientificdatabase
OAI based document harvester
XML/SOAP XML/SOAP XML/SOAP
Network-node 1
XM
L/SO
AP
WSDL-descriptions
Network-node 2
XM
L/SO
AP
Network-node n
XM
L/SO
AP
![Page 34: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/34.jpg)
34
DL-Services (logical view)
Service (user‘s perspective)
Types of services
Types of material
Metamodel
Functionality modeland implementation
Service Provision(Web-Service)
![Page 35: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/35.jpg)
35
Framework (logical view)
XML-Repository
Metamodelof Services
Support Services
Service <<facade>>
WSDL-Service Support
Generator
Service-Localization
Dispatcher
Webserver(Cocoon2)
Java-classes (framework application)
Service creation and
assembly Transaction- management
Data processedby Services
![Page 36: 1 Vocabulary and Terminology Standards Marjorie M.K.Hlava President Access Innovations, Inc.](https://reader035.fdocuments.us/reader035/viewer/2022062713/56649ccc5503460f94996666/html5/thumbnails/36.jpg)
36
For services and software
• Contact• Marjorie Hlava• Jay Ven Eman• Access Innovations, Inc• Data Harmony
– Thesaurus Master, MAI, XIS– Auto indexing and content Management
• 505-998-0800• [email protected]