Sunday May 4 – 5 PM Bradford, Hlava, McNaughton. Presenters - Taxonomies Marjorie Hlava Access...
-
Upload
harry-simmons -
Category
Documents
-
view
216 -
download
1
Transcript of Sunday May 4 – 5 PM Bradford, Hlava, McNaughton. Presenters - Taxonomies Marjorie Hlava Access...
Sunday May 4 – 5 PM
Bradford, Hlava, McNaughton
Taxonomies CSE
Presenters - TaxonomiesMarjorie Hlava
Access Innovations
Monica BradfordAmerican Association for the Advancement of ScienceAAAS
Charlotte McNaughtonASCE
What is a Taxonomy? ANSI/ NISO Z39.19-2010
“A collection of controlled vocabulary terms organized into
a hierarchical structure.”
controlled
Missing: equivalence, homographic, and associative relationships and notes
Yes!
Structure Of Controlled Vocabularies
Copyright © 2013 Access Innovations, Inc.
Lists Synonyms Taxonomy Thesaurus Ontology
Ambiguity Ambiguity Ambiguity Specifies a KOS Synonym Synonym Directionality in Hierarchy Hierarchy Relationships relationships
INCREASING COMPLEXITY and CONTROL
Taxonomy? Thesaurus?Main Term (MT) Top Term (TT)Broader Terms (BT)Narrower Terms (NT)Related Terms (RT)
See also (SA)Non-Preferred Term (NP)
Used for (UF), See (S)Scope Note (SN)History (H)
Copyright © 2013 Access Innovations, Inc.
= subject term, heading, node, category, descriptor, class
TAXONOMY
THESAURUSOWL can specify
How Do Terms Relate?
Hierarchical relationships-- Parents and their children
Equivalence relationships
-- Aliases, synonyms
Associative relationships
-- Cousins
TAXONOMY
THESAURUS
Disambiguation
Bridge Structure
Bridge Dentistry
Bridge Game
Bridge Concept
Achieving SynonymyFind like concepts
Merge the terms
Choose a preferred form
Build term record HierarchyEquivalenceAssociative
Taxonomy
Linked DataAssert that the AIP Thesaurus term “Nonlinear optics” refers to the same conceptas the dbpedia page “Nonlinear optics” by putting links in both places.
Content Recommender
Thesaurus terms
Similar content
The more terms in common, the higher the recommendation of content as similar.
7. Content Recommender
More Articles on the same topic
Selected Article Search “thin film sputtering”
Grants available
Upcoming conferences on this topic
Authors working in this space
Journal Profile PagesEach journal can be characterized by the most frequently used indexing terms.
2014 JLAPEN (Journal of Laser Applications) 13 most frequently used indexing terms:
Powders
Manufacturing
Laser materials
Laser applications
Laser industrial applications
Laser beams
Solidification
Cladding
Laser ablation
Laser beam welding
Engineering
Nanoparticles
Materials properties
Image Indexing
Index the image by analyzing the text
associated with the image:
caption
Author Submission/Reviewer Tools
Image: Courtesy AACR and EJPress
Add a box:“SuggestNew terms”
Reports and Research Toolsfor Internal Use
Reports and Research Toolsfor Internal Use
Thesaurus Master
Machine Aided Indexer
(M.A.I.™)
DatabaseRepositor
y
SearchPresentation
Layer
Increasesaccuracy
Browse by SubjectAuto-completionBroader TermsNarrower TermsRelated Terms
Client Taxonomy
Inline Tagging
Metadata and Entity Extractor
Automatic Summarization
Search Softwar
e
Client Data
Full Text
HTML, PDF,
Data Feeds,
etc.
Client taxonomy
The Workflow
19
Tag and Createmetadata
Put in data base with tags
Build Search inverted index
Create user interface
Gather source data
Taxonomy Driven Search Presentation
Navigate the full taxonomy “tree”
BROWSE
Auto-completion using the taxonomy
Guide the user
Copyright © 2005 - Access Innovations, Inc.
Taxonomyview
ThesaurusTerm Record
view
Knowledge Organization Systems
Semantic network
Ontology
Thesaurus
Taxonomy
Controlled vocabulary
Synonym set/ring
Name authority file
Uncontrolled list•Unrelated Entities•Ambiguity
•Linked Entities•Contextual Specificity
•Simple•Low Value
•Complex•High value
Uncontrolled list has the Highest Cost over Time!
Thanks!
Questions after all three speakers
Marjorie M K Hlava, PresidentAccess Innovations, Inc.4725 Indian School Rd, Ste. 100Albuquerque, NM 87110+1-505-998-0800www.accessinn.comwww.dataharmony.comEmail: [email protected]
About Access InnovationsAccess Innovations are experts in content creation, enrichment, and conversion services. We provide services to semantically enrich and tag raw text into highly structured data. We deliver clean, well-formed, metadata-enriched content so our clients can reuse, repurpose, store, and find their knowledge assets. We go beyond the standards to build taxonomies and other data control structures as a solid foundation for your information. Our services and software allow organizations to use and present their information to both internal and external constituents by leveraging search, presentation, and e-commerce. We change search to found!
Quick Facts• Founded in 1978• Headquartered in Albuquerque, NM• Privately held• Delivered more than 2000 engagements
Suggested taxonomy descriptors
Normal text extraction
Near conceptual synonyms
Nonsensical suggestions
Small Taxonomy
Near synonym, conceptual duplicate
Refined presentation
Dependent concepts
Ontological dependencies