Sharon Q. Yang, Rider University, NJ Yan Yi Lee, Wagner College, NYC

39
Is Semantic Web Our Future? Computers in Libraries Conference 2012 March 21-23, 2012 Hilton Washington Washington, DC Sharon Q. Yang, Rider University, NJ Yan Yi Lee, Wagner College, NYC

description

Is Semantic Web Our Future? Computers in Libraries Conference 2012 March 21-23, 2012 Hilton Washington Washington , DC. Sharon Q. Yang, Rider University, NJ Yan Yi Lee, Wagner College, NYC. What is Semantic Web?. A vision by Tim Berners-Lee, Director of World Wide Web Consortium, in - PowerPoint PPT Presentation

Transcript of Sharon Q. Yang, Rider University, NJ Yan Yi Lee, Wagner College, NYC

Page 1: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Is Semantic Web Our Future? Computers in Libraries Conference 2012

March 21-23, 2012Hilton Washington

Washington, DC

Sharon Q. Yang, Rider University, NJ Yan Yi Lee, Wagner College, NYC

Page 2: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

What is Semantic Web?

• A vision by Tim Berners-Lee, Director of World Wide Web Consortium, in late 1999• Web 3.0, Giant Global Graph, Web of linked

data, a web of data• An extension of current Web, not a replacement• “A web of data that can be processed directly and indirectly by machines” –

Tim Berners-Lee

*Photo of Tim Berners-Lee in 2005 from Wikipedia

Page 3: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Three Things to Remember about Semantic Web

1. Machines understand/process data 2. Entity relationships (RDA is also about entity relationships)

– Relationships between humans and things – properties of humans and things (attributes and values)

3. A Web of linked data vs. a Web of linked documents

Page 4: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Demos of Semantic Web Applications

• Hakia at http://www.hakia.com (a semantic Search engine)• Friend of A Friend at http://www.foaf-search.net/• LIBRIS Swedish Union Catalog at

http://libris.kb.se/index.jsp?language=en• Isearch at http://www.isearch.com/?refer=3338• Dbpedia at http://dbpedia.neofonie.de/browse/• Notable Names Database at www.NNDB.com

Page 5: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

A Word May Have Many Meanings…

• I love Boston-Which of the 26 Bostons in the world?• UC Berkeley –People write it in 50 different ways on

the Internet (Metaweb Inc . at http://wiki.freebase.com/wiki/Main_Page)

• A single entity?• A single entity vs. text of different meanings and

spellings• A single entity is a thing, place, person, concept,

object or anything

Page 6: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Uniform Resource Identifier (URI)

• An entity may be represented by URI in Semantic Web• An entity is also called a resource • Examples of URI from LCSH in SKOS

– Example of URI - Shakespeare– Example of URI – 911 Terrorist attacks– Example of URI- Semantic Web

• URI is am important basic building block in linking data

Page 7: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Resource Description Framework (RDF)-Entity Relationship Model

RDF statements are often referred to as “triples” that consist of a subject, predicate, and object, which correspond to a resource (subject), a property (predicate), and a property value (object).

Page 8: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDF Triples

• Subject – an entity (can be a URI)• Predicate -property or attribute (can be a URI)• Object – a property value (can be a URI)• Examples:

– New York-- is place of publication of --Raintree County– Viking Penguin-- is publisher of --Raintree County– 1994-- is date of publication of-- Raintree County (Caren Koyle “Library Data in the

Web World”)

– T-shirt –color-red• Languages: RDF/XMS, N3, Turtle, N-Triples, Json

Page 9: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDF/XML

”The Secret Agent” is written by Joseph Conrad

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:lib="http://www.zvon.org/library">

<rdf:Description about="The Secret Agent"> <lib:creator>Joseph Conrad</lib:creator> </rdf:Description> </rdf:RDF>

Is created by

*RDF Tutorial at http://zvon.org/xxl/RDFTutorial/General/contents.html

http://library.rider.edu/books/TheSecretAgent http://www.nndb.com/JosephConrad

Page 10: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Share and Link Data?

MySQl MS SQL Oracle

Page 11: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Interoperability and Cross Domain Sharing

Shared Base Ontology and

Common Vocabulary

Database 1 Database 2 Database 3

Page 12: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Vocabularies and Ontologies

• Vocabulary - A collection of terms given a well-defined meaning that is consistent across contexts.

• Ontology - Allows you to define contextual relationships behind a defined vocabulary. It is the cornerstone of defining a knowledge domain. (Semantic Modeling Tutorial at www.linkeddatatools.com)

Page 13: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Semantic Web Ontologies

• “An ontology is a formal specification of a shared conceptualization”1

• “the success of the semantic Web depends predominantly on the proliferation of ontologies…” 2

• Different domain has different ontology

• Ontologies are written in Web Ontology Language (OWL) and RDFS (RDF Schema) and others.

1. Tom Gruber at http://www-ksl.stanford.edu/kst/what-is-an-ontology.html2. Kaushal Girl “Role of Ontology in Semantic Web”

Page 14: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Finished Ontologies

Page 15: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Illustration

Name Workplace Interest

Sharon Yang Lawrenceville Linked data

Institution Location Type

Rider US University

Database 1Database 2

Page 16: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

More on Ontology

Place

Workplace

Location

Lawrenceville

United States

RDFS: subClassOf

RDFS: subClassOf

RDF: type

RDF: type

P:subdivisonName

Person

Institution

Has

works for University Is

Has

Page 17: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Power of Inference in Semantic Web

• Inference establishes new relationships• Example – Tom is a cat – Every cat is a mammal (defined in ontology)– Tom is a mammal

Page 18: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Advantages of Joining Semantic Web

•Bibliographical data is now stored in databases and not searchable on the Web. •Silos, invisible Web, dark Web, deep Web, hidden Web•Releasing bibliographical data and displaying it on the Internet•Searching and retrieval by semantic relationships•Shared standards and data models with the rest of the world•Data exchange with other metadata communities

Page 19: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Semantic Web Development in Libraries

• RDA Vocabulary & OMR (JSC/CDMI/ALA) • MulDiCat (IFLA)• Authorities and Vocabularies (LC)• Linked Data Research (OCLC Research Projects)

Page 20: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR

• RDA (Resource Description and Access) - New Cataloging Rules, released in 2009

- US National Libraries RDA Test, July 2010 March 2011 ̴ - Implementation, March 31, 2013

- Based on FRBR, FRAD standards, Entity-Relationship - Build Semantic Web enabled vocabularies• RDA Vocabularies published in OMR (Open Metadata

Registry)

Page 21: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continued

• OMR – Open Metadata Registry - Formerly founded by NSDL (National Science Digital Library) - Currently managed by Metadata Management Associates - Available openly to anyone who wish to use the service - Used by the resource description community - Used by Semantic Web developers - Goals: Metadata discovery Reuse Standardization Interoperability

Page 22: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continued

• RDA Vocabularies published in OMR, 2011 2012 ̴ - DCMI/RDA Task Group - JSC for development of RDA - ALA (Co-Publisher of RDA)• Base Domain: http://rdvocab.info/• Data model: SKOS (Simple Knowledge Organization System)• Stand model: RDF

Page 23: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continued

Page 24: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continue

Page 25: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continue

Page 26: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

RDA Vocabularies and OMR - continue

“The Committee is committed to publishing and maintaining the content of the RDA vocabularies, synchronized with the text of RDA, in order to support their use by the resource description community and by developers of Semantic Web applications” – Alan Danskin, 2011 (Chair of JSC, 2009 – 2011)

“The RDA vocabularies represent many decades of library experience and practice which is now shared with the rest of the world.” - Gordon Dunsire, 2011 (co-Chair of the DCMI/RDA Task Group)

Page 27: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

MulDiCat

• MulDiCat – Multilingual Dictionary of Cataloging Terms and Concepts

- MulDiCat Working Group, IFLA Cataloging Section - Definitions for terms and concepts used by catalogers - In 27 different languages (will be more) - Authoritative translations of IFLA cataloging standards - Store in IFLA namespace as a SKOS file - IFLA Namespaces - iflastandards.info

Page 28: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

MulDiCat - continued

Page 29: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Authorities and Vocabularies

• Library of Congress Authority Data in SKOS• Delivered as Linked-Data• Accessible for both human and machine• Visualization of relationships between concepts and

values• id.loc.gov

Page 30: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Authorities and Vocabularies - continued

Include - LC Subject Headings - LC Name Authority File - LC Children’s Subject Headings - Genre/Form Terms - Thesaurus of Graphic Materials ……

Page 31: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Authorities and Vocabularies - continued

Page 32: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Linked Data Research

• OCLC Linked Data Research - Identify things via URIs

- Improve discovery • Related research projects - FAST (Faceted Application of Subject Terminology)

- VIAF (The Virtual International Authority File)

Page 33: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

FAST

• FAST – Faceted Application of Subject Terminology - An experimental Linked Data service by OCLC - Adapt LCSH with simplified syntax - Easy to understand, apply, and use - Data model: SKOS - FAST Linked Data http://experimental.worldcat.org/fast/

Page 34: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

FAST - continued

Link to LCSH at id.loc.gov

Page 35: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

VIAF

• VIAF – The Virtual International Authority File - International Joint project (LC, German National

Library, National Library of France, etc.) - A single authority service - combined 21 name

authority files from 18 organizations around the world

- Hosted in OCLC http://viaf.org - Plays a role in Semantic Web

Page 36: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

VIAF - continued

Page 37: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Giant Graph (Wikipedia)

Page 38: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Giant Graph (Wikipedia)

Page 39: Sharon Q. Yang, Rider University, NJ  Yan Yi Lee, Wagner College, NYC

Is Semantic Web Our Future? Food for Thought…

Image from http://www.projectappleseed.org/homework.html