Linked Data Integration for Semantic Dialogue and Backend Access
Linked Data Integration and semantic web
-
Upload
diego-pessoa -
Category
Education
-
view
219 -
download
5
description
Transcript of Linked Data Integration and semantic web
Linked DataData Integration andSemantic web
Diego [email protected]
How did we store data?
Data Islands
Limited to the company
DatabasesCentral Access
DistributedFederated
Web
Hypertext (Web 1.0)
Social/Collaborative Content(Web 2.0)
Massive data volumes
Web Data Volume?
Growing at 40% per year
45 ZB ~= 48.318.382.080 TB
It means we have problems?
Searching the web…
Who are the brazilian players (includingw/ dual nationality) in the 2014 worldcup?
Googling…
54.700.000 results?!?!
Just one player information
Let’s try again
81.100.000 results?! (50%+)
WTF?
Let’s try again
And now?!?!
We need data! Machines process data!
How to resolve?APIs? Mashups?
Web Challenges…Increase content
structure
Provide semantics
to data
Establish links
among contents
Publishing of
Standard data
WebEvolution
Rich data
Vocabularies
Semantics
Presenting…
“The Semantic Web is the extension of the World Wide Web that enables people to share content beyond the boundaries of applications and websites. It has been described in rather different ways: as a utopic vision, as a web of data, or merely as a natural paradigm shift in our daily use of the Web. Most
of all, the Semantic Web has inspired and engaged many people to create innovative semantic technologies and
applications.”semanticweb.org
Semantic Web
Unique Identifiers (URI)
Data = Resources
Easy sharing!
Semantic WebBut… How to represent data in the Web?
Example - Traditional way (tuples):
Id Name Former Institution
Birthplace
01 Diego Pessoa UFPB Campina Grande/PB02 Everaldo Netto FAL Palmeiras/PE03 Gabrielle Karine UTFPR Medianeira/PR04 Marcelo Iury UFCG Fortaleza/CE
Student
Semantic WebBut… How to represent data in the Web?
Example - Traditional way (tuples):01 Diego Pessoa UFPB Campina Grande/PB
Former Institution
UFPBFALUFTPRUFCG
1)
2)
We need something more!
We need triples!Subject Predicate ObjectGabrielle Karine Was born in Medianeira/PRDiego Pessoa Studied In UFPB
Campina Grande Is in ParaíbaGabrielle Karine Is friend of Everaldo NettoFAL Is In AlagoasAlagoas Part of Maceió
Extra links:
DBPEDIA
Triples as Graphs
Diego Pessoa
Campina Grande
Paraíba Brazil
Gabrielle Karine
Everaldo Netto
Alagoas
Maceió
Was born in
Is in
Is part of
Is part of
Is in
Is in Is friend of
Combining different sources!
But…How to identify different resources?
Diego Pessoa Diego Pessoa=?CIn IFPB
URI (Uniform Resource Identifiers)Ex.: CPF, ISBN, URL
cin.ufpe.br/~derp diegopessoa.com#about
Web App 1
Web App 2
Web App 3
Web App 4
is same as
Semantic WebStack
And how about Linked Data?
“Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods.”
linkeddata.org
“A term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF.”
wikipedia
Linked Data Principles1. Use URIs as names for
things.
Tim Berners-Lee. Linked Data - Design Issues, 2006. http://www.w3.org/DesignIssues/LinkedData.html. 7, 26, 82
2. Use HTTP URIs, so that people can look up
those names.3. When someone looks up a URI, provide
useful information, using the standards
(RDF, SPARQL).4. Include links to other URIs, so that they can
discover more things
LODCloud
Guidelines to publish linked data1. Right URI CreationAlways HTTP
Avoid technical details (ex.: cin.ufpe.br:8080/~derp/index.php
Keep stable and persistent addresses
Feel free to use unique identifiers. (ex.: #isbn-number, #cpf)
Guidelines to publish linked data2. Use dereferenceable URIs
Hash URI (Ex.:Entity Berlin): http://linkeddata.openlinksw.com/about/Berlin#this
Slash URI (Ex.:Entity Berlin): http://dbpedia.org/resource/Berlin
Guidelines to publish linked data3. RDF Link Creation
Manual or automaticExternal/Internal links
Friend-of-a-Friend (FOAF)
Semantically-Interlinked Online Communities (SIOC)
Simple Knowledge Organization System (SKOS)
Description of a Project (DOAP)
Creative Commons (CC)
Dublin Core (DC)
Guidelines to publish linked data4. Explicit additional ways to access data
Provide SPARQL endpoint
Framework Jena provides endpoints implementations:Joseki and Fuseki
XML JSON
RDF/XML
Turtle
N3 HTML
Guidelines to publish linked data5. Standards to publish linked data
Tools for RDF conversion from CSV, XML, relational data, spreadsheets.
(Ex.: ConvertRDF)
Data load in triple database (RDF Store)
RDF Store publishing: Provide interface to access Linked Data and SPARQL endpoint.
Consuming linked dataBrowsers
Tabulator (Firefox Add-on) Marbles (Web App) (*Fail)
Consuming linked dataBrowsers
Disco HyperData Browser (Web app) And others…
Dipper (inactive)
Piggy Bank (mashups)
URI Burner
LinkSailor
Graphite RDF Browser
Consuming linked dataSearch enginesSindice Watson
Swoogle
Domain Specific Applicationshttp://revyu.com (Review anything) DBPedia Mobile (DBPedia+Revyu+Flicker)
Domain Specific ApplicationsTalis Apire (discover teaching stuff) BBC Music/Programs (links)
Research ChallengesUser Interfaces and Interaction Paradigms
Application Architectures
Schema Mapping and Data Fusion Link Maintenance
Licensing Trust, Quality and Relevance
Privacy
Christian Bizer, Tom Heath and Tim Berners-Lee (2009) Linked Data - The Story So Far. International Journal on Semantic Web and
Information Systems, Vol. 5(3), Pages 1-22. DOI: 10.4018/jswis.2009081901