Describing Linked Datasets

Post on 02-Jan-2017

222 views 2 download

Transcript of Describing Linked Datasets

DescribingLinkedDatasetsOntheDesignandUsageofvoiD,

the‘VocabularyOfInterlinkedDatasets’LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain

KeithAlexander(Talis),RichardCyganiak(DERI),MichaelHausenblas(DERI)andJunZhao(UniversityofOxford)

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 2

Agenda

•  TheProblem•  OurProposal–voiD•  ApplicaNons•  NextSteps

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 3

TheProblem

2008

2007

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 4

TheProblem

2009

2008

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 5

TheProblem

•  TheLinkingOpenData(LOD)cloudgatherscurrentlyroughlythesamemomentumastheWebintheearly1990s

•  Howdidpeopledealwiththeconsequencesofhavingadecentralizedsystem,backthen?

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 6

TheProblem

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 7

TheProblem

•  From2007on,wehavebeendoingitintheYahoo!‐catalog‐style:manuallycollec>ngandrepresen>ngdataabouttheLinkingOpenDatacloud:–  IntheLODclouddiagram,wegiveaqualitaNveviewinformofavisualgraph

–  InvariousESWWikipageswecreateHTMLtables:•  h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/StaNsNcs

•  h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStaNsNcs

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 8

TheProblem

h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/StaNsNcs

h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStaNsNcs

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 9

TheProblem

•  Currently,onlyhumancomprehensibledescrip>ons(theLODcloud,Wikipages)available

•  Wecan’tautomatetasks,suchas– Efficient&effecNvesearch

– SelecNonofdataset(forapps,interlinkingtargets)– GeneraNonofmaps,etc.

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 10

TheProblem

•  Wecan’tapplyourtoolsandmethodswehaveexperienceswith,suchaseditors,engines,stores,etc.

•  Evenworse,itdoesn’tscale– We’dneedaGoogle‐styleapproachthatscaleslikehellandispowerfulenoughtoenabletheabovemenNoned

– ProvidingmetadataabouttheLODcloudinamachine‐comprehensibleway

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 11

Agenda

 TheProblem•  OurProposal–voiD•  ApplicaNons•  NextSteps

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 12

OurProposal‐voiD

•  SoluNon:providingaformaldescripNonof– Whatadatasetisabout(topic,technicaldetails)

– HowandunderwhichcondiNonstoaccessit– Howthedatasetisinterlinkedwithotherdatasets•  QualitaNvelevel:typeofinterlinking•  QuanNtaNvelevel:numberoflinks,resources,etc.

– Howtodiscoverthemetadata

•  voiD,the“VocabularyofInterlinkedDatasets”providespreciselythis

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 13

OurProposal‐voiD

•  AdatasetisasetofRDFtriplesthatarepublished,maintainedoraggregatedbyasingleprovider.

•  Adatasetisauthorita>vewithrespecttoacertainURInamespaceifitcontainsinformaNonaboutresourcesnamedbyURIsinthisnamespace,andispublishedbytheURIowner(URIownershipasoftheAWWW1)

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 14

OurProposal‐voiD

•  AlinksetLSisasetofRDFtripleswhereforalltriplesti=⟨si,pi,oi⟩∈LS,thesubjectisinonedataset,i.e.allsiaredescribedinDS1,andtheobjectisinanotherdataset,i.e.alloiaredescribedinDS2.

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 15

OurProposal‐voiD

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 16

OurProposal‐voiD

voiDofferstwoorthogonalinterlinkingtypes:•  classicLODvs.3rd‐party,differinginwheretheinterlinkingstatementsare

kept.Inthefirstcasetheinterlinkingtriples,i.e.alinkset,arehostedinoneofthetwoinvolveddatasets,whileinthela`ercasethereisathirddatasetinvolvedthatcontainstheinterlinkingtriples,i.e.thelinkset;

•  non‐directedvs.directed,whichaddressestheissueifsomeoneisinterestedinstaNngthedirecNonoftheinterlinkingornot(forexamplewithowl:sameAs)

classicLOD,non‐directed

3rd‐party,non‐directed

classicLOD,directed

3rd‐party,directed

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 17

OurProposal‐voiD

classicLOD,non‐directed

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 18

classicLOD,directed

OurProposal‐voiD

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 19

3rd‐party,non‐directed

OurProposal‐voiD

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 20

3rd‐party,directed

OurProposal‐voiD

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 21

OurProposal‐voiD

•  Reusingtermsfromothervocabularies–  foaf:homepage/IFP

– dcterms:subjectalongwithDBpediaURIsh`p://dbpedia.org/resource/XXX

– SCOVOforstaNsNcsabouttriples,links,etc

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 22

OurProposal‐voiD

•  PublicaNon&discoveryviasitemapsand/orbacklinks(dcterms:isPartOf)

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 23

OurProposal‐voiD

•  OncedatasetprovidershavepublishedtheirvoiDdescripNoninRDFalongwiththeirdataset,onecanaddressthefollowingissues:– Howtofindsomedatasets?

– Howtoefficientlyfindaspecificdataset?– Howtoeffec>velyfinddatasets?– Howtodynamicallyselectdatasets?– Howtoselectdatasetsbasedoncertainpreferences?

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 24

Agenda

 TheProblem OurProposal–voiD•  ApplicaNons•  NextSteps

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 25

Applica>ons

•  GeneraNon(ve,lipSSM,NXparser)•  VocabularyManagement(Talis)

•  Explorer(RKB,LDE)•  QueryFederaNon(Clarck‐Parsia,OpenLink)•  Datasetranking(DING!talk)

•  PotenNalApplicaNons– Mapofdata(Sindice)– DynamicMeshupsforApplicaNon

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 26

Applica>ons

h`p://ld2sd.deri.org/ve

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 27

Applica>ons

h`p://ld2sd.deri.org/lde

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 28

Applica>ons

h`p://db

lp.rkbexplorer.com

/mod

els/void.`l

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 29

Applica>ons

h`p://linkeddata.uriburner.com/

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 30

Agenda

 TheProblem OurProposal–voiD ApplicaNons•  NextSteps

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 31

NextSteps

•  voiD2.0seeissuesath`p://code.google.com/p/void‐impl/issues/list

•  staNsNcsmodule(fix/extendreSCOVO)

•  SPARQLendpoints•  provenance,trust(?)•  AssistpeopleinpublishingvoiD