Operationalizing Linked Open Data - McGill University ·  · 2016-10-24... C. Schwartz3, J....


Linked Datasets as of August 2014

[Figure: LOD cloud diagram of linked datasets, including DBpedia, Bio2RDF, RKB Explorer, GovUK, StatusNet and Muninn World War I nodes; individual node labels omitted.]

Who am I?
Why Linked (Open) Data?
Field notes on vocabularies
Field notes on publishing data
Field notes on working with triples

Operationalizing Linked Open Data

Rob Warren1 with much input from S. Brown2, A. Lemak2, C. Faulkner2, S. Hulan5, C. Schwartz3, J. Schellinck4, D. Evans, M. Farrell5, et al.

1 @muninn_project [email protected] Adjunct, Math and Stats, Carleton Uni.; Muninn Project, Canadian Writing Research Collaboratory
2 [email protected] - Uni. of Guelph and Uni. of Alberta, Canadian Writing Research Collaboratory
3 Hiberdata
4 Sysabee
5 Uni. of Waterloo

Canadian Linked Data Initiative Summit 2016
https://github.com/rwarren2/cldisummit

Warren et al Operationalizing Linked Open Data,. . . 1/31

Who am I?

LOD Cloud 2014 Muninn WW1

Who am I?

CWRC

First ★★★★★ data set on the Canada Open Data Portal

Presentation Outline

1 Who am I?

2 Why Linked (Open) Data?

3 Field notes on vocabularies

4 Field notes on publishing data

5 Field notes on working with triples

The business value of LOD.
Citations! If you can cite it, it exists!
Externalize your costs to someone else.
Document your data’s idiosyncrasies.
Linked Data is just another fad.
It’s already on my website.
People will steal my data.
There are errors in my data.

Observations:
1 There is a bigger market for the individual pieces of your publication than the whole of it.
2 There is a bigger market for your data with people that don’t share your alphabet.

The propeller-head value of LOD.
You have a machine-readable URI to work with.
You can support multiple serializations.
You can still reference something, even if not “Open”.
You can annotate the data to the nth degree.
Easy provenance and tracking of changes.
You get multiple languages and Unicode for free.

Observations:
1 Forces separation between the data and the application.
2 Your use cases for the data are never what people want out of the application.
3 LOD engages with people by engaging their machines.

Vocabularies: Use a standard. (Which one!?)

[Figure: xkcd comic, https://xkcd.com/927/]

Vocabulary use options:
1 Create your own.
2 Use one existing vocabulary.
3 Use multiple existing vocabularies.

The data consumer’s perspective:
Consumers want to know what to expect in vocabularies.
Multiple vocabularies need relationships. (You build them.)
The vast majority of data consumers cannot use ontology reasoning at query time.
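If consumers cannot run a reasoner at query time, one option is to write the relationships between vocabularies into the data before publishing. The sketch below is a minimal illustration of that idea, not the method used by CWRC or Muninn: the example.org property URIs, the mapping table and the sample triple values are all hypothetical.

```python
# Hypothetical mapping from an in-house vocabulary to widely used properties.
# Only the idea of publisher-built relationships comes from the slide.
EQUIVALENT_PROPERTIES = {
    "http://example.org/vocab/birthDate": "http://schema.org/birthDate",
    "http://example.org/vocab/name": "http://xmlns.com/foaf/0.1/name",
}


def materialize_mappings(triples, mappings=EQUIVALENT_PROPERTIES):
    """Return the input triples plus one extra copy per mapped predicate."""
    expanded = set(triples)
    for subject, predicate, obj in triples:
        if predicate in mappings:
            # State the same fact with the external predicate as well, so a
            # plain pattern-matching query finds it without any inference.
            expanded.add((subject, mappings[predicate], obj))
    return expanded


# Illustrative data only.
triples = {
    ("http://example.org/person/1", "http://example.org/vocab/name", "Example Person"),
    ("http://example.org/person/1", "http://example.org/vocab/unit", "Example Unit"),
}
expanded = materialize_mappings(triples)
```

The trade-off is a larger store in exchange for queries that work on any plain SPARQL endpoint.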

Case Studies:

Overview: CWRC (http://www.cwrc.ca/)
Primarily Orlando TEI-style data.
Schema definitions not ontologically sound.
Custom ontology linked to other ontologies.
Questions of ethnicity, race, skin colour alternate between vernacular and technical.

Case Studies: CWRC Entry

[Figure: CWRC entry for Anna LeonOwens]

Outcomes
The ontology is a data explanation tool. Initially (wrongly) seen as a controlled vocabulary.
Much time is being spent on teasing out the intent of the data as written.
The process is very demanding of the scholars.
The CWRC ontology in its final form will have paradoxes. Acceptable because it explains data that was not built within an ontologically rational framework.
This is good enough for partial data exchange.
Massive ancillary linkages to other datasets.

Case Studies:

Overview: Muninn (http://rdf.muninn-project.org/)
Heterogeneous data sources: text, SQL, images, free-form tabular.
Erroneous, ambiguous and incomplete data.
Multiple purpose-built ontologies for specialized applications.
Move to standardized ontologies as they become available. (re: Organization Ontology)
No “single” truth, but you are free to decide for yourself.

Private Peat, by Harold R. Peat
I was sharing a box with a lad whom I heard the fellows call Bob.

“You’re in the right direction-don’t turn round!”

Partial Information
<owl:oneOf rdf:parseType="Collection">
<owl:Thing rdf:about="Bob #1"/>
<owl:Thing rdf:about="Bob #2"/>
<owl:Thing rdf:about="Bob #3"/> ...
</owl:oneOf>
<rel:knowsByReputation rdf:resource="The Mad Major"/>

Private Peat

Attestation Papers
DOB 1893-02-31 - February 31, 1893

Partial Information
<owl:time rdf:about="Birth">
<time:hasDateTimeDescription>
<time:DateTimeDescription ...>
<time:year>1893</time:year>
</time:DateTimeDescription>
</time:hasDateTimeDescription>
<rdf:value>1893-02-31</rdf:value>
</owl:time>

Harry Baird
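The attestation example keeps two things: the part of the date that can be trusted (the year) and the value exactly as written. A minimal Python sketch of that split follows. The function name and the dictionary layout are hypothetical, and treating the year as trustworthy is an assumption mirrored from the slide rather than a general rule.

```python
import datetime
import re


def describe_date(raw):
    """Split a recorded date into a usable part and the value as written."""
    description = {"raw": raw, "year": None, "date": None}
    match = re.fullmatch(r"(\d{4})-(\d{2})-(\d{2})", raw)
    if match is None:
        # Not in YYYY-MM-DD form: keep only the raw value.
        return description
    description["year"] = int(match.group(1))
    try:
        # Raises ValueError for impossible calendar dates such as February 31.
        description["date"] = datetime.date.fromisoformat(raw)
    except ValueError:
        pass  # Keep the year and the raw string; do not invent a day.
    return description


record = describe_date("1893-02-31")
```

Nothing is corrected or discarded here: the record says what the source said, and the consumer decides what to do with it.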

Case Studies Muninn:

British Trench Map Coordinate Translation App

Field notes on Vocabularies: Conclusions
1 The public interacts with Applications not Data, but Data is why we are here.
2 Do not ever design vocabulary for the application.
3 Old data is never clean, sensical or well behaved. The ontology / vocabulary has to say so and work with it.
4 Reuse vocabularies and create new ones on a case by case basis.
5 Great resource at https://lov.okfn.org/dataset/lov

Publishing Linked Data:

Checklist:
Dereferenceable (URIs for everything)?
Content negotiation (The data format is dead.)?
Public-facing SPARQL server?
Machine- and human-readable vocabulary definition?
Machine- and human-readable data set definition?
Production, in-house use of SPARQL on day 1?
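Content negotiation means the same URI can answer with whichever serialization the client asks for in its Accept header. The sketch below shows the selection step only, under stated assumptions: the list of supported formats is hypothetical, and the matching is deliberately simplified (it orders by q value and ignores the finer precedence rules of the HTTP specification).

```python
# Hypothetical list of formats a server can produce, in order of preference.
SUPPORTED = ["text/turtle", "application/rdf+xml", "application/ld+json", "text/html"]


def parse_accept(header):
    """Return (media_type, q) pairs from an Accept header, highest q first."""
    entries = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0  # Default weight when no q parameter is given.
        for field in fields[1:]:
            name, _, value = field.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # Treat an unreadable weight as "not acceptable".
        entries.append((media_type, q))
    # sorted() is stable, so equal weights keep the client's own order.
    return sorted(entries, key=lambda entry: entry[1], reverse=True)


def choose_format(header, supported=SUPPORTED):
    """Pick the first supported media type the client accepts, or None."""
    for media_type, q in parse_accept(header):
        if q <= 0:
            continue
        if media_type in supported:
            return media_type
        if media_type == "*/*":
            return supported[0]
        if media_type.endswith("/*"):
            prefix = media_type[:-1]  # "application/*" -> "application/"
            for candidate in supported:
                if candidate.startswith(prefix):
                    return candidate
    return None
```

A server would typically answer with status 406 when the function returns None, or fall back to a default format.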

Important Note: People write bad programs.
“If builders built buildings the way programmers make programs, the first woodpecker to come along would destroy civilization.” - Gerald Weinberg

Corollary:
Get someone who knows public-facing infrastructure to look things over for you.

SPARQL servers

SPARQL allows for custom retrieval queries over HTTP without having you involved.
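A SPARQL query travels as an ordinary HTTP request, which is why consumers can write their own without the publisher's help. The following sketch builds such a request with the Python standard library. The endpoint URL and the query are hypothetical, and the request is only constructed, not sent, so the example runs offline.

```python
import urllib.parse
import urllib.request

ENDPOINT = "http://example.org/sparql"  # Hypothetical endpoint URL.

# Illustrative query: ten resources with their rdfs:label.
QUERY = """
SELECT ?s ?label WHERE {
  ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label .
} LIMIT 10
"""


def build_request(endpoint, query):
    """Build a GET request carrying the query in the 'query' parameter."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url,
        headers={"Accept": "application/sparql-results+json"},
    )


request = build_request(ENDPOINT, QUERY)
# urllib.request.urlopen(request) would send it to a live endpoint.
```

Because the whole query is in the URL, the request can be cached, logged and rate-limited by standard HTTP infrastructure.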

An important note about SPARQL
Run SPARQL queries through a reverse HTTP proxy: nginx, polipo, etc.

Why?
Offending programmers can be safely ignored.
Allows for light infrastructure abuse (auto-complete queries).
Improves performance without heavy planning.
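The performance point can be illustrated with a toy cache standing in for the proxy. This is a sketch of the caching idea only, not a replacement for nginx or polipo: the class, its parameters and the stand-in backend are all hypothetical.

```python
import time


class QueryCache:
    """Answer repeated, identical queries from memory for ttl seconds."""

    def __init__(self, backend, ttl=60.0, clock=time.monotonic):
        self.backend = backend  # Callable: query string -> result.
        self.ttl = ttl
        self.clock = clock
        self.entries = {}       # Exact query string -> (expiry time, result).

    def fetch(self, query):
        now = self.clock()
        cached = self.entries.get(query)
        if cached is not None and cached[0] > now:
            return cached[1]    # Served without touching the triple store.
        result = self.backend(query)
        self.entries[query] = (now + self.ttl, result)
        return result


calls = []


def slow_backend(query):
    """Stand-in for the SPARQL server; records each query it really runs."""
    calls.append(query)
    return "result for " + query


cache = QueryCache(slow_backend, ttl=60.0)
first = cache.fetch("SELECT * WHERE { ?s ?p ?o } LIMIT 5")
second = cache.fetch("SELECT * WHERE { ?s ?p ?o } LIMIT 5")
```

An auto-complete widget that fires the same query on every keystroke then costs the triple store one execution per distinct query per minute, which is the kind of light abuse the slide has in mind.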

Tracking data in large data stores:
Generate more data as a byproduct of operations: it is easier to delete old triples than to rebuild triples that should have existed.
Tracking provenance of a node is trivial; consider building it into your workflow.
Data and meta-data are merging.
The most awesome use of your data is a use case you have not thought of.
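Building provenance into the workflow can be as small as emitting two extra triples whenever a node is created. The sketch below writes them as N-Triples lines using two properties from the W3C PROV vocabulary. The node and source URIs are hypothetical, and which provenance statements a project records is its own design decision.

```python
import datetime

PROV = "http://www.w3.org/ns/prov#"
XSD = "http://www.w3.org/2001/XMLSchema#"


def provenance_triples(node, source, when):
    """Return N-Triples lines recording where a node came from and when."""
    # Normalize to UTC so timestamps from different machines compare cleanly.
    stamp = when.astimezone(datetime.timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return [
        f"<{node}> <{PROV}wasDerivedFrom> <{source}> .",
        f'<{node}> <{PROV}generatedAtTime> "{stamp}"^^<{XSD}dateTime> .',
    ]


when = datetime.datetime(2016, 10, 24, 12, 0, 0, tzinfo=datetime.timezone.utc)
lines = provenance_triples(
    "http://example.org/node/42",           # Hypothetical node URI.
    "http://example.org/source/scan-0001",  # Hypothetical source URI.
    when,
)
```

These are the cheap byproduct triples the slide refers to: easy to delete later, hard to reconstruct if they were never written.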

Dealing with contentious issues (1/2):

Dealing with contentious issues (2/2):

[Figure: RDF graph. A Trench node (rdf:type geom:geometry, "Military Trench"@en) is linked by skosxl:prefLabel to two TrenchLabel nodes (rdf:type skos-xl:Label) whose skosxl:literalForm values are "Regina Trench"@en and "Staufen Riegel"@de. Two Organization nodes are linked by prov:Attribution, with rdfs:label "General Staff, Geographical Section"@en and "Preußische Landesaufnahme (Deutsch Reich Generalstab)"@de.]

Important ontological note:
The thing and the name of the thing are not the same thing.
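One way to keep the thing and its names apart is to make each name a record of its own, with a language and an attribution, instead of a string attribute on the thing. The sketch below uses the trench names from the previous slide. The class layout and URI are hypothetical, and the pairing of each name with an organization is an assumption read off the order of the labels in the diagram.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Label:
    literal_form: str   # The name as written (compare skosxl:literalForm).
    language: str       # Language tag of the name.
    attributed_to: str  # Organization that used this name.


@dataclass
class Trench:
    uri: str
    labels: list = field(default_factory=list)

    def name_in(self, language):
        """Return the name recorded for a language, or None if there is none."""
        for label in self.labels:
            if label.language == language:
                return label.literal_form
        return None


trench = Trench(
    uri="http://example.org/trench/1",  # Hypothetical URI for the trench itself.
    labels=[
        Label("Regina Trench", "en", "General Staff, Geographical Section"),
        Label("Staufen Riegel", "de", "Preußische Landesaufnahme (Deutsch Reich Generalstab)"),
    ],
)
```

Neither name is privileged: the trench has one identity, and each side's name for it is a separate, attributable statement.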

Getting value out of low-value items:

Print your own Battlefield

Recap:
Linked Open Data is about data, not applications.
The thing and the name of the thing are not the same thing.
The most awesome use of your data is a use case you have not thought of.
Vocabulary use means something.
LOD engages with people by engaging their machines.

Further information
http://www.cwrc.ca/
http://www.muninn-project.org/
https://www.youtube.com/watch?v=aJW16qFkGHU

Questions?
