Dive exploring history presentation

DIVE INTO THE EVENT-BASED BROWSING OF LINKED HISTORICAL MEDIA

VICTOR DE BOER, JOHAN OOMEN, OANA INEL, LORA AROYO, ELCO VAN STAVEREN, WERNER HELMICH AND DENNIS DE BEURS

EXPLORING HISTORICAL SOURCES WITH LANGUAGE TECHNOLOGY: RESULTS AND PERSPECTIVES -- 8-9 DEC 2014

Clarin - Verrijkt Koninkrijk

National-Socialist: 29%
Social-Democrat: 21%
Protestant: 13%
Liberal: 12%
R-Catholic: 12%
Communist: 8%
Jewish: 5%

Back-of-the-Book index

Named Entities

1. Dr. Loe de Jong’s seminal work on Dutch life in WW2: scanned, OCR’ed, analyzed

2. Enriched through links with external datasets (Semantic Web)

Clarin - Dutch Ships and Sailors

Jur Leinenga: Monsterrollen Noordelijke provincies
Matthias van Rossum: Generale Zeemonsterrollen VOC
KB Delpher
Dutch-Asiatic Shipping (Huygens ING)
VOC Opvarenden (DANS Easy)

AGORA PROJECT

Slide: Lora Aroyo

DIGITAL HUMANITIES RESEARCHERS

Media researcher Lars Arve Røssland of the University of Bergen. (Photo: Andreas R. Graven)

EXPLORATIVE SEARCH

Erp, M. van; Oomen, J.; Segers, R.; Akker, C. van de; Aroyo, L.; Jacobs, G.; Legêne, S.; Meij, L. van der; Ossenbruggen, J.R. van; Schreiber, G. Automatic Heritage Metadata Enrichment with Historic Events. Museums and the Web 2011. http://www.museumsandtheweb.com/mw2011/papers/automatic_heritage_metadata_enrichment_with_hi

https://www.flickr.com/photos/drainrat/14779928998/

DATA: OPENIMAGES.EU

Open videos, Netherlands Institute for Sound and Vision
3000, mostly news broadcasts

DATA: DELPHER.NL

Scans of radio bulletins (hand annotated)
1937 – 1984
1.5 million
OCR’ed and NER’ed

ENTITY EXTRACTION

CROWDTRUTH.ORG

ENTITY EXTRACTION

EVENTS CROWDSOURCING AND LINKING TO CONCEPTS THROUGH CROWDTRUTH.ORG

SEGMENTATION & KEYFRAMES

LINKING EVENTS AND CONCEPTS TO KEYFRAMES

Linked Data

SIMPLE EVENT MODEL (SEM), OPENANNOTATION (OA) AND SKOS

DIVE:MEDIA OBJECT

SEM:EVENT

SEM:PLACE

SEM:TIME

SEM:ACTOR

SKOS:CONCEPT

OA:ANNOTATION

LINKS TO EUROPEANA (MULTILINGUAL)

LINKS TO DBPEDIA
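
The classes listed on this slide can be illustrated with a few triples. The following is a minimal sketch, not the project's actual data: the SEM, OA and SKOS namespace IRIs are the published ones, but the `DIVE` and `EX` namespaces, the identifiers, and the choice to connect a media object to an event or concept through an `oa:Annotation` are assumptions made for illustration.

```python
# Published namespace IRIs for SEM, OpenAnnotation and SKOS.
SEM = "http://semanticweb.cs.vu.nl/2009/11/sem/"
OA = "http://www.w3.org/ns/oa#"
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical namespaces: the slides do not give the real DIVE IRIs.
DIVE = "http://example.org/dive/"
EX = "http://example.org/data/"


def triple(subject, predicate, obj):
    """Serialize one IRI-only statement in N-Triples syntax."""
    return f"<{subject}> <{predicate}> <{obj}> ."


def describe_media_object(media_id, event_id, actor_id, place_id, time_id, concept_id):
    """Return N-Triples lines for one media object and its event context."""
    media, event = EX + media_id, EX + event_id
    actor, place = EX + actor_id, EX + place_id
    time, concept = EX + time_id, EX + concept_id

    triples = [
        # Type statements, one per class named on the slide.
        triple(media, RDF_TYPE, DIVE + "MediaObject"),
        triple(event, RDF_TYPE, SEM + "Event"),
        triple(actor, RDF_TYPE, SEM + "Actor"),
        triple(place, RDF_TYPE, SEM + "Place"),
        triple(time, RDF_TYPE, SEM + "Time"),
        triple(concept, RDF_TYPE, SKOS + "Concept"),
        # SEM properties tying the event to who, where and when.
        triple(event, SEM + "hasActor", actor),
        triple(event, SEM + "hasPlace", place),
        triple(event, SEM + "hasTime", time),
    ]

    # Assumed linking pattern: one annotation per body (event, concept),
    # each targeting the media object.
    for index, body in enumerate([event, concept], start=1):
        annotation = f"{media}/annotation/{index}"
        triples += [
            triple(annotation, RDF_TYPE, OA + "Annotation"),
            triple(annotation, OA + "hasTarget", media),
            triple(annotation, OA + "hasBody", body),
        ]
    return triples


if __name__ == "__main__":
    for line in describe_media_object(
        "video1", "event1", "actor1", "place1", "time1", "concept1"
    ):
        print(line)
```

The output is plain N-Triples, so it could be loaded into any triple store for experimentation.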

CLIOPATRIA TRIPLE STORE

130K TRIPLES (FOR NOW)

SPARQL ENDPOINT

HTTP://ECULTURE.CS.VU.NL:8877/DIVE/HOME
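
A SPARQL endpoint such as the one mentioned here is typically queried over HTTP with the query passed as a `query` parameter. The sketch below only builds such a request and does not send it. The endpoint address `http://example.org/sparql/` is a placeholder, since the slides give the demo home page but not the endpoint path, and the query shape (events with optional actors and places) is an assumption about what the store contains.

```python
from urllib.parse import urlencode

# Placeholder address: the actual endpoint path is not given in the slides.
SPARQL_ENDPOINT = "http://example.org/sparql/"


def build_event_query(limit=10):
    """Build a SPARQL query listing SEM events with optional actor and place."""
    return (
        "PREFIX sem: <http://semanticweb.cs.vu.nl/2009/11/sem/>\n"
        "SELECT ?event ?actor ?place WHERE {\n"
        "  ?event a sem:Event .\n"
        "  OPTIONAL { ?event sem:hasActor ?actor }\n"
        "  OPTIONAL { ?event sem:hasPlace ?place }\n"
        f"}} LIMIT {int(limit)}"
    )


def build_request_url(endpoint, query):
    """Encode the query as a GET request URL (SPARQL protocol 'query' parameter)."""
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    # Print the request URL; fetching it is left to the reader.
    print(build_request_url(SPARQL_ENDPOINT, build_event_query(limit=5)))
```

Keeping query construction separate from the HTTP call makes it easy to inspect the query text before pointing it at a real store.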

DIGITAL SUBMARINE UI

https://www.flickr.com/photos/benjcarson/245171885

INFINITY OF EXPLORATION

https://www.flickr.com/photos/mibuchat/2774251415

Current work

USE COMMON VOCABULARIES

GTAA: GEMEENSCHAPPELIJKE THESAURUS AUDIOVISUELE ARCHIEVEN

GEONAMES

ADD AND CLEAN DATA

ADD CROWD CORRECTIONS

EVALUATE

THANK YOU

https://www.flickr.com/photos/robysaltori/
