SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked...

30
CAA 2011 Beijing SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets Monika Solanki [email protected] Department of Computer Science Joint work with Yi Hong Department of Computer Science Katharina Rebay-Salisbury School of Archaeology and Ancient History University of Leicester, UK April 14, 2011 Monika Solanki

description

Presentation at CAA 2011, Beijing

Transcript of SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked...

Page 1: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

CAA 2011 Beijing

SEA: A Framework for Interactive Querying,Visualisation and Statistical Analysis of Linked

Archaeological Datasets

Monika [email protected]

Department of Computer ScienceJoint work with

Yi HongDepartment of Computer Science

Katharina Rebay-SalisburySchool of Archaeology and Ancient History

University of Leicester, UK

April 14, 2011Monika Solanki

Page 2: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Talk outline CAA 2011 Beijing

Outline

Context: Tracing Networks

Motivation

Case study

Semantic Explorer forArchaeology

Conclusions and Future work

Demo

Monika Solanki

Page 3: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Context CAA 2011 Beijing

Tracing Networks

Investigates the network of contacts across and beyondthe Mediterranean region, between the late bronze ageand the late classical period (c.1500-c.200 BCE) byinterrogating material objects

Seven archaeological case studies fully integrated withcomputer science projects

http://www.tracingnetworks.org/

Monika Solanki

Page 4: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Context CAA 2011 Beijing

Tracing Networks

Monika Solanki

Page 5: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Context CAA 2011 Beijing

Tracing Networks

Archaeologists study a wide range of material objects.

By tracking them at every stage of their production,distribution, use, and consumption across a largegeographical region, over a long time period, they cantrace the links between the people who made, used, andtaught others to make them.

The Chaîne opératoire

Cross-craft interaction

Monika Solanki

Page 6: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Motivation: Archaeological perspective

Key Barriers to adopting Semantic Web technologiesThe most time-consuming part of an archaeologicalinvestigation is the post-excavation analysis.

There is a lack of tools and platforms that provide anintegrated environment for interactive querying,visualisation and statistical analysis

Traditional search and retrieval mechanisms generallyprovided “Google” style keyword search or “Library” styledrop down search.

They assume knowledge of controlled vocabularies,terminology and structure of the underlying ontologicalschemas.

Monika Solanki

Page 7: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Motivation: Computer Science perspective

To increase the uptake and usage of semantically richarchaeological data, it needs to be openly available andaccessible by humans and applications.

An integrated view of diverse data sources is innovativeand of immense potential value for the archaeologicalcommunity.

There is therefore a mileage in combining the task ofarchiving, querying and analysing the data within a singleframework.

Archaeological data is fragmentary. Inferencing capabilitiesof reasoners can be used to extract implicit knowledge andcontribute to their existing knowledge bases to completethe fragments.

Monika Solanki

Page 8: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Case study: Human representations

Human representations, identities and social relations in theLate Bronze and Iron Age of Central Europe

The scope: examining and analysing humanrepresentations on a range of object types and in a rangeof materials, such as bronze and pottery.

The project utilises details such as gestures and postures,dress and associated objects as keys to understandinghow identity and new understandings of society arecommunicated.

Raw data is collected through examining objects frompublished literature or in museum collections.

Monika Solanki

Page 9: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Human representations

The analysis generates a large volume of data

Along with details of the human representation on objects,the data recorded also includes images of these objects.

We have developed a vocabulary that defines variousconcepts and relationships of interest in the domain ofhuman representation as captured in these images.

Using the ontology we generated linked datasets from theraw data.

We are currently linking to DBpedia and Geonames,however we are also on the lookout for datasets closelyrelated to archaeology with which we can link in the future.

Monika Solanki

Page 10: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Human representations: Informal queries

Example 1:

“Find images of riders who appear on objects found in Austriawhere the altitude of the excavation site is 500 meters abovesea level. I would also like to know the statistical distribution ofthe material and the technologies used for the production ofthese objects. I would like to visualise the results as a pie chartand see the distribution of the sites where these objects werefound on Google Earth”.

Monika Solanki

Page 11: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Motivation CAA 2011 Beijing

Human representations: Informal queries

Example 2:

“Find all objects which have images of individuals in the orantgesture who are wearing a triangular dress, earrings and whocarry a vessel on their head, where the vessel is supported bytheir left hand. I would also like to know the statisticaldistribution of the gender of these individuals according to thecountry in which the objects were found. I would like tovisualise the results as a tree map and see the distribution ofthe sites where these objects were found on Google Map”.

Monika Solanki

Page 12: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Semantic Explorer for Archaeology

A web application

RESTful APIs for programmatically accessing the TN-LODcloud

Interactive and global querying of linked datasets

Data visualisations using user defined perspectives

Statistical analysis using bespoke criteria provided byarchaeologists at runtime

Monika Solanki

Page 13: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: Architecture

Monika Solanki

Page 14: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: Query Component

Query builder, a SPARQL/SQWRL endpoint and an inferenceengine

Aggregates the input data as RDFtriples

Generates several sub queries eachof which correspond to a specifictask

Formalises the query in SPARQL,includes any constraints

Provides an interface through whichthe SPARQL query generated byaggregating the triples can be edited

Monika Solanki

Page 15: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: Query Component

Query builder, a SPARQL/SQWRL endpoint and an inferenceengine

Queries can be specified intuitively

Utilises the WordNet dictionary

“Natural Language QuerySummariser”

Records user preferences: statisticalanalysis, visualisation

Monika Solanki

Page 16: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Building the query using SEA

Monika Solanki

Page 17: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Human Representation

“Find images of riders who appear on objects found in Austria wherethe altitude of the excavation site is 500 meters above sea level. Iwould also like to know the statistical distribution of the material andthe technologies used for the production of these objects. I would liketo visualise the results as a pie chart and see the distribution of thesites where these objects were found on Google Earth”.

Part 1Find images of riders who appear on objects found in Austria wherethe altitude of the excavation site is 500 meters above sea level.

Part 2I would also like to know the statistical distribution of the material andthe technologies used for the production of these objects.

Monika Solanki

Page 18: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Sub query Part 1

PREFIX tnh:<http://www.tracingnetworks.ac.uk/ontology/human_representation.owl#>

PREFIX rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?individual ?object ?site ?country ?abbr ?type

?tech ?image ?altitude ?materialWHERE{

?individual rdf:type tnh:Individual.?individual tnh:appearOn ?object.?object tnh:isFoundAtSite ?site.?site tnh:isLocatedInCountry ?country.?country tnh:hasCountryAbbr ?abbr.?object tnh:has1stObjectType thn:rider.?object tnh:hasImageLink ?image.?site tnh:hasAltitude ?altitude.FILTER (?altitude>=500).FILTER (?abbr="AT").}LIMIT 3000

Page 19: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: Query Component

Query builder, a SPARQL/SQWRL endpoint and an inferenceengine

Includes an option to specify anyreasoning rules.

A rule-based inferencing componentspecified to support deductivereasoning.

SWRL or Jena inferencing rulesused to derive implicit statementsfrom existing archaeologicalknowledge bases

Monika Solanki

Page 20: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: Visualiser Component

Three visualisation modules.Queries generated by the user

Convert the SPARQL triple patterns to GraphMLThe visualiser is interactive and allows a user toexpand/collapse nodes in the graph.Search for a specific node in the graph.

Query Results: linked data, markers on the GoogleEarth/Google maps.

Statistical analysis: commonly used statistical analysismodels.

Monika Solanki

Page 21: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Visualising the query

Monika Solanki

Page 22: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Visualising the query results: Google earth

Monika Solanki

Page 23: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

Visualising the query results

Monika Solanki

Page 24: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

SEA: Semantic Explorer for Archaeology CAA 2011 Beijing

SEA: RESTful API

The SEA REST API corresponds to a set of servicessimply accessible through HTTP calls.

The SEA API employs content negotiation to decidewhether the result should be encoded in RDF/XML(default), JSON or plain text.

We have been inspired by the linked data APIs publishedby the data.gov.uk.

The APIs do not provide support for PUT/POST request.They are meant to provide a read only access layer to thedata repositories.

The SEA API layer can also act as a proxy over a SPARQLendpoint. This allows a user to specify a sparql query as aquery parameter.

Monika Solanki

Page 25: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Related work CAA 2011 Beijing

Closely related work

D2RQ: Berlin

Virtuoso: Open Link Software

STAR: Glamorgan, English Heritage

STELLAR: Glamorgan, English Heritage

TRANSLATION: Southampton

Monika Solanki

Page 26: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Related work CAA 2011 Beijing

Grand vision: The TN-LOD cloud

Tracing Networks through Linked Open Data

Monika Solanki

Page 27: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Conclusions CAA 2011 Beijing

Conclusions

Little work has been done so far in the Semantic webcommunity that can motivate archaeologists to adopt theirtechnologies to manage and analysis data.

An exploratory attempt to reconstruct the Chaîneopératoire using the principles of linked open data.

A transformation framework for migrating large volumes ofarchaeological data stored in RDBs to ontology based datasets on the Semantic Web.

SEA: A unified framework that allows archaeologists withbasic knowledge of Semantic Web technologies to“explore” their datasets through interactive querying,visualisation and analysis.

Monika Solanki

Page 28: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Future work CAA 2011 Beijing

Future work

Implement a user-friendly graphical modeling environmentfor the language in GMF (Graphical Modeling Framework)to allow easy creation and editing of transformation rules.

Extend the query interface so that it allows archaeologiststo specify ranking heuristics for the search results.

Extend the visualisation interface by providing a facetedbrowser that allows the archaeologist to visualise queryresults along several facets.

Augment the support provided for inference making.

Keeping a close eye on the linked data cloud for anyrelevant archaeological datasets that may eventually bepublished so that we can link to it.

Monika Solanki

Page 29: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

Acknowledgements CAA 2011 Beijing

Acknowledgements

Computer ScienceProf Jose Fiadeiro

Yi Hong

ArchaeologyProf Lin Foxhall

Katharina Rebay-Salisbury

Monika Solanki

Page 30: SEA: A Framework for Interactive Querying, Visualisation and Statistical Analysis of Linked Archaeological Datasets

CAA 2011 Beijing

Many Thanks!!!

Monika Solanki