An Open Linked Data Strategy for Tourism

Can we really compete with v ? Andrea Volpini @cyberandy #SEMANTiCS2014 - Leipzig, Germany 5th September 2014

description

The following high-level challenges from SalzburgerLand Tourism (SLT, a public administration responsible for supporting tourism in the region of Salzburg) have shaped the Open Linked Data Strategy implemented by Insideout10 (an Italian company focused on Open Data and Semantic Web Publishing) using the Redlink platform (Redlink GmbH is a company offering linked data publishing services and content analysis as-a-service in the cloud):

1. Cross-platform publishing: As more travellers adopt mobile devices en masse, the entire service offering has to be re-engineered around a comprehensive mobile user experience.

2. 3rd Party Content Syndication: With the growth of location-based social mobile apps (SoLoMo = Social + Local + Mobile), a large amount of information now reaches travellers directly through 3rd party applications such as Foursquare, Facebook or Google.

3. Semantic Search Optimisation: As commercial search engines (Google, Bing, Yahoo!, Yandex and more) start to use semantic search, more and more users are taking advantage of conversational search and voice-enabled applications like Siri and Google Now.

4. Linked Data Ecosystem: Engaging with local communities is key to nurturing and maintaining the virtuous circle of information between tourist operators, citizens, local businesses and tourists.

Transcript of An Open Linked Data Strategy for Tourism

Can we really compete with v ?

Andrea Volpini @cyberandy

#SEMANTiCS2014 - Leipzig, Germany

5th September 2014

A semantic plugin for

using content analysis and linked data services from

An Open Linked Data Project for Tourism in Salzburg

• Cross platform publishing as more travellers massively begin using mobile devices

• Multiple Web CMSs (both proprietary and open source) to be managed simultaneously

• Costly manual curation and interlinking

• Increasing demand for content syndication (from big players like Foursquare as well as from local application developers)

• Need for better SEO especially for events and sites (too regional to be understood by commercial search engines)

Remixing existing content and creating new value.

A magazine running on WordPress

An online booking system

freshly updated content on locations and events

a database containing: events, facilities, accommodations, …

Everything we know already from Wikipedia

the World’s largest encyclopedia

Using Linked Data to make sense of the information

Linked Data Publishing

• Data from the online booking system (Feratel) is enriched and transformed into triples using identified vocabularies and ontologies

• Triples are stored in the Redlink triple store in a dedicated context

• RDF data and SPARQL endpoints are published as Linked Open Data on the data website (data.salzburgerland.com), which runs CKAN

• CKAN makes the data accessible to third parties in various formats by querying Redlink
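
The publishing steps above can be sketched in a few lines of Python. This is a minimal illustration and not the project's actual pipeline: the record layout, the helper names, and the choice of rdfs:label together with the LODE atPlace property are assumptions made for the example. Only the event and place URIs are taken from the slides.

```python
# Minimal sketch: turn one booking-system record into N-Triples statements.
# The record fields and helper names are hypothetical; the real Feratel
# export and the vocabularies chosen by the project may differ.

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
LODE = "http://linkedevents.org/ontology/"


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def event_to_ntriples(record):
    """Return a list of N-Triples statements describing one event."""
    subject = f"<{record['uri']}>"
    return [
        f"{subject} <{RDF_TYPE}> <{LODE}Event> .",
        f'{subject} <{RDFS_LABEL}> "{escape_literal(record["label"])}" .',
        f"{subject} <{LODE}atPlace> <{record['place_uri']}> .",
    ]


# Illustrative record built from the entities shown in the slides.
record = {
    "uri": "http://rdf.salzburgerland.com/events/event/dea7fde1-5583-4002-97eb-0074a182fa9c.html",
    "label": "Florianifeier",
    "place_uri": "http://dbpedia.org/page/Unternberg",
}

triples = event_to_ntriples(record)
for line in triples:
    print(line)
```

Each printed line is one subject, predicate, object statement that a triple store could load into a dedicated context.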

Transforming Feratel Data into Semantic Knowledge: from SOAP to Linked Data

Data Modelling

Ontologies provide a means to hold everything together

Using LODE: An ontology for Linking Open Descriptions of Events

Adding the relationships between things

Florianifeier

Using RDF, different data sources can be integrated, providing robot-friendly information to describe real-world things

<subject><predicate><object>

Semantic Lifting and Linked Data Principles

• A “word” or “phrase” becomes an identifier used to denote “things” (named entities) existing in the real world

1. Real-world things are unambiguously represented with web addresses (URIs)

2. Accessing these web addresses (HTTP URIs) returns usable data in standard formats (RDF, SPARQL)

3. This data includes links to other data so that people can discover more things

"label": "May", "reference": "http://dbpedia.org/resource/May"
Type: Thing

"values": ["13.7446"], "predicate": "http://www.w3.org/2003/01/geo/wgs84_pos#long"
"values": ["47.10222"], "predicate": "http://www.w3.org/2003/01/geo/wgs84_pos#lat"
"reference": "http://dbpedia.org/page/Unternberg"
Type: Place

"label": "Florianifeier", "reference": "http://rdf.salzburgerland.com/events/event/dea7fde1-5583-4002-97eb-0074a182fa9c.html"
Type: Event

Tim Berners-Lee.
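
The second principle above (HTTP URIs that return usable data) is usually realised through content negotiation. The following is a minimal sketch using only the Python standard library. The function names are hypothetical, and the DBpedia resource URI and the text/turtle media type are assumptions chosen for illustration; whether a given server honours the Accept header depends on the publisher.

```python
# Minimal sketch: ask an HTTP URI for an RDF serialisation via the
# Accept header (content negotiation).
from urllib.request import Request, urlopen


def build_rdf_request(uri, media_type="text/turtle"):
    """Prepare a GET request that asks the server for RDF."""
    return Request(uri, headers={"Accept": media_type})


def fetch_rdf(uri, media_type="text/turtle", timeout=10):
    """Dereference the URI and return the response body as text.

    Needs network access, so it is defined here but not called.
    """
    with urlopen(build_rdf_request(uri, media_type), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Building the request does not touch the network.
request = build_rdf_request("http://dbpedia.org/resource/Unternberg")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```

Calling fetch_rdf with the same URI would perform the actual lookup; the returned document is where links to further data (the third principle) would be found.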

LANGUAGE: ENGLISH

EVENT: FLORIANIFEIER

THING: MAY

LOCATION: UNTERNBERG

[Très Riches Heures du duc de Berry, Raymond Cazelles et Johannes Rathofe]

“This May don't miss the Florianifeier, we'll have fun as usual in Unternberg”
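
To make the idea of semantic lifting concrete, here is a toy annotator that links words in the sentence above to the entity references shown earlier. It is only a sketch: a real content analysis service does disambiguation and ranking that a plain dictionary lookup cannot, and the GAZETTEER structure and annotate function are hypothetical names introduced for this example.

```python
# Toy gazetteer-based annotator: links known labels in a text to entity URIs.
# This only illustrates how a word becomes an identifier for a thing.
import re

# Labels, types and references as shown in the slides.
GAZETTEER = {
    "May": ("Thing", "http://dbpedia.org/resource/May"),
    "Unternberg": ("Place", "http://dbpedia.org/page/Unternberg"),
    "Florianifeier": (
        "Event",
        "http://rdf.salzburgerland.com/events/event/dea7fde1-5583-4002-97eb-0074a182fa9c.html",
    ),
}


def annotate(text):
    """Return (label, type, uri, offset) for each gazetteer label found in text."""
    found = []
    for label, (entity_type, uri) in GAZETTEER.items():
        # Whole-word, case-sensitive match so "May" does not match "maybe".
        for match in re.finditer(rf"\b{re.escape(label)}\b", text):
            found.append((label, entity_type, uri, match.start()))
    # Report annotations in reading order.
    return sorted(found, key=lambda item: item[3])


sentence = "This May don't miss the Florianifeier, we'll have fun as usual in Unternberg"
annotations = annotate(sentence)
for label, entity_type, uri, offset in annotations:
    print(f"{offset:>3} {label} [{entity_type}] -> {uri}")
```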

Dynamic Semantic Publishing with

• Data from the Redlink triple store is made available for content enrichment and can be edited using WordLift.

Data Curation

• Using Linked Data the Web becomes our new CMS

• information is automatically imported into WordPress

• posts are connected with entities

• properties for each entity can be edited using WordPress

• any change is automatically reflected in the triple-store and re-published as Open Data

Using Linked Data the Web becomes our new CMS.

editing a blog post

editing an entity

Web Search 19,900 results

no answer

Touristic applications attempting to discover events in SalzburgerLand.

“Which events occur in May in Lungau?”

Linked Open Data Query 5 results

5 answers

Unternberg is a village in the area of Lungau
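
The reason a linked data query can answer this question is that it follows relationships: the event is at a place, and the place is located in a region. The sketch below mimics that with a tiny in-memory set of triples standing in for a SPARQL endpoint. The ex: predicate names and the helper functions are hypothetical, and the data contains only the single event named in the slides, not the five results of the real query.

```python
# Tiny in-memory triple store with wildcard matching, standing in for a
# SPARQL endpoint. All ex: names are placeholders made up for this sketch.
TRIPLES = {
    ("ex:Florianifeier", "rdf:type", "ex:Event"),
    ("ex:Florianifeier", "ex:atPlace", "ex:Unternberg"),
    ("ex:Florianifeier", "ex:inMonth", "May"),
    ("ex:Unternberg", "ex:locatedIn", "ex:Lungau"),
}


def match(pattern, triples=TRIPLES):
    """Yield triples matching an (s, p, o) pattern; None acts as a wildcard."""
    for triple in triples:
        if all(want is None or want == got for want, got in zip(pattern, triple)):
            yield triple


def events_in(region, month):
    """Return events held in `month` at any place located in `region`."""
    places = {s for s, _, _ in match((None, "ex:locatedIn", region))}
    results = []
    for event, _, _ in match((None, "rdf:type", "ex:Event")):
        in_month = any(match((event, "ex:inMonth", month)))
        in_region = any(o in places for _, _, o in match((event, "ex:atPlace", None)))
        if in_month and in_region:
            results.append(event)
    return sorted(results)


# "Which events occur in May in Lungau?"
answer = events_in("ex:Lungau", "May")
print(answer)
```

A keyword search for "Lungau" would miss this event because its description only mentions Unternberg; the locatedIn relationship is what connects the two.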

Better SEO using Semantic Markup

Florianifeier

Unternberg

• Using schema.org the data from the triple-store is added to the pages as semantic markup

• Search engines can “recognise” entities that were previously unknown (e.g. Florianifeier)
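
As an illustration of what such markup could look like, the sketch below builds a schema.org description of the event and serialises it as JSON-LD. JSON-LD is an assumption made for this example; the slides do not say which syntax (Microdata, RDFa or JSON-LD) the pages use, and the event_jsonld helper is a hypothetical name. The coordinates and URL are the ones shown in the slides. A complete Event description would normally also carry a start date, which is omitted here because the slides do not give one.

```python
# Minimal sketch: describe an event with schema.org terms and embed it
# in a page as a JSON-LD script tag.
import json


def event_jsonld(name, url, place_name, latitude, longitude):
    """Build a schema.org Event with a nested Place and GeoCoordinates."""
    return {
        "@context": "https://schema.org",
        "@type": "Event",
        "name": name,
        "url": url,
        "location": {
            "@type": "Place",
            "name": place_name,
            "geo": {
                "@type": "GeoCoordinates",
                "latitude": latitude,
                "longitude": longitude,
            },
        },
    }


markup = event_jsonld(
    name="Florianifeier",
    url="http://rdf.salzburgerland.com/events/event/dea7fde1-5583-4002-97eb-0074a182fa9c.html",
    place_name="Unternberg",
    latitude=47.10222,
    longitude=13.7446,
)

script_tag = (
    '<script type="application/ld+json">'
    + json.dumps(markup, indent=2)
    + "</script>"
)
print(script_tag)
```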

Keep the content fresh with LOD-driven Widgets

Florianifeier

• Using WordLift’s timeline (configured to receive events related to the locations detected in the text), we can keep the pages up to date with new events coming from the Feratel DB.

Where are we going next?

• Analyse media in a cross-media context, covering media resources as well as connected content, including video, images, audio, text, link structure and metadata;

• Investigate cross-media analysis along the complete, distributed analysis chain, namely extraction, metadata publishing, querying and recommendations;

• Contribute the project's main software development results as Open Source components to two established Apache projects, Apache Marmotta and Apache Stanbol, simplifying the use of the technology in industrial products.

Cross-media in action: analysis and querying

MICO-PROJECT.EU

• TourPack aims to build a linked data-empowered system for touristic service packaging. By integrating information from multiple sources and systems, using linked data as a global information integration platform, and mining the depths of “closed” data, the touristic service package production system will be able to create the best possible travel experience for the traveller.

In partnership with:

On-demand data-driven Touristic Service Packages

TOURPACK.STI2.AT

JOIN.WORDLIFT.IT

Danke schön! (Thank you!)

redlink.co wordlift.it insideout.io