ForgetIT: Beyond the page: Giving content a meaning and value

Concise Preservation by combining Managed Forgetting and Contextualized Remembering

description

Following the concept of human memory, ForgetIT aims to create a framework that will bring “managed forgetting” to TYPO3 CMS. It will provide semantic annotation, intelligent preservation and managed archiving of content objects. Learn what dkd plans for 2014 and how you can contribute.

While preservation of digital content is now well established in memory institutions such as national libraries and archives, it is still in its infancy in most other organizations, and even more so for personal content. ForgetIT combines three new concepts to ease the adoption of preservation in the personal and organizational context.

Managed Forgetting: Managed Forgetting models resource selection as a function of attention and significance dynamics. It is inspired by the important role of forgetting in human memory and focuses on characteristic signals of reduction in salience.

Synergetic Preservation: Synergetic Preservation crosses the chasm that exists between active information use and preservation management by making intelligent preservation processes an integral part of the content lifecycle in information management.

Contextualized Remembering: Contextualized Remembering targets keeping preserved content meaningful and useful. It will be based on a process of dynamic evolution-aware contextualization.

Impact on TYPO3 CMS: Together with the TYPO3 community and selected pilot customers, dkd will work on establishing the respective extensions to provide these concepts to TYPO3 CMS and its user base.

Olivier will introduce you to the project, its concepts and the framework architecture. The past year was used to define these, and a solid foundation was laid. We elaborated the design and functional requirements using two use cases (I. Press release, II. DAM integration into the backend). The current year of the project will be used to create a first working implementation.

What does this mean for you? After a short break, there will be a joint brainstorming session on how you can get involved and what the potential benefits would be. Things to look at will be:

* the value of content objects
* semantic annotation and contextualization
* memory buoyancy, allowing mechanics to forget content over time
* utilization of open standards like CMIS, OData, Stanbol

Transcript of ForgetIT: Beyond the page: Giving content a meaning and value

Page 1: ForgetIT: Beyond the page: Giving content a meaning and value

Concise Preservation by combining Managed Forgetting and Contextualized Remembering

Page 2: ForgetIT: Beyond the page: Giving content a meaning and value

Olivier Dobberkau (R&D)

T3DD2014

Beyond the page - Giving content a meaning and value

TYPO3 Developer Days, 19/22 June 2014, Eindhoven

Page 3: ForgetIT: Beyond the page: Giving content a meaning and value

About

Olivier Dobberkau

R&D dkd

President of TYPO3 Association

@TReverendNeverend

Page 4: ForgetIT: Beyond the page: Giving content a meaning and value

The problem

Page 5: ForgetIT: Beyond the page: Giving content a meaning and value

Welcome to the digital information age... a never-ending flood of content!

Technology enables us to produce nearly unlimited data

We are still “hunter-gatherers” somehow

Storage space currently feels “infinite”, but sooner or later we will hit the limits of the earth's resources

Technology evolves ever faster, bringing new technologies, formats and standards at an increasing frequency. So how do we handle this?

Page 6: ForgetIT: Beyond the page: Giving content a meaning and value

Storage capacity is ever increasing

Prices for storage are falling

Page 7: ForgetIT: Beyond the page: Giving content a meaning and value

Easy - let's keep everything!

There are many more costs

Retrieval

Maintenance

Indexing

Updates

Deprecated formats

Page 8: ForgetIT: Beyond the page: Giving content a meaning and value

Should we really keep everything as it was created?

Page 9: ForgetIT: Beyond the page: Giving content a meaning and value

“The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia

Page 10: ForgetIT: Beyond the page: Giving content a meaning and value

How do we tackle this?

Page 11: ForgetIT: Beyond the page: Giving content a meaning and value

What is preservation?

“Preservation — The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.”

Preservation 101

Page 12: ForgetIT: Beyond the page: Giving content a meaning and value

Preserving a website is not trivial

What do you want to preserve?

Content only?

Content and Design?

How often? Stock prices vs. Company History page

How do you deal with browser differences?

How do you preserve functionality? E.g. insurance fee calculator

Page 13: ForgetIT: Beyond the page: Giving content a meaning and value

The project

Page 14: ForgetIT: Beyond the page: Giving content a meaning and value

Concise Preservation by combining Managed Forgetting and Contextualized Remembering

Page 15: ForgetIT: Beyond the page: Giving content a meaning and value

The project

Deliver a framework for intelligent preservation, incl. pilot applications (personal use case, organizational use case) that already bring value to their target groups.

Page 16: ForgetIT: Beyond the page: Giving content a meaning and value

The Project

EU research project

Part of the Seventh Framework Programme (FP7)

Countries involved: Germany, Sweden, Israel, Turkey, Greece, United Kingdom, Italy

Project duration: 2013-2016

Page 17: ForgetIT: Beyond the page: Giving content a meaning and value

Core concepts

Synergetic Preservation

Contextualized Remembering

Managed Forgetting

Page 18: ForgetIT: Beyond the page: Giving content a meaning and value

Core values

Preservation value

Memory buoyancy

Page 26: ForgetIT: Beyond the page: Giving content a meaning and value

Memory buoyancy and preservation value

Archive or delete

Information not needed

Digital preservation

Forgetting without context

Managed digital preservation

Preservation with context

Preservation with learning
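
To make the two core values more tangible, here is a minimal sketch. It assumes that memory buoyancy decays exponentially with the time since a content object was last accessed, and that a simple threshold on buoyancy and preservation value selects an action. The decay model, the 90-day half-life, the threshold and both function names are illustrative assumptions, not the definitions used in the ForgetIT project.

```python
from datetime import datetime, timedelta


def memory_buoyancy(last_access, now, half_life_days=90.0):
    """Hypothetical buoyancy in (0, 1]: 1.0 at the moment of access,
    halving every `half_life_days` without further access."""
    age_days = max((now - last_access).total_seconds() / 86400.0, 0.0)
    return 0.5 ** (age_days / half_life_days)


def suggest_action(buoyancy, preservation_value, threshold=0.5):
    """Hypothetical decision rule on the two values (both in 0..1)."""
    if buoyancy >= threshold:
        return "keep in active use"
    if preservation_value >= threshold:
        return "preserve"
    return "candidate for forgetting"


now = datetime(2014, 6, 19)
fresh = memory_buoyancy(now - timedelta(days=1), now)
stale = memory_buoyancy(now - timedelta(days=365), now)

print(suggest_action(fresh, 0.2))  # recently used content stays active
print(suggest_action(stale, 0.9))  # unused but valuable content is preserved
print(suggest_action(stale, 0.1))  # unused, low-value content may be forgotten
```

A real implementation would presumably feed in more signals than the last access date (for example edits, references or search hits); the single timestamp only keeps the sketch short.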

Page 27: ForgetIT: Beyond the page: Giving content a meaning and value

Use cases

Organizational Preservation

Personal Preservation

Page 28: ForgetIT: Beyond the page: Giving content a meaning and value

Organizational use case

Organizational Preservation

Digital Asset Management

Versioning

Archiving a complete Website

Individual genres and their specific requirements

Example: Press Release

Page 29: ForgetIT: Beyond the page: Giving content a meaning and value

Business case / Value proposition

Creating metrics to actually “measure” the value of content is unique to ForgetIT and will be a USP

Sustainable and integrated tools to manage the process of preservation, which is new to content management systems

The utilized standards (e.g. CMIS, OData, Stanbol, etc.) and newly created tools within the context of TYPO3 CMS will lead to CMS interoperability and thus prevent future loss of content due to technological evolution (see “preventing the digital dark age”)

Page 30: ForgetIT: Beyond the page: Giving content a meaning and value

Content Value Performance Indicators: Potential dimensions to look at:

| Production | Inner relevance     | Outer relevance        | “Meaning”  |
|------------|---------------------|------------------------|------------|
| Effort     | References          | Social media relevance | Context    |
| Complexity | Page impressions    | Google page rank       | Ontologies |
| Versions   | TYPO3 CMS page rank | Backlinks              | Annotation |
| …          | Memory buoyancy     | …                      | …          |
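
One way such dimensions could be combined into a single content value is a weighted average. The following is a minimal sketch under that assumption: the four dimension keys mirror the column headings above, while the weights, the 0..1 normalisation and the function name `content_value` are hypothetical and not taken from the project.

```python
# Hypothetical weights per dimension; a real system would need to calibrate them.
WEIGHTS = {
    "production": 0.2,
    "inner_relevance": 0.3,
    "outer_relevance": 0.3,
    "meaning": 0.2,
}


def content_value(indicators, weights=WEIGHTS):
    """Weighted average of per-dimension scores, each expected in 0..1."""
    missing = set(weights) - set(indicators)
    if missing:
        raise ValueError(f"missing indicators: {sorted(missing)}")
    total_weight = sum(weights.values())
    weighted_sum = sum(weights[name] * indicators[name] for name in weights)
    return weighted_sum / total_weight


press_release = {
    "production": 0.5,       # e.g. derived from effort, complexity, versions
    "inner_relevance": 1.0,  # e.g. references, page impressions
    "outer_relevance": 0.0,  # e.g. social media relevance, backlinks
    "meaning": 0.5,          # e.g. context, ontologies, annotations
}
print(round(content_value(press_release), 3))
```

How each raw indicator (page impressions, backlinks and so on) is normalised into a 0..1 score is left open here on purpose.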

Page 31: ForgetIT: Beyond the page: Giving content a meaning and value

Why TYPO3 CMS?

Open source

Large installation base

We want to raise awareness of the concept of preservation

Page 32: ForgetIT: Beyond the page: Giving content a meaning and value
Page 33: ForgetIT: Beyond the page: Giving content a meaning and value

Architecture

Page 34: ForgetIT: Beyond the page: Giving content a meaning and value

Page 35: ForgetIT: Beyond the page: Giving content a meaning and value

Page 36: ForgetIT: Beyond the page: Giving content a meaning and value

Page 37: ForgetIT: Beyond the page: Giving content a meaning and value

Page 38: ForgetIT: Beyond the page: Giving content a meaning and value

Technology

Page 39: ForgetIT: Beyond the page: Giving content a meaning and value

Content Management Interoperability Services (CMIS)

Standard allowing interoperability between CMS

Abstraction layer

Defined domain model

OASIS Standard

Page 40: ForgetIT: Beyond the page: Giving content a meaning and value

Semantic web

A web that can be processed by machines

Resource Description Framework (RDF)
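
RDF describes resources as subject-predicate-object statements ("triples"). As a rough illustration, the sketch below stores a few triples as plain Python tuples and queries them. All URIs under `http://example.org/` and the helper `objects` are made up for this example; a real application would more likely use an RDF library and established vocabularies.

```python
EX = "http://example.org/"  # hypothetical namespace for this sketch

# Each statement is a (subject, predicate, object) triple.
triples = {
    (EX + "press-release/42", EX + "mentions", EX + "company/LEGO"),
    (EX + "press-release/42", EX + "mentions", EX + "place/Nuremberg"),
    (EX + "company/LEGO", EX + "type", EX + "Brand"),
}


def objects(subject, predicate, graph=triples):
    """Return all objects of statements matching subject and predicate."""
    return sorted(o for s, p, o in graph if s == subject and p == predicate)


for mentioned in objects(EX + "press-release/42", EX + "mentions"):
    print(mentioned)
```

Because every statement has the same three-part shape, a machine can follow links between statements without knowing anything about the page layout they came from.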

Page 41: ForgetIT: Beyond the page: Giving content a meaning and value

Ontologies / Domains

industry-specific concepts

geography, time, abstract concepts

company-related products, events, concepts, ...

This is our set of concepts to annotate content with!

Semantic relations in content

during creation/update

flows over time

as the basis for defining value

future “smart semantic” editing

Page 42: ForgetIT: Beyond the page: Giving content a meaning and value

But - what does semantic annotation mean? How does it look to us in a press release (tt_news)?

“.. to announce that the Global Toy Fair will be held in Nuremberg on February 12th, 2014. LEGO will be presenting its products in Hall ... ”

company event

common geography

common date

industry concept of a brand
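
The annotated sentence can be represented as "standoff" annotations: character spans that point into the unchanged text. The sketch below assumes that the four labels apply, in reading order, to the event name, the city, the date and the brand; that mapping, the `Annotation` class and the `annotate` helper are assumptions for illustration and not part of tt_news or the ForgetIT framework.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    start: int  # index of the first annotated character
    end: int    # index one past the last annotated character
    label: str  # concept type assigned to the span


TEXT = (
    ".. to announce that the Global Toy Fair will be held in Nuremberg "
    "on February 12th, 2014. LEGO will be presenting its products in Hall ..."
)


def annotate(text, surface, label):
    """Annotate the first occurrence of `surface` in `text`."""
    start = text.find(surface)
    if start == -1:
        raise ValueError(f"{surface!r} not found in text")
    return Annotation(start, start + len(surface), label)


annotations = [
    annotate(TEXT, "Global Toy Fair", "company event"),
    annotate(TEXT, "Nuremberg", "common geography"),
    annotate(TEXT, "February 12th, 2014", "common date"),
    annotate(TEXT, "LEGO", "industry concept of a brand"),
]

for a in annotations:
    print(f"{TEXT[a.start:a.end]!r} -> {a.label}")
```

Keeping the annotations separate from the text means the press release itself stays untouched and annotations can be added, corrected or removed later.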

Page 43: ForgetIT: Beyond the page: Giving content a meaning and value

dkd's suggestion for how to tackle this:

Treat semantics like teaching the system a foreign (company) language

Implement a semantic “overlay” within the backend, so that annotation can happen during the creation/update of content

Suggest annotations if the backend already knows a word/concept

Use these content annotations to level up DAM in TYPO3 CMS

Integrate semantic search in backend and frontend

Connect DAM to the Media Mixer from the ForgetIT Framework
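
Suggesting annotations for words the backend already knows could, in its simplest form, be a lookup against a learned vocabulary. The sketch below assumes exactly that: a small dictionary of known terms and a case-insensitive whole-word match. The vocabulary entries and the function name `suggest_annotations` are hypothetical; the actual backend may work quite differently.

```python
import re

# Hypothetical "company language" the system has learned so far.
VOCABULARY = {
    "lego": "industry concept of a brand",
    "nuremberg": "common geography",
    "global toy fair": "company event",
}


def suggest_annotations(text, vocabulary=VOCABULARY):
    """Return (start, end, matched text, concept) for each known term in text."""
    suggestions = []
    for term, concept in vocabulary.items():
        # \b keeps "lego" from matching inside longer words such as "Legoland".
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            suggestions.append((match.start(), match.end(), match.group(0), concept))
    return sorted(suggestions)


for suggestion in suggest_annotations("LEGO comes to Nuremberg"):
    print(suggestion)
```

An editor could accept or reject each suggestion, and accepted new terms would extend the vocabulary over time.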

Page 44: ForgetIT: Beyond the page: Giving content a meaning and value

Text summarization

Generation of visual summaries

• Content Detection analyzes a document to determine which sections are useful in terms of content (e.g. removing the generic menus in a web page; avoids irrelevant material biasing the summary)

• TermRaider extracts representative, weighted terms (words, entities etc.) from documents which can provide a summary (e.g. as a term cloud)
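
To illustrate what "weighted terms" can mean, the sketch below scores the terms of one document with a basic TF-IDF weighting: a term scores higher the more often it occurs in the document and the fewer documents contain it. This is a generic, swapped-in technique for illustration only; it is not a description of how TermRaider computes its weights, and the function name and sample documents are made up.

```python
import math
import re
from collections import Counter


def weighted_terms(documents, doc_index, top_n=5):
    """Return the top_n (term, score) pairs for one document using TF-IDF."""
    tokenized = [re.findall(r"[a-z]+", doc.lower()) for doc in documents]

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    n_docs = len(documents)
    counts = Counter(tokenized[doc_index])
    scores = {
        term: count * math.log(n_docs / doc_freq[term])
        for term, count in counts.items()
    }
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))[:top_n]


docs = [
    "LEGO presents new LEGO toys at the toy fair in Nuremberg",
    "The company presents quarterly results",
    "The fair opens in February",
]
for term, score in weighted_terms(docs, 0):
    print(f"{term}: {score:.2f}")
```

The resulting weights could drive the font sizes of a term cloud, with terms that appear in every document (such as "the") dropping to a weight of zero.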

Page 45: ForgetIT: Beyond the page: Giving content a meaning and value

Outlook: Semantic text composition

Semantic text editor

• Tool for inferring and suggesting semantic annotations for text while it is being composed

Page 46: ForgetIT: Beyond the page: Giving content a meaning and value

Outlook: Semantic text composition

Semantic text editor components

• Editor

− An extended version of the open-source HTML-based rich text editor CKEditor, which allows for annotating and tracking arbitrary parts of the text

• Natural Language Processing component

− Named entity recognition locates and classifies atomic elements in text into predefined categories such as people, organizations, and locations

− Coreference resolution identifies which words refer to which things in a text

− Relation extraction extracts binary relations from the text being composed

• Linked Open Data component

− Entity disambiguation distinguishes between different entities that have similar or identical names

− Relation extraction searches for relations among entities

− Context inference finds contextual information about entities mentioned in the text

Page 47: ForgetIT: Beyond the page: Giving content a meaning and value

Annotation/Contextualisation of Images

Page 48: ForgetIT: Beyond the page: Giving content a meaning and value

Image analysis

ForgetIT visual analysis technologies demonstrator

• Concept detection and feature extraction

• Visual quality assessment

• Image clustering

• Face detection

http://multimedia.iti.gr/ForgetIT/CostaRica/demonstrator.html

Page 49: ForgetIT: Beyond the page: Giving content a meaning and value

Image feature extraction and concept detection

Page 50: ForgetIT: Beyond the page: Giving content a meaning and value

Image clustering for summarization

Page 51: ForgetIT: Beyond the page: Giving content a meaning and value

Want to support the ForgetIT project?

How to get involved?

Page 52: ForgetIT: Beyond the page: Giving content a meaning and value

Ideas

Page 53: ForgetIT: Beyond the page: Giving content a meaning and value

Code contributions

Page 54: ForgetIT: Beyond the page: Giving content a meaning and value

Test and evaluate

Page 55: ForgetIT: Beyond the page: Giving content a meaning and value

Take our survey! (1/2)

Organizational Preservation

http://bit.ly/U65uL6

Page 56: ForgetIT: Beyond the page: Giving content a meaning and value

Take our survey! (2/2)

Personal Preservation

http://bit.ly/1kJPNhZ

Page 57: ForgetIT: Beyond the page: Giving content a meaning and value

Timeline

Page 58: ForgetIT: Beyond the page: Giving content a meaning and value

Timeline

2013

• D10.3

• Mockups

• Proof of concept

2014

• Architecture

• FAL

• Semantic UI / Layer

• DAM Dashboard

• Log Aggregation Toolkit

2015

• Content value framework

2016

• Final

Page 59: ForgetIT: Beyond the page: Giving content a meaning and value

D10.1

Research

Analysis

Application Design

Application Logic and Workflow

Page 60: ForgetIT: Beyond the page: Giving content a meaning and value

Mockups

Page 61: ForgetIT: Beyond the page: Giving content a meaning and value

Use Case I: Press release

• Creating a press release

• Adding metadata

• Semantic annotation

Page 62: ForgetIT: Beyond the page: Giving content a meaning and value

Page 63: ForgetIT: Beyond the page: Giving content a meaning and value

Ingest press release

Automatic annotation

• Initiated by user

• Add entity to own ontology

• Color coded according to type

Page 64: ForgetIT: Beyond the page: Giving content a meaning and value

Page 65: ForgetIT: Beyond the page: Giving content a meaning and value

Ingest press release

Manual annotation

• Selection from text or clipboard

• Add entity to own ontology

Page 66: ForgetIT: Beyond the page: Giving content a meaning and value

Use Case II: Preservation-aware digital asset management

• Searching for assets

• Managing digital assets

• Handling digital assets

Page 67: ForgetIT: Beyond the page: Giving content a meaning and value

Page 68: ForgetIT: Beyond the page: Giving content a meaning and value

Summary

Page 69: ForgetIT: Beyond the page: Giving content a meaning and value

Where to find us

http://www.forgetit-project.eu

Page 70: ForgetIT: Beyond the page: Giving content a meaning and value

Contact

@ForgetITProject

Olivier Dobberkau

[email protected]

Page 71: ForgetIT: Beyond the page: Giving content a meaning and value

Call to Action

Join our efforts in:

creating a semantic layer in TYPO3 CMS

defining the future of DAM within the TYPO3 world

establishing content value measures

preparing TYPO3 and our customers to manage forgetting and preservation of content

Page 72: ForgetIT: Beyond the page: Giving content a meaning and value

Thank you for your attention!