Experimental Workflow Development in Digitisation

11

Click here to load reader

description

Experimental Workflow Development in Digitisation 2nd Qualitative and Quantitative Methods in Libraries International Conference (QQML2010), 25-28 May 2010, Chania, Greece.

Transcript of Experimental Workflow Development in Digitisation

Page 1: Experimental Workflow Development in Digitisation

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

Mustafa Dogan (Göttingen State and University Library)Clemens Neudecker (Koninklijke Bibliotheek)Gerd Zechmeister (Austrian National Library)Sven Schlarb (Austrian National Library)

Experimental workflow developmentin digitisationThe concept of collaborative workflow development in the IMPACT project

Page 2: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

2

Agenda Background of IMPACT Digitisation workflows Collaborative workflow development Architectural principles Workflow development platform Key success factors Outlook and future scenarios

Page 3: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

3

Background of IMPACT Project partners

– 26 Libraries, Research Institutes and Industry Partners Main objective

– Improve access to historical books and newspapers printed before 1900

Software tools and prototypes– Image Enhancement & Segmentation Toolkit– Improved ABBYY FineReader OCR Engine, IBM Adaptive OCR– Post-processing and -correction modules– Lexical resources for several European languages

Support to the MLA community– Best Practises & Strategic/Operational Guidelines– Online Helpdesk– Tool Showcases & Demonstrators– Centre of Competence

Page 4: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

4

Digitisation workflows Digitisation: a sequence of steps, from selection of

analogue source material to presentation of digital objects for end-users

Workflow: software-based execution of a sequence without human interaction

Challenges and barriers– Workflows are tailored to specific needs– Lack of interoperability for applied software and input/outdata

data– Lack of collaboratively used and developed resources and

expertise

Page 5: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

5

Collaborative workflow development Workflow Development as a community-driven activity

using an experimental platform Scientific workflows: using web services representing

individual software modules (Shiyong Lu et al. 2009) Providing highly innovative and efficient tools to a wider

community to design workflows Technical staff providing the platform,

conceptual/library staff designing workflows Using Web 2.0 features to share and expand knowledge

and resources

Page 6: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

6

Architectural platform principles Modularity Transparency Flexibility Extensibility Open standards based Accessibility Scalability Collaboration

Page 7: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

7

Community

ArchivesMuseums Libraries …..

Component A Component B Component C Component D

Workflow Registry

Workflow 1 Workflow 2 Workflow 3

Experimental workflow development platform

Workflow development platform

rate

modify

comment

compare

share

measure

Page 8: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

8

Workflow development phases

Cen

tral

Dat

a R

epos

itory

Select

Design

Execute

Evaluate

Workflow Development Workbench

Page 9: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

9

Evaluation criteria OCR: correctly recognised characters/words Segmentation: correctly identified text and graphical

regions Workflows: comparing workflows and identifiying most

suitable Statistical and provenance data: e.g. processing time

Page 10: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

10

Outlook Keys to success

– Joint effort by library and software development staff– Usability of tools and platform– Incentive to collaborative work– Testing and adaptation of workflows– Permanently tailoring and optimizing workflows

Future work– Demonstration of current (web) services– Experimental platform as sustainable resource for a Centre of

Competence for the MLA community

Page 11: Experimental Workflow Development in Digitisation

27.5.2010 QQML Chania/Greece

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.

11

Thank you very much!

Contact:Project Website: http://www.impact-project.euProject Office: [email protected]