Goobi

Post on 26-May-2015


Goobi: web-based digitization lifecycle software. 4th German-Chinese Symposium, Hannover, 13.10.2008. Christian Mahnke, Goettingen State and University Library

Transcript of Goobi

Goobi: web-based digitization lifecycle software

4th German-Chinese Symposium, Hannover, 13.10.2008

Christian Mahnke, Goettingen State and University Library

Topics

• About the GDZ (Göttinger Digitalisierungszentrum)

• Goobi
– Architecture
– Example workflow
– Benefits


About the digitization centre (GDZ)

• Founded in 1998 as one of two German digitization centres.

• Part of the university library of Goettingen

• Known projects include:
– Gutenberg Bible
– EZOOLO (zoological works by Linné)
– Works of C. F. Gauß
– Historical travel literature
– Digizeitschriften.de (journals)


Introduction

• Modular architecture for digital libraries

• Strong digitization focus

• Web based (production and presentation)

• Standards based


History and current status

• Goobi was developed as a workflow tool for the federated digitization project RusDML.
– During this project it was used in a distributed model: scanning took place in Goettingen, and metadata was gathered in Berlin and Moscow.

• It has been used in production by the GDZ since 2006.

• SUB Goettingen and SLUB Dresden received an 18-month grant from the German Research Foundation (DFG) for further development.

• We are currently working on release 1.5. The main goal for 1.5 is software that fulfils the recommendations of the German Research Foundation (DFG) regarding digitization projects.


Architectural overview


Technical background

• Workflow software: Java (J2EE) based

• Presentation: PHP (TYPO3) based

• File exchange via SMB/CIFS, WebDAV and NFS, with a SAN storage backend

• Authentication via LDAP

• METS/MODS and TEI as data formats (XML based)
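To make the METS/MODS bullet concrete, here is a minimal sketch of a METS document that wraps one MODS record, built with Python's standard library. It is not taken from Goobi: the function name `build_mets`, the ID `DMD_0001`, the title and the identifier are assumptions for illustration. The result is also not schema-complete, since a valid METS file needs at least a structMap.

```python
import xml.etree.ElementTree as ET

# Official namespace URIs of the two standards.
METS_NS = "http://www.loc.gov/METS/"
MODS_NS = "http://www.loc.gov/mods/v3"


def build_mets(title: str, identifier: str) -> ET.Element:
    """Return a minimal METS root element that wraps one MODS record.

    Hypothetical helper for illustration only; it omits the structMap
    and file sections that a complete METS file would carry.
    """
    ET.register_namespace("mets", METS_NS)
    ET.register_namespace("mods", MODS_NS)

    mets = ET.Element(f"{{{METS_NS}}}mets")
    # Descriptive metadata section holding the bibliographic record.
    dmd = ET.SubElement(mets, f"{{{METS_NS}}}dmdSec", {"ID": "DMD_0001"})
    wrap = ET.SubElement(dmd, f"{{{METS_NS}}}mdWrap", {"MDTYPE": "MODS"})
    xml_data = ET.SubElement(wrap, f"{{{METS_NS}}}xmlData")

    # The embedded MODS record: a title and a local identifier.
    mods = ET.SubElement(xml_data, f"{{{MODS_NS}}}mods")
    title_info = ET.SubElement(mods, f"{{{MODS_NS}}}titleInfo")
    ET.SubElement(title_info, f"{{{MODS_NS}}}title").text = title
    ET.SubElement(
        mods, f"{{{MODS_NS}}}identifier", {"type": "local"}
    ).text = identifier
    return mets


root = build_mets("Example title", "EXAMPLE-0001")
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

METS acts as the container here and MODS carries the bibliographic description, which is the usual division of labour between the two formats.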


Features (workflow management)

• Completely web based

• Integrates Windows clients and existing software

• On-demand OCR

• Highly configurable

• Configurable metadata mapping

• Integration of server-based programs and scripts

• Complete web-based metadata editor
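As a rough illustration of what a configurable metadata mapping could look like, the sketch below renames the fields of a catalogue record through a lookup table. It does not show Goobi's actual configuration format: every field name, the `DEFAULT_MAPPING` table and the `map_record` function are assumptions.

```python
from typing import Dict

# Hypothetical mapping: catalogue field name -> target metadata name.
# In a configurable system this table would be loaded from a
# project-specific configuration file instead of being hard-coded.
DEFAULT_MAPPING: Dict[str, str] = {
    "cat_title": "title",
    "cat_author": "creator",
    "cat_year": "date_issued",
}


def map_record(record: Dict[str, str], mapping: Dict[str, str]) -> Dict[str, str]:
    """Rename the fields of a catalogue record according to the mapping.

    Fields that have no entry in the mapping are dropped.
    """
    return {mapping[key]: value for key, value in record.items() if key in mapping}


# Made-up catalogue record; "cat_shelfmark" has no mapping entry.
catalogue_record = {
    "cat_title": "Example title",
    "cat_year": "1801",
    "cat_shelfmark": "X 123",
}
mapped = map_record(catalogue_record, DEFAULT_MAPPING)
print(mapped)
```

Keeping the mapping in data rather than in code is what lets a project change it without touching the program.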


Example digitization workflow

(Please note that the type and order of workflow steps depend heavily on your project.)



Preparation

• First, a librarian adds a new record to the catalogue

• A barcode scanner is used to retrieve the bibliographic metadata from the catalogue


Image production - Task view (user perspective)

Screenshot labels: Tasks, Projects, Identifier, Actions

Quality assurance (QA) and image post production

• Quality assurance and image post production are done outside of Goobi

• An image viewer is used for QA

• Photoshop and PixEdit are used for image post production

• The users access the image files via a network volume


Metadata acquisition - Metadata editor (user perspective)

Screenshot labels: Document structure, Preview image, OCR result, Metadata

Examples

Presentation


Presentation – Single page

Screenshot labels: Zoom, Navigation, Identifier

Presentation - Structural metadata


Benefits of an integrated workflow tool


Benefits of an integrated workflow management system

• Transparency
– See what your staff is working on
– See which steps are bottlenecks

• Flexibility
– Define your workflow steps depending on the needs of your project, not the other way around
– Host multiple projects in one environment
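The two benefits above can be sketched in a few lines: each project defines its own ordered list of steps, and counting the processes waiting at each step shows where work piles up. This is a simplified model, not Goobi's implementation. The `Process` class, the `WORKFLOWS` table and the project names are assumptions; the step names are borrowed from the example workflow in this talk.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical per-project workflow definitions (flexibility):
# each project chooses its own steps and their order.
WORKFLOWS = {
    "project_a": ["Image production", "Quality assurance",
                  "Metadata acquisition", "Presentation"],
    "project_b": ["Image production", "Metadata acquisition", "Presentation"],
}


@dataclass
class Process:
    """One item (for example a book) moving through a workflow."""
    identifier: str
    steps: List[str]  # ordered step names of the owning project
    done: int = 0     # number of steps already completed

    def current_step(self) -> Optional[str]:
        """Return the step the process is waiting at, or None if finished."""
        return self.steps[self.done] if self.done < len(self.steps) else None


def open_steps(processes: List[Process]) -> Counter:
    """Count unfinished processes per current step (transparency)."""
    return Counter(
        p.current_step() for p in processes if p.current_step() is not None
    )


def bottleneck(processes: List[Process]) -> Optional[str]:
    """Return the step with the most waiting processes, or None."""
    counts = open_steps(processes)
    return counts.most_common(1)[0][0] if counts else None


processes = [
    Process("a-1", WORKFLOWS["project_a"], done=1),  # at Quality assurance
    Process("a-2", WORKFLOWS["project_a"], done=1),  # at Quality assurance
    Process("b-1", WORKFLOWS["project_b"], done=1),  # at Metadata acquisition
    Process("b-2", WORKFLOWS["project_b"], done=3),  # finished
]
print(open_steps(processes))
print(bottleneck(processes))
```

Both projects share one environment in this sketch, and the bottleneck query works across them because it only looks at each process's current step.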


Process overview (administrative perspective)

Screenshot labels: Workflow steps, Progress, Actions

Process details (administrative perspective)

Screenshot labels: Workflow steps, Status

Statistics (administrative perspective)

Screenshot label: Status of each task

Statistics cont. (administrative perspective)

Screenshot label: Duration of each task

Any Questions?


Thank You!

Contact: mahnke@sub.uni-goettingen.de
http://gdz.sub.uni-goettingen.de/
