Goobi
description
Transcript of Goobi
Goobiweb based digitization lifecycle software
4. German – Chinese SymposiumHannover, 13.10.2008
Christian Mahnke, Goettingen state and university library
Topics
• About the GDZ (Göttinger Digitalisierungszentrum)
• Goobi– Architecture– Example Workflow– Benefits
13.10.2008
Mahnke
2
About the digitization centre (GDZ)
• Founded in 1998 as one of two German digitization centres.
• Part of the university library of Goettingen
• Known projects include:– Gutenberg Bible– EZOOLO (zoological works by Linné)– Works of C. F. Gauß – Historical travel literature– Digizeitschriften.de (journals)
13.10.2008
Mahnke
3
13.10.2008
Mahnke
4
Introduction
• Modular architecture for digital libraries
• Strong digitization focus
• Web based (production and presentation)
• Standards based
13.10.2008
Mahnke
5
History and current status
• Goobi was developed as workflow tool for the federated digitization project RusDML.– During this project it was used in a distributed model: Scanning took
place in Goettingen, metadata was gathered in Berlin and Moscow.
• It’s used under production conditions by the GDZ since 2006.
• SUB Goettingen and SLUB Dresden got an 18 month grant from the German Research Foundation (DFG) for further development.
• We are currently working on release 1.5. The main Goal for 1.5 is to be able to produce a software that is capable to fulfil the recommendations of the German Science Foundation (DFG) regarding digitization projects.
13.10.2008
Mahnke
6
Architectural overview
13.10.2008
Mahnke
7
Technical background
• Workflow software Java (J2EE) based
• Presentation PHP (Typo3) based
• File exchange via SMB/CIFS, WebDAV, NFS – SAN storage backend
• Authentification via LDAP
• METS / MODS and TEI as data formats (XML based)
13.10.2008
Mahnke
8
Features (workflow management)
• Completely web based• Integrates Windows Clients and existing
software• On demand OCR• Highly configurable• Configurable metadata mapping• Integration of server based programs and
scripts
• Complete web based metadata editor
13.10.2008
Mahnke
9
Example digitization workflow
(please note that the type and order of workflow steps is depending highly on your project)
13.10.2008
Mahnke
10
Example digitization workflow
13.10.2008
Mahnke
11
Preperation
• At first a new record is added to the catalogue by a librarian
• A barcode Scanner is used to gather the bibliographical metadata from the catalogue
13.10.2008
Mahnke
12
Image production - Task view (user perspective)
13.10.2008
Mahnke
13
Tasks ProjectsIdentifier Actions
Quality assurance (QA) and image post production
• Quality assurance and image post production is done outside of Goobi
• An image viewer is used for QA
• Photoshop and PixEdit are used for image post production
• The users access the image files via a network volume
13.10.2008
Mahnke
14
Metadata acquisition - Metadata editor (user perspective)
13.10.2008
Mahnke
15
Documentstructure
Preview image
OCR result
metadata
Examples
Presentation
13.10.2008
Mahnke
16
Presentation – Single page
13.10.2008
Mahnke
17
Zoom
Navigation
Identifier
Presentation - Structural metadata
13.10.2008
Mahnke
18
Benefits of an integrated workflow tool
13.10.2008
Mahnke
19
Benefits of an integrated workflow management system
• Transparencysee what your staff is working onsee which steps are bottle necks
• FlexibilityDefine your workflow steps depending on the needs of your project – not the other way aroundHost multiple projects in on environment
13.10.2008
Mahnke
20
Process overview (administrative perspective)
13.10.2008
Mahnke
21
Workflow steps Progress Actions
Process details (administrative perspective)
13.10.2008
Mahnke
22
Workflow steps Status
Statistics (administrative perspective)
13.10.2008
Mahnke
23
Status of each task
Statistics cont. (administrative perspective)
13.10.2008
Mahnke
24
Duration of each task
Any Questions?
13.10.2008
Mahnke
25