An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009,...

54
An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009

Transcript of An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009,...

Page 1: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

An Iterative Approach to Building Sustainable Repository Services on Fedora

       

Open Repositories 2009, May 19, 2009

Page 2: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Outline

• Organizational overview and backgroundo Claire Stewart, Head, Digital Collections

• Winterton Collection project o Karen Miller, Monographic and Digital Projects Cataloger, Bibliographic Services

• Iterative approacho Bill Parod, Repository Architect, Enterprise Systems

Page 3: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Parallel committee/department structure

Page 4: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 5: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

may 2008may 2008

Repository Implementation Group project schedule

Page 6: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Repository Implementation Group project schedule

may 2009

Page 7: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

EAD to images to EAD+images

Page 8: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Winterton Collection cataloging

• Full cataloging for each of the 76 original collections and at the container level (album, envelope, etc.) for collections of more than one container.

• Individual photographs are not (generally) cataloged fully:o Titleo Note (optional)o Publisher or Creator (if available)

Page 9: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

• Full cataloging includedo Titleo Dates of coverageo Abstracto Scope and contents descriptiono Biographical or historical noteo Physical description (size of album, how many

pages, photos, etc.)o Subject headings

Page 10: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Providing cataloging at the album level means that • Many individual photographs will not be

described concisely by the subject headings assigned.

• Some subject headings may not apply at all to some photographs.

Page 11: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Transcribing only the photograph titles results in such problems as these when keyword searching:• Non-English words are not translated• People referred to in captions by their initials, not

names• Animals referred to by given name, not by

species• Non-descriptive captions

Page 12: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 13: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 14: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

A.E.B. and his well-identified crowd

Page 15: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

"Enmei and his rhino"

Page 16: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 17: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 18: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 19: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 20: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 21: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 22: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 23: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 24: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 25: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 26: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 27: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 28: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 29: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 30: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 31: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Repository Development Strategy

1. Implement models and services for ingest, preservation, and access of core content.

2. Provide tools for staff to ingest and manage repository content. 

3. Facilitate integration of repository materials with end-user tools and services.

4. Iterate…

Page 32: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Draw Detailed Requirements from Project Commitments:A) OAI-ORE Annotation of OCA texts B) Cross Collection Search ProjectC) Winterton Photography CollectionD) Kirtas Mounting Books ProjectE) EAD InitiativeF) Hesler Photography CollectionG) Chemical BulletinH) Fava Masks I) Curator-driven Digitization ProjectJ) Charlotte Moorman / Prgm. African Studies Audio 

Page 33: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Inventory Content Types

1) EAD encoded finding aids2) TEI encoded text transcriptions3) High resolution images4) Virtual crops of high resolution images5) Page imaged books6) 3D objects7) Aggregations: full text, fielded, and faceted search8) Audio 9) Video

Page 34: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Project / Content Type Matrix

 

Page 35: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Services by Content Type

Text ServiceImage ServiceMetadata Conversion ServiceDiscovery Service

Page 36: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Text ServiceEAD Objects

EAD Disseminator Methods:getEADHeadergetComponentAsHTML(unitid)getComponentStructuregetChildComponents(unitid)getComponentsgetComponentStructure(unitid)getAncestorComponents(unitid)getComponentChildrenAsJSON(unitid)getComponentAsEmbeddedHTML(unitid)getComponent (unitid)getElementById (xml:id)getArchDescNoComponentsgetElementsByName(element_name)getDigest(unitid)getComponentAsDC(unitid)getComponentAsMODS(unitid)reindex

Datastreams:DC MODSEADEAD to DC XSL EAD to MODS XSLEAD to HTML XSLEAD to HTML Frag XSLEAD Children to JSON XSLRELS-EXT

 TEI Objects

TEI Disseminator Methods:getTOCgetImageTextTOCgetStructuredTextTOCgetHeader(xml:id)getHeadinggetChunk(xml:id)getPageByNumber(pageOrdinal)getPageByID(xml:id)reindex

Datastreams:DC MARCXMLDejaVuBook ORE REMPage Image ORE REMTEIRELS-EXT

Page 37: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

EAD Objects

EAD Service methods:getEADHeadergetComponentAsHTML(unitid)getComponentStructuregetChildComponents(unitid)getComponentsgetComponentStructure(unitid)getAncestorComponents(unitid)getComponentChildrenAsJSON(unitid)getComponentAsEmbeddedHTML(unitid)getComponent (unitid)getElementById (xml:id)getArchDescNoComponentsgetElementsByName(element_name)getDigest(unitid)getComponentAsDC(unitid)getComponentAsMODS(unitid)reindex

Datastreams:DC MODSEADEAD to DC XSL EAD to MODS XSLEAD to HTML XSLEAD to HTML Frag XSLEAD Children to JSON XSLRELS-EXT

Page 38: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Fedora Text Disseminator

getComponent: unitidgetComponentAsHTML: unitidgetComponentAsDC: unitidgetComponentAsMODS: unitid....reindex

SGREPServlet

Encapsulate query syntaxXSLT optional on query result

SGREP : Executable program on service host

Add Fedora Disseminator MethodsAdd Fedora Disseminators

Add/Modify XSLT Processing on RetrievalAdd/Modify SGREP Queries

Replace Retrieval Software

Text Service Stack Enhancement Options

Examples: EAD “Digest”- C0n + title/id of children and ancestorsJSON support for EXT-JSHTML design iterationEAD to MODS conversion maturation

Page 39: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Image Service: Source Image

 

Page 40: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Cropped Image

Single Image File Referenced By Crop Information:

<svg:svg xmlns:xlink="http://www.w3.org/1999/xlink"  xmlns:svg="http://www.w3.org/2000/svg">    <svg:image x="0" y="0" width="10656" height="7992"  xlink:href="inu-wint/inu-wint-22.30.jp2"><svg:clipPath><svg:rect x="0" y="1166" width="8034" height="6036"/></svg:clipPath></svg:image></svg:svg>

Page 41: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

CroppedPhoto

Single Image File Referenced By Crop Information:

<svg:svg xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:svg="http://www.w3.org/2000/svg">   <svg:use         xlink:href="http://repository.library.northwestern.edu/fedora/get/inu:inu-wint-22-30/DELIV-OPS">                  <svg:clipPath>                     <svg:rect x="4246" y="1436" width="2997" height="2518"></svg:rect>                  </svg:clipPath>               </svg:use>            </svg:svg>

Page 42: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Image and Crop ObjectsImage Service methods (supported by both image and crop objects):

getWithWidth(width)getWithLongSide(length)getWithHeight(height)getCropWithWidth(x,y,width, height,destwidth)getCropWithHeight(x,y,width, height,destheight)getCropWithSize(x,y,width,height, destwidth , destheight)getWithSize(destwidth , destheight)

Image Object Datastreams:DCMODSPREMISSVGTIFFEXIFJP2MIX_TIFFMIX_JP2RELS-EXT

Crop Object Datastreams:DCMODSPREMISSVGRELS-EXT

Page 43: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

http:/.../fedora/get/inu:inu-wint-22-30-2/inu:sdef-addimage/getWithLongSide?length=150

Page 44: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Fedora Image Disseminator

getWithWidth(width)getWithLongSide(length)getWithHeight(height)getCropWithSize(x, y, width, height, destwid…)

Image Servlet

Encapsulate rendering parametersObject specific rendering parameters (SVG)User request rendering parametersRendering service parameters and location

Rendering Service : Aware, DJatoka

Add Fedora Disseminator MethodsAdd Fedora Disseminators

Add/Modify Rendering OptionsAdd/Modify Rendering Service Parameters

Replace Rendering Software

Image Service Stack Enhancement Options

Examples: Added getLongSide(length)Added rotationOptimized rendering parametersRendering features - vector overlayObject reference chainingDjatoka experimentation

Page 45: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

EAD Objects

EAD Service methods:

getEADHeadergetComponentAsHTML(unitid)getComponentStructuregetChildComponents(unitid)getComponentsgetComponentStructure(unitid)getAncestorComponents(unitid)getComponentChildrenAsJSON(unitid)getComponentAsEmbeddedHTML(unitid)getComponent (unitid)getElementById (xml:id)getArchDescNoComponentsgetElementsByName(element_name)getDigest(unitid)getComponentAsDC(unitid)getComponentAsMODS(unitid)reindex

Datastreams:DC MODSEADEAD to DC XSL EAD to MODS XSLEAD to HTML XSLEAD to HTML Frag XSLEAD Children to JSON XSLRELS-EXT

Image/Crop Objects

Image Service methods:

getWithWidth(width)getWithLongSide(length)getWithHeight(height)getCropWithWidth(x,y,width,

height,destwidth)getCropWithHeight(x,y,width,

height,destheight)getCropWithSize(x,y,width,height,

destwidth , destheight)getWithSize(destwidth , destheight)

Datastreams:DCMODSPREMISSVGTIFFEXIFJP2MIX_TIFFMIX_JP2RELS-EXT

Page 46: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Searching

• SOLR• MODS described collections• Metadata conversion services• Faceting• “Searchable” Interface

o MODS Collection Datastreamo Facet listo Field List

Page 47: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 48: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 49: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Project Checklist

A) OAI-ORE Annotation of OCA texts B) Cross Collection Search ProjectC) Winterton Photography CollectionD) Kirtas Mounting Books ProjectE) EAD InitiativeF) Hesler Photography CollectionG) Chemical BulletinH) Fava Masks I) Curator-driven Digitization ProjectJ) Charlotte Moorman / Prgm. African Studies Audio 

Page 50: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
Page 51: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Next Step:Collection Management Tools

EAD Ingest Processing

Page 52: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Image Ingest Processing

Page 53: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Heterogeneous Ingest Processing

Page 54: An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Ingest Management Tools

Curator-selected Ingest (Ad-hoc collections)Digital Image Library (Art Slide Library +)Kirtas Book Scanning

Mounting Books Project - OR09 Tuesday, 3:00: Session 8B