Simon Waddington BL RIC WORKSHOP 22032011

Post on 06-Dec-2014

934 views 1 download

Tags:

description

 

Transcript of Simon Waddington BL RIC WORKSHOP 22032011

CLIF Project22 March 2011

Simon WaddingtonCentre for e-Research (CeRch)King’s College London

1

2

CLIF Project• CLIF - Content Lifecycle Integration Framework• Funded by JISC

• 01 July 2009 – 31st March 2011• University of Hull• King’s College London

• Centre for e-Research• Software to be released under open source

licence• Currently in evaluation phase

3

Objectives• Study role of digital repositories in the content

lifecycle

• Understand how cross-system user activity can be supported by the repository

• Investigate where the repository is best positioned in the content lifecycle• Includes use as a staging area

• Investigate:• The flow of content into the repository for preservation • The retrieval of content from the repository for reuse.

• Understand usage of integrated system in context of institutional preservation policies

4

Use cases• Use cases cover research, teaching and

administration• Learning materials

• Examination papers - prepared in SharePoint, and copied to a public repository.

• Exams may be carried out online via VLE• Student solutions and marks stored in private repository

• Research publications and datasets• Publications – preparation in collaboration system

(SharePoint, Sakai). • After approval, publication is public repository• Data associated with research paper stored in private

repository folder

5

System overview

6

SharePoint – Fedora integration • Microsoft Office SharePoint Server 2007

• Fedora Commons repository v3.4• Open source application developed by DuraSpace

• Aim to provide a “reference implementation”• Produce components that are reusable in production

deployments• SharePoint MySite used as basis of integration• Provides administrative and end user interfaces• Creation of MySite for new user automatically

deploys CLIF solution – “feature stapling”• Includes creation of Fedora repository user account and

private folders • Enabled multiple optional features (e.g. for deposit)

that can be deployed according to user requirements

7

Fedora repository• Fedora provides only a

repository platform • Front ends – Hydra, Islandora, ..

• Uses Hydra content models (simple Fedora objects)

• Hierarchical folder structure• Private and public repository

folders• Access management handled

by XACML policies created within SharePoint

8

Deposit to Fedora• Deposit options selected from menu on item in

document library• Copy to Repository

• Copies content item and metadata to private repository folder

• User can browse to repository folder• Bulk copy – copies multiple items

• Move to Repository• Moves content item and metadata to private repository

folder• Replaces item in document library by a hyperlink• Places copy of hyperlink in Archive List (on MySite sidebar)

• Publish to Repository• Copies item to public repository• Default is to run approval workflow• Option to provide MODS metadata entry form for entry of

detailed preservation metadata

9

Retrieval of documents• Retrieval of moved documents

• Documents (not metadata) can be retrieved by selecting hyperlink in Archive List

• Search – based on SharePoint indexing• Search of document metadata in SharePoint• Full text search of documents in SharePoint• Search of document metadata in Fedora

• Currently restricted to retrieval of simple Fedora objects• Required to implement compund and complex objects

to provide full repository browse functionality

10

Configuration options• Document list contains “project” column

• User can configure list of active projects from link on MySite sidebar

• Documents can be associated to specific projects• Publication of documents

• Administrator can configure locations of public repository folders

• On publication, user can select from list of allowable locations

• Set access level • Selectable from document context menu• Enables private documents to be shared

11

Implementation• Implementation uses C# middleware

• Wrapper for Fedora API-M and API-A web services• Performs Fedora object creation in FOXML schema (native

Fedora format)• Document metadata extracted from SharePoint and

mapped to Dublin Core• Additional Policy.xml created to handle access• Generates RDF relations between Fedora objects to

create repository folder structure "RELS-EXT"

• SharePoint UI implemented as features• Features can be selectively deployed by administrator

according to requirements of specific user groups.

12

Further developments• SharePoint 2010 porting

• Currently under investigation• Basic framework including Hydra content object creation

can be directly ported• Packaging as RIC plug-in?

• Search• Integrate with Solr indexing to provide full text search of

Fedora • Repository browse functionality

• Need to handle compound/complex Fedora objects• Verification of Fedora objects for Hydra compatibility

• Security• User account creation and management in Fedora

• Excel web services• Provide features to support processing and archival of

research data using shared Excel spreadsheets

13

Further Information

Project website

http://www2.hull.ac.uk/discover/clif.aspx

14

Demo

Available today