Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference...

23
Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director Fedora Project Cornell University
  • date post

    20-Dec-2015
  • Category

    Documents

  • view

    215
  • download

    2

Transcript of Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference...

Page 1: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Update on the Fedora ProjectWhere we’ve been and where we’re going

Fedora Users ConferenceRutgers UniversityMay 13-14 2005

Sandy PayetteCo-Director Fedora Project

Cornell University

Page 2: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Brief History

• Cornell Research (1997-present) – DARPA and NSF-funded research– First reference implementation developed– Interoperable Repositories (experiments with CNRI)– Policy Enforcement

• First Application (1999-2001)– University of Virginia digital library prototype – Technical implementation: adapted to web; RDBMS

storage– Scale/stress testing for 10,000,000 objects

• Open Source Software (2002-present)– Andrew W. Mellon Foundation grants– Technical implementation: XML and web services– Fedora 1.0 (May 2003)– Fedora 2.0 (Jan 2005)– Fedora 2.1 (coming soon!)

Page 3: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Development Team

Cornell University

• Sandy Payette (co-director)• Chris Wilper• Carl Lagoze• Eddie Shin

University of Virginia

• Thorny Staples (co-director)

• Ross Wayland• Ronda Grizzle• Bill Niebel• Bob Haschart• Tim Sigmon

Page 4: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

“Fedora Inside” Known Use Cases

• Digital Library Collections• Institutional Repository • Educational Software

• Information Network Overlay • Digital Archives and Records Management• Digital Asset Management• File Cabinet / Document Management• Scholarly publishing

Page 5: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Repository 2.0

M anage AuthN AuthZ

Access Validation Resource Index

Storage Dissemination Fie ldSearch

Fedora Repository M odules

M anag e A c c e s s Se arc hR ISe arc h(R D F )

R E S T

C lie n tA pp

Ba tchProgram

O th e rS e rv ice

W e bBrows e r

R E S T S O A PS O A P R E S T S O A PR E S T

O A IP ro v ide r

R E S TWeb Services

Exposure

Page 6: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora – what’s new(version 2.0)

• FOXML (Fedora Object XML)– Simple XML format directly expresses Fedora object model– Easily adapts to Fedora new and planned features– Easily translated to other well-known formats

• Enhanced Ingest/Export of objects– FOXML, METS (Fedora extension)– Extensible to accommodate new XML formats– Planned: METS 1.4, MPEG21 DIDL

Page 7: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora 2.0 (continued)

• Object-to-object Relationships– Ontology of common relationships (RDF schema)– Relationships stored in special datastream (RELS-EXT)

• Resource Index (RI)– RDF-based index of repository (Kowari triple-store)– Graph-based index includes:

– Object properties and Dublin Core– Object Relationships– Object Disseminations

• RI Search (Search the repository as a graph)– Powerful querying of graph of inter-related objects– REST-based query interface (using RDQL or ITQL)– Results in different formats (triples, tuples, sparql)

Page 8: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

info :fe do ra/im age :1 1

la stM odDate

hasMem ber

hasM em ber

hasR ep

hasR ep

info:fe dora/ imag e :1 1 /B LDG

in fo:fe dor a/ imag e :1 1 /bde f:2 /g e tR e latedLe tte r

hasRep

i n fo:fe dor a/c ol l e c tion :1 / bde f:1 /MEMB ERS

info :fe do ra/im age :1 2

in fo :fe do ra/c o lle c tio n :1

la stM odDatehasR ep

"2 0 0 5 - 0 1 - 1 0 :1 1 :0 2 "

"2 0 0 5 - 0 2 - 0 1 :1 2 :0 5 "

lastModD

a te

"2 0 0 5 - 0 1 - 0 1 :1 0 :0 0 "

dc :c rea

tor

"E lly C r am er "

dc:crea to r

"C h r is W ilp er "

in fo:fe dor a/ imag e :1 2 /B LDG

dc:creato r

"E d d ie S h in "

in fo:fe dor a/ imag e :1 2 /bde f:2 /g e tHIGH

hasR ep

Fedora Objects – RDF Graph view

CollectionObject

MemberObject

Page 9: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora 2.0 (continued)

• New Utilities• Batch Modify Utility • Repository Administrator Reporting

• Performance Tuning (~1 million objects)– Ingest testing (800K objects; 40 millisec/object)– Concurrency testing (access requests)

• Communications and Outreach• New Fedora Web Site• Improved Documentation• Tutorials

Page 10: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Preview Fedora 2.1 (May 2005)

• Authentication plug-ins– HTTP Basic auth – Tomcat realms and login modules

• Plug-in #1 : Tomcat user/password file or database• Plug-in #2 : LDAP tie-in• Plug-in #3 : Radius Authentication

• Support for SSL

• Authorization module– XML-based policies using XACML– Repository-wide policies– Object-specific policies– Fine-grained policy enforcement

• API actions X subject attributes X object attributes

Page 11: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Authorization: Example Policies

• Repository Policy– Deny access to all API-M methods if not “administrator” – Deny access to the deleteDatastream method if not “author”

• Specific Object Policy– Deny access to object “uva:100” if user is not Thorny.

• Group Object Policy– Deny access to the “getFullArticle” dissemination of objects

whose content model is “journal-article” if not “faculty”– Deny access to the “secret” datastream if user is “guest”

Page 12: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Authorization: Example Policies

• Time-oriented Policy– Permit students access to “answers” datastream of

learning object cs:125 after May 15, 2005– Deny all access learning object cs:125 after June 15, 2005

• Backend Service Security Policy– Deny callback by external service represented by

Bmech:10

Page 13: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Preview Fedora 2.1 (May 2005)

• Policy Builder Client– Simple user interface to create access policies– Automatically generate XACML

• Handle Generation Plug-in (PIDs as Handles)

• Enhanced OAI Provider Service (prOAI)– Harvest multiple metadata formats– Harvest datastreams and disseminations– Support for incremental harvest by modified date– Support for OAI sets– Highly configurable via queries against Resource Index

• Directory Ingest Service (and client tool)– Facilitate ingest of hierarchical directories of files– Submit files as .zip or .jar (with a METS manifest)– Automatically asserts parent-child relationships in RELS-EXT– Stages content and ingests as FOXML objects into repository

Page 14: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Service Framework(beginning Fedora 2.1)

F e d o ra Re p o sito rySe rv ice

OAIProvide rSe rvice

Dire ctoryInge st

Se rvice

Futur e

Ser v ice

Serv ices

Apps

Other

Ser v ice

Administrator

Other

Ser v ice

Futur e

Ser v ice

PolicyBuilder

ZIP or JARinput

Page 15: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Service Framework(2005-2006)

F e d o ra Re p o sito rySe rv ice

Serv ices

Apps

P re se rva tionInte grityS e rvice

Ex te rna lW orkflow

JHOV E

GDFR

Ba sicW orkflowS e rvice

Dialog Box Name

O KTex t:

Tex t

Tex t

Tex t

Tex t

Tex t

Canc el

H elp

Sample Text Here Sample Text Here Sample TextHere Sample Text Here Sample Text Here SampleText Here Sample Text Here Sample Text HereSample Text Here Sample Text Here

S am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t HereS am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t HereS am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t Here

Fedora-Web-IRAdministrator

OAIP rovide rS e rvice

Dire ctoryInge st

S e rvice

W e b-ba se dsubm ission a ndba sic w orkflow

Dyna m icDisse m ina tor

S e rvice

PolicyBuilder

Fe doraS e a rchS e rvice

Other

Ser v ice

Page 16: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Fedora Service Framework(2006-2007)

Serv ices

Apps

P re se rva tionInte grityS e rvice

Ex te rna lW orkflow

JHOV E

GDFR

Ba sicW orkflowS e rvice

Dialog Box Name

O KTex t:

Tex t

Tex t

Tex t

Tex t

Tex t

Canc el

H elp

Sample Text Here Sample Text Here Sample TextHere Sample Text Here Sample Text Here SampleText Here Sample Text Here Sample Text HereSample Text Here Sample Text Here

S am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t HereS am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t HereS am ple Tex t Here S am ple Tex t Here S am ple Tex t Here Sam ple Tex t Here

Fedora-Web-IRAdministrator

OAIP rovide rS e rvice

Dire ctoryInge st

S e rvice

W e b-ba se dsubm ission a ndba sic w orkflow

Pre s e rvationM onitor ing

Se rvice

Eve ntNotification

Se rvice

Fe doraS e a rchS e rvice

Dyna m icDisse m ina tor

S e rvice

PolicyBuilder

Other

Ser v ice

FedoraRepos itory

S erv ice

Fe dora Re po sito rySe rv ice

Fe de rationPID Re s olution

Se rvice

FedoraR epository

Serv ice

Page 17: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

You asked…

• “We wish for a out-of-box” end-user client for Fedora.”

• “Can’t you put the DSpace interface on top of a Fedora repository?”

• “We need something to show people Fedora right away (before we get $$ for development resources).”

• “We love Fedora. It would be really great if you distributed a default end-user client.”

Page 18: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

The Answer: FedoraWebIR Client

• Web-based client for “institutional repository”• Configurable• End-user submission• Object creation template for “content

models” • Basic Workflow• Search/Browse• Easy configuration of access policy

• Development to begin this summer.

Page 19: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

More Dev-Team Priorities

• Federated Repositories• Federation with other repositories (DSpace, aDORE)

– note the Cornell/LANL NSF Pathways project.

• Fedora Showcase and News (on new website)• “Content Model” Specification Language• Advanced Object Creation Workbenches• Tools for RDF browse and graph traversal• Performance Tune – millions of objects• Web services security and Shibboleth• Code Refactoring

• Fedora as web app (.war)• MVC2 pattern for REST-based web exposures• Other misc.

Page 20: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Preview Dinner Talk

• Fedora Development Consortium– Advisory Board– Commission Working Groups– Prioritize Development

• Sustainability Models• Collaborative Development• User-contributed Tools, Apps,

Services

Page 21: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

New Fedora Web Site!

www.fedora.info

Page 22: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Solution: Information Network Overlay

DataStores

DocumentRepositories

Databases

WebResources

PublisherRepositories

Information Network API

Source Layer

NetworkRepresentation

Layer

Client Layer

Page 23: Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May 13-14 2005 Sandy Payette Co-Director.

Process-orientation - workflows

Validatebyte-

streams

Ingestto

Repo

Link to Simulation

Service

AssignAccessPolicy

Indexand

Register

Ingest-oriented process

VisitThe

Doctor

FormatMigration

ObjectVersioning

In Repo

MakeCopies

IngestTo

ArchivePreservation-oriented process

IngestTo

Archive

SIP

DigitalObject

World of Services