IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database...

25
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator

Transcript of IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database...

Page 1: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

IUScholarWorks Technical Overview

Randall FloydDigital Library Program

Programmer/Database Administrator

Page 2: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Questions:

• What is DSpace?• Why DSpace? • What technology does DSpace use?

• How does DSpace use metadata?

Page 3: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Other Interesting Things

• Customizations• Challenges

Page 4: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Terminology

• DSpace, IUScholarWorks, Institutional Repository, Open Access …

Page 5: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Terminology

• DSpace, IUScholarWorks, Institutional Repository, Open Access …

• IUScholarWorks

Page 6: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Terminology

• DSpace, IUScholarWorks, Institutional Repository, Open Access …

• IUScholarWorks is an Open Access

Page 7: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Terminology

• DSpace, IUScholarWorks, Institutional Repository, Open Access …

• IUScholarWorks is an Open Access Institutional Repository

Page 8: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Terminology

• DSpace, IUScholarWorks, Institutional Repository, Open Access …

• IUScholarWorks is an Open Access Institutional Repository built on DSpace

Page 9: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

What is DSpace?

According to DSpace marketing:

• “DSpace is a groundbreaking digital library system to capture, store, index, preserve, and redistribute all your scholarly research material in digital formats.”

Page 10: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
Page 11: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

What is DSpace?

• DSpace is freely available open source software jointly developed by MIT Libraries and Hewlett-Packard Labs as a solution for creating various kinds of digital repositories

Page 12: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Why DSpace?• Other competing solutions: EPrints, ProQuest's Digital Commons@

• Performed full evaluation of solutions

• Important Criteria– Features– Ownership/support: local vs. service provider

– Technology– Cost is always a factor

Page 13: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Why DSpace?• Choice also influenced by current DLP infrastructure initiatives

• Hard choice since DSpace was just emerging and "rough around the edges"

• Eventually you have to get started --sometimes you just have to choose something, even if it isn't perfect

Page 14: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

That's our story and we're stuck with it

-- Stanley Laurel

Page 15: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

What technology does DSpace use?

• It is a Java web application– Java servlets in an application server– Java Server Pages render the interfaces– Look and feel from Cascading Style Sheets

• Metadata is stored in a relational database– PostgreSQL by default, Oracle also supported

Page 16: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

What technology does DSpace use?

• The document objects are stored as "bitstreams" on disk

• Persistent identifiers to items are provided by CNRI Handle system– http://hdl.handle.net/2022/131

• Searchable indexes– Apache Lucene text search engine– indexes both metadata and document text

Page 17: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

How does DSpace use metadata?

• Based on Dublin Core • Submitters fill out forms with fields representing Dublin Core elements

• Fully customizable through the Dublin Core Registry

• Searchable via basic and advanced search controls

Page 18: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

How does DSpace use metadata?

• Dublin Core Metadata available via OAI-PMH server– Open Archives Initiative – Protocol for Metadata Harvesting

• Can be indexed and searched by Google– in normal fashion by crawling communities and browse pages

– via Google Scholar and OAI-PMH server

Page 19: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Other Interesting Things

• Customizations• Challenges

Page 20: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Customizations• Interface and Branding

– Top and side "news" bars maintained through the administrative interface

– Labels and textual elements are customizable via a single file

– Minor style sheet changes to affect overall appearance

• Submission Process– Form labels and target elements configurable via XML configuration file

Page 21: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Customizations

• IU CAS Authentication– No need to maintain separate user accounts and passwords

– Transparently adds IU user to DSpace when accessing for the first time

Page 22: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Researcher Pages• Is a DSpace add-on developed by University of Rochester

• UR discovered that communities wanted IR to be more people-centric– DSpace is content-centric

• Creates "cookie-cutter" mini web pages about authors

• Authors can link to both internal DSpace items and external URLs

• Example Researcher Page at University of Rochester

Page 23: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Challenges• Customizations difficult to merge into base code when upgrading

• Convergence with DLP infrastructure project

• Metadata issues– Need to get it as right as possible the first time

– Diverse content makes metadata planning hard– A "one size fits all" approach may not describe any one type of content very robustly

– Must be reusable when/if moved to a different system

Page 24: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Discussion

Page 25: IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.

Resources

• IUScholarWorks– http://scholarworks.iu.edu

• DSpace– http://www.dspace.org

• Scholarly Communication– http://www.libraries.iub.edu/index.php?pageId=3562

– http://www.library.arizona.edu/services/faculty/scholcom/resources.html