JATS for both journals and books? -- A case study of adopting JATS to build a single search for...

Post on 05-Jan-2016

218 views 2 download

Transcript of JATS for both journals and books? -- A case study of adopting JATS to build a single search for...

JATS for both journals and books?--A case study of adopting JATS to build a single search for Ejournals and Ebooks

Wei Zhao & Jayanthy ChenganOctober 17, 2012

OCUL Scholars Portal Services

• OCUL is a consortium of twenty-one university libraries in the province of Ontario.

• Scholars Portal is a project of OCUL to provide shared technology infrastructure and shared collections to OCUL universities.

• Scholars Portal services includes digital content of ebooks, ejournals, statistics data, geo data and other services including interlibrary loan and research management.

SP E-Journals--Introduction• Digital repository containing over 32 Million articles

from over 14,000 full text journals of 25 publishers which covers every academic discipline

• 75,000 records added daily in 2012• Top research resource for OCUL universities with the

average monthly download of 555,000• Implement NLM DTD for its XML based database

using MarkLogic since 2006

SP E-Books--Introduction

• Scholars Portal Books is a PDF-based platform containing 460,000 ebooks from 25 collections of various publishers running on Ebrary’s ISIS system.

• While the PDF is still the dominating format, the publishers start to move from PDF to XML book.

• The XML books are loaded on Ebook platform, but with some major problems.

Loading XML ebook chapters on Journals • Transform the XML book source data into

NLM book DTD XML format in MarkLogic. • When the users do a search from Ejournals

platform, the query is also sent to XML book chapter database.

• Users can get the book chapter level search result from Ejournals platform and then are directed to Ebook platform.

Crosswalk: Springer A++ V2.4 to NLM book DTD• The match was good with a few gaps

identified in NLM book DTD.-- A set of tags to describe the book series

metadata--Subject or classification for the book--Chapter level DOI

The Loader

• A program has transformed the Springer source data into NLM book DTD XML based on the crosswalk.

• 837,469 Springer book chapters xml have been loaded in MarkLogic.

Example of source data

Example of SP NLM book DTD data

The display of the search results

• Scholars Portal Journals application is built in Xquery.

• The search in the journals extended to books database through AJAX call and the result is displayed on the right side of the page.

Book chapters in the search result list

Book chapters in the journal article details page• In journal article details page, the keywords

from the article are used to search for the matching chapters.

• the result is displayed in "Related Chapters" tab.

Book chapters in the journal article details page

Linking to Ebook platform

• The PDF link for the book chapter direct the user to the Ebook platform for the book and allows the user to download the book chapter PDF if they are entitled.

Exmaple of the linking to Ebook platform

Future directions

• This pilot project is still in testing phase. More analysis need to be done after it goes alive and get the feedback from the users.

• We will be reviewing the new NLM Book DTD to see how we will implement it in the book platform.

http://www.ocul.on.ca/http://www.scholarsportal.info

http://journals.scholarsportal.infohttp://books.scholarsportal.info

Questions?