Post on 27-Mar-2015
CrossAsia at the Staatsbibliothek zu Berlin
an approach to organise access to research material in the field of Asian studies
The Berlin State Library its East Asia Department and CrossAsia
Background – some facts on the library and its responsibilities
What is CrossAsia? Special subject information guide
Collecting, cataloguing, archiving
Who can benefit of this work?
Who needs CrossAsia? - Evaluation & needs
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Background
Germany is historically and politically a decentralized country with 16 federal states
Material is non-German languages is collected on a national level by some 20 major libraries in Germany supported by „German Research Foundation“
Within this framework Berlin State Library is in responsible for:law, slavic studies and literature, foreign newspapers, topographic maps, cartography, East- and Southeast Asia
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Berlin State Library
10 million books
4.400 incunabula
18.350 Occidental manuscripts
40.000 Oriental manuscripts
250.000 autographs
67.000 music autographs
1.400 estates and personal archives
1 million maps and atlases
32.000 subscription periodicals and monograph series
180.000 early newspapers and 350 subscription newspapers
Diverse electronic databases and full-texts
2.3 million microfiches / microfilms
13.5. million images in the picture archive
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
East and Southeast Asia Collection
Partly funded and supervised by German Research Foundation
A kind of „national collection“
Responsible for collection building, processing, acquisition, e-resources, digitising projects, technical solutions
Special inter-library-loan service „Blauer Leihverkehr“ open to German and European institutions/users
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia
Co-operative project (start 2005 – end of funding 2010/11)
Organising access to relevant material in each language and form (print/digital)
CrossAsia is the technical platform to offer and distribute “our” content (our printed collection and more than 40 databases)
Giving access to all kinds of metadata
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Partners
German Institute of Global and Area Studies, Hamburg
the State and University of Library Lower Saxony in Göttingen
the Institute for Chinese Studies at Heidelberg University, Heidelberg
the Department of Chinese and Korean Studies at Tübingen University, Tübingen,
the Department of Japanese Studies at Tübingen University, Tübingen
the Internet Guide for Chinese Studies, Leiden, the Netherlands
and CrossAsia in conjunction with the Asian Studies WWW Virtual Library and the Nordic Institute of Asian Studies
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia – E-Resources
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Online Guide East Asia – special subject information
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Online Guide East Asia - Workflow System
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Digital Archive as part of OGEA
Input OGEA Data
has to be archived ?
NO1. set archive flag2. set interval3. make archive URL
YES1. pass data to Heritrix2. activation per CRON3. saving *.arc.gz4. indexing by NutchWAX
generating index
search and present with NutchWAXand Wayback-Machine
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Simple and advanced search
eGovernment-Archiv CrossAsia
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
eGovernment-Archiv CrossAsia - Result
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Building search applications with lucene and nutch
1. We inject a set of URLs into the web database (crawldb).
2. We fetch the web pages:
1. To fetch, we first generate a fetch-list from the database.This generates a fetch-list in a new segment.
2. Now we run the fetcher on this segment.
3. We have to update the database with the results of the fetch.
3. Now we can build a new index of resources catalogued in the Online Guide East Asia.
inject
fetching1. generate
2. fetch3. update
indexing
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
OGEA and Search Engine technology
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
CrossAsia Questions
Do we need to or did we already reinvent the wheel? We all know the Asian Studies WWW VL, the Internet Guide for Chinese Studies or the Portal to Asian Internet Resources etc.?
Will we be able to continue our work to create new records into the central system or do we have to stop when the funding has been expired?
Is it a good idea to archive non-permanent material and how can we solve the copyright issue?
Is this work in google/rss/Twitter/del.icio.us etc-times really necessary?
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Outlook
Printed monographs, series, and newspapers with access to it via interlibrary-loan services
Access to our own digitised collection which will start in summer 2009 (visible in 2010 with 2,5 Million pages in 2012)
Electronic databases like full-text databases, fact databases, E-journals, EBooks, full-text statistics, which are subject to licence with access for German users
Information of other subject collections and projects like resources from the National Library of China and National Central in Taibei, National Diet Library in Japan, or activities in for example Laos National Library (to be continued)
Special Subject Information (OGEA to be continued)
Archiving web-resources (started )
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Organising Access
central system & central index
1. EACAT2. Main Libray Catalogue3. OGEA4. E-ressource 1 (CAJ)5. E-ressource 2 (Apabi)6. …
digital collectionworkflow system
cataloguing systemNACSIS/NII
Online Guide East Asia (incl. DigitalArchive CrossAsia)
other resources …
E-ressource 1 (CAJ – journal level & article level
E-ressource 2 (Apabi – differentcollections)
Searchinterface /
OPAC
for usersfree arrangement
of metadatasets /collections
Matthias Kaun | CrossAsia | CEAL - Chicago 2009
Thank you very much!
Matthias KaunBerlin State LibraryEast Asia Department
http://ead.staatsbibliothek-berlin.de
http://crossasia.org