ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC
description
Transcript of ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC
ARCHIVE IMAGING SEARCHABLE VIA THE
WEBPAC
Marthie de Kock The Hong Kong Institute of Education
9 December 2002
Education Imaging System(EdIS)
Hong Kong Institute of Education Library
3
Points for discussion
• Scope and functions
• EdIS Phase I
• EdIS Phase II
• Background
• Different document classes
• Data retrieval & searching
• INNOPAC and the Z server
4
ScopeScope
• Provide a sophisticated system to manage the growing electronic media including text, black & white scanned images, colour photos, audio, video and multimedia presentations available to and in HKIEd library.
• Provide an effective web interface to retrieve on-line digitised materials.
5
System FunctionsSystem Functions
• Capture of content, storage & management
• Scanning & OCR
• Supports both English and Chinese indexing and full text searching
6
BackgroundBackground
First Digital Library initiatives of HKIed Library
• Joint project between IBM & Library with technical support by ITS
• July 1997 - signed contract with IBM and it’s Digital Library
• June 23 1998 - the system was launched
7
Search Interface of EdIS > The Main Screen
8
Contents of EdIS Phase I Contents of EdIS Phase I Four Document TypesFour Document Types
Document types Digitised itemsNewspaper clippings Image scanning & OCR
Examination papers Image scanning & OCR
Curriculum materials Multimedia objects
Student Projects Multimedia objects
9
Document Types:Document Types:News Clippings & Exam PapersNews Clippings & Exam Papers
• News clippings:• Past newspaper clippings
• scanning, OCR, indexing
• Wiser News indexing & CMC operations
• Exam Papers:• Departments
• scanning, OCR, indexing
10
Document Types:Document Types:Curriculums & Student ProjectsCurriculums & Student Projects
• Digitising procedures included:• Content Analysis
• Categorise multimedia objects
• Write a summary
• Digitise materials, saving files with logical file names, web page design & preparing scripts for uploading
• Upload documents & testing
11
Basic Search Screen of Curriculum Materials
12
Search results screen of [Title = dance]
13
Selected the target page from the hit-list.
14
EdIS Phase II
• Include Archive materials
• Improve multimedia searching
• Search Archive materials via INNOPAC
• No response – IBM’s DL and CMC
• June 2001 new Tender specifications
• Vitova
15
EdIS Phase II Development
• Customise system
• Project development – July 2001
• Z server
• System delivered – April 2002
• Interface – uploading of Wiser news
16
System ArchitectureSystem Architecture
Three subsystems:
• Client subsystem• The front-end PC workstations with
Netscape or Microsoft web browser are available for record retrieval and viewing.
• Capturing Subsystems • Used for content preparation
(scanning OCR and indexing)• Server Subsystem • The production server - stores
records and manages the systems operations
17
ConfigurationConfiguration
• Hardware:• SUN Enterprise 250 server
• 36 GB data storage space
• Configured as RAID 0 (disk mirror)
• Operating Software:• ORACLE Database 8i for SUN Sparc Solaris Unix 2.7
Z39.50 server for document searching
18
Hardware and software
• Application software• VitalDoc Document Imaging system - 40 user
license
• Two VitalScan licenses for desktop Scanning and OCR
• Chinese OCR - TsingHau Wintone ver. 8.0
19
20
21
Other hardware
•Two scanning/OCR workstations
•Minolta PS7000 Scanner
•Ricoh IS330DC DF and Flatbed scanner
22
23
24
25
26
27
Typical Searching ProcedureTypical Searching Procedure
Enter Searching Criteria
Browsing Hit List
View Result/Content
Review HistoryNew Search
Select Class/Database
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
Future?
End