ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC Marthie de Kock The Hong Kong Institute of Education 9...

Post on 18-Dec-2015

213 views 0 download

Tags:

Transcript of ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC Marthie de Kock The Hong Kong Institute of Education 9...

ARCHIVE IMAGING SEARCHABLE VIA THE

WEBPAC

Marthie de Kock The Hong Kong Institute of Education

9 December 2002

Education Imaging System(EdIS)

Hong Kong Institute of Education Library

3

Points for discussion

• Scope and functions

• EdIS Phase I

• EdIS Phase II

• Background

• Different document classes

• Data retrieval & searching

• INNOPAC and the Z server

4

ScopeScope

• Provide a sophisticated system to manage the growing electronic media including text, black & white scanned images, colour photos, audio, video and multimedia presentations available to and in HKIEd library.

• Provide an effective web interface to retrieve on-line digitised materials.

5

System FunctionsSystem Functions

• Capture of content, storage & management

• Scanning & OCR

• Supports both English and Chinese indexing and full text searching

6

BackgroundBackground

First Digital Library initiatives of HKIed Library

• Joint project between IBM & Library with technical support by ITS

• July 1997 - signed contract with IBM and it’s Digital Library

• June 23 1998 - the system was launched

7

Search Interface of EdIS > The Main Screen

8

Contents of EdIS Phase I Contents of EdIS Phase I Four Document TypesFour Document Types

Document types Digitised itemsNewspaper clippings Image scanning & OCR

Examination papers Image scanning & OCR

Curriculum materials Multimedia objects

Student Projects Multimedia objects

9

Document Types:Document Types:News Clippings & Exam PapersNews Clippings & Exam Papers

• News clippings:• Past newspaper clippings

• scanning, OCR, indexing

• Wiser News indexing & CMC operations

• Exam Papers:• Departments

• scanning, OCR, indexing

10

Document Types:Document Types:Curriculums & Student ProjectsCurriculums & Student Projects

• Digitising procedures included:• Content Analysis

• Categorise multimedia objects

• Write a summary

• Digitise materials, saving files with logical file names, web page design & preparing scripts for uploading

• Upload documents & testing

11

Basic Search Screen of Curriculum Materials

12

Search results screen of [Title = dance]

13

Selected the target page from the hit-list.

14

EdIS Phase II

• Include Archive materials

• Improve multimedia searching

• Search Archive materials via INNOPAC

• No response – IBM’s DL and CMC

• June 2001 new Tender specifications

• Vitova

15

EdIS Phase II Development

• Customise system

• Project development – July 2001

• Z server

• System delivered – April 2002

• Interface – uploading of Wiser news

16

System ArchitectureSystem Architecture

Three subsystems:

• Client subsystem• The front-end PC workstations with

Netscape or Microsoft web browser are available for record retrieval and viewing.

• Capturing Subsystems • Used for content preparation

(scanning OCR and indexing)• Server Subsystem • The production server - stores

records and manages the systems operations

17

ConfigurationConfiguration

• Hardware:• SUN Enterprise 250 server

• 36 GB data storage space

• Configured as RAID 0 (disk mirror)

• Operating Software:• ORACLE Database 8i for SUN Sparc Solaris Unix 2.7

Z39.50 server for document searching

18

Hardware and software

• Application software• VitalDoc Document Imaging system - 40 user

license

• Two VitalScan licenses for desktop Scanning and OCR

• Chinese OCR - TsingHau Wintone ver. 8.0

19

20

21

Other hardware

•Two scanning/OCR workstations

•Minolta PS7000 Scanner

•Ricoh IS330DC DF and Flatbed scanner

22

23

24

25

26

27

Typical Searching ProcedureTypical Searching Procedure

Enter Searching Criteria

Browsing Hit List

View Result/Content

Review HistoryNew Search

Select Class/Database

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

Future?

End