Realizing the Vision Improving Interoperability with Z39.50 Profiles

45
Realizing the Vision Improving Interoperability with Z39.50 Profiles OCLC SiteSearch User Group Meeting Dublin, Ohio May 2, 2000 William E. Moen William E. Moen <[email protected]> <[email protected]> School of Library and Information Sciences School of Library and Information Sciences University of North Texas University of North Texas Denton, TX 72603 Denton, TX 72603

description

Realizing the Vision Improving Interoperability with Z39.50 Profiles. William E. Moen School of Library and Information Sciences University of North Texas Denton, TX 72603. OCLC SiteSearch User Group Meeting Dublin, Ohio May 2, 2000. Presentation Components. - PowerPoint PPT Presentation

Transcript of Realizing the Vision Improving Interoperability with Z39.50 Profiles

Page 1: Realizing the Vision Improving Interoperability with Z39.50 Profiles

Realizing the VisionImproving Interoperability with

Z39.50 Profiles

Realizing the VisionImproving Interoperability with

Z39.50 Profiles

OCLC SiteSearch User Group MeetingDublin, OhioMay 2, 2000

William E. MoenWilliam E. Moen<[email protected]><[email protected]>

School of Library and Information SciencesSchool of Library and Information Sciences

University of North TexasUniversity of North Texas

Denton, TX 72603Denton, TX 72603

Page 2: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 2William Moen

Presentation ComponentsPresentation ComponentsThe Vision:

Distributed and Integrated Access

The Question of Interoperability

The Z Texas Profile

The Bath Profile

Next Steps

Answer as Many Questions as I Can!

Page 3: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 3William Moen

What is Z39.50What is Z39.50

An American National Standardo Z39.50 means it was developed by NISOo Z39.50 means it was the 50th standard approved

by NISO

Z39.50 is a set of specifications, procedures, and structures for computer communication.

Z39.50 allows users to search one or more databases from a single interface

Page 4: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 4William Moen

Purpose of Z39.50Purpose of Z39.50

To overcome problems of multiple database searching

To allow users of one system:o to search databases on another systemo to retrieve records from that systemo without knowing the unique features of each

database system Search and retrieve bibliographic and non-

bibliographic resources (integrated access) Lay a technical foundation for resource sharing

Page 5: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 5William Moen

Traditional Online Access to Library Information

Traditional Online Access to Library Information

Different:Different:InterfacesInterfacesCommandsCommandsPresentationPresentationData FormatsData Formats

Local Digital Collections

The Online Catalog

Locally-mountedDatabases

Page 6: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 6William Moen

Information Access From a Single Resource

Information Access From a Single Resource

Our library catalogs epitomize the power of this approach User interface is optimized

Interface tightly linked with database search engine

Knowledge of available access points

Pre-determined display formats

System-specific user options and functions

Page 7: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 7William Moen

Integrated Information Access Option 1

Integrated Information Access Option 1

Learn to search each resource using native interface

To achieve: burden is on user to learn each on

Providing web interfaceweb interface does not reduce burden

Page 8: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 8William Moen

Integrated Information Access Option 2

Integrated Information Access Option 2

Develop mechanism for querying separate resources in a standardized manner

To achieve: burden is on information systems to communicate in ways that hide the difference between them

Providing Z39.50 access is a solution

Page 9: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 9William Moen

Integrated AccessIntegrated Access

Single Interface to Multiple ResourcesSingle Interface to Multiple Resources

LocalLocalRemoteRemoteUsersUsers

LocalRemoteUsers

The Online Catalog

Local Digital Collections

Locally-mountedDatabases

Your Library’s Your Library’s Information SystemInformation System

Page 10: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 10William Moen

Distributed and Integrated Access

Distributed and Integrated Access

Resources are geographically dispersed (possibly around the globe)

Resources may hold similar or dissimilar types/formats of information

Single user interface to distributed resources

Page 11: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 11William Moen

Distributed and Integrated Access

Distributed and Integrated Access

RemoteRemoteInformation SystemInformation System

(e.g., museum system)(e.g., museum system)

Remote Remote Information SystemInformation System

(e.g., archives)(e.g., archives)

YourYourInformationInformation

SystemSystem(e.g., library resources)

Z39.50Z39.50Internet AccessInternet Access

Your Z39.50 Client

Page 12: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 12William Moen

Challenges to Distributed andIntegrated Access

Challenges to Distributed andIntegrated Access

Modeling search and retrieval across a variety of information resources

Providing vocabulary and grammar to express searches (e.g., access points and other query criteria)

Providing mechanisms for retrieving different formats/types of resources (e.g., bibliographic records, full-text resources, digital objects, or components of database records

Page 13: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 13William Moen

Z39.50 Addresses the ChallengesZ39.50 Addresses the Challenges Two systems share a common

“understanding”o of access pointso of databases records

Standardized language -- vocabulary (semantics) and grammar (syntax) foro initiating and closing information retrieval sessionso expressing searches and their meaningo delivering result set records

The common understanding allows interaction with a remote system

Page 14: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 14William Moen

Centralized vs. Distributed IRCentralized vs. Distributed IR

Two models for networked information retrieval

Centralizedo Web search engine modelo IR from one server is not too difficult

Distributedo Z39.50 modelo IR from multiple databases across multiple

servers is more problematic

Page 15: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 15William Moen

Interoperability: The Fundamental Problem

Interoperability: The Fundamental Problem

Ability of client to successfully search and retrieve information in a meaningful way

Key issue when searching multiple databases containing similar resources

Difficult problem when searching databases containing diverse types of resources

Z39.50 profiles provide a solution

Page 16: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 16William Moen

InteroperabilityInteroperability

Working definition:User’s ability to successfully search and retrieve information in a meaningful way and have confidence in the results

Major issue in networked resources & digital library environment

Z39.50 interoperabilityo Theoretical vs practical interoperabilityo Conformance to standard vs demonstrable interop

Page 17: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 17William Moen

Threats to InteroperabilityThreats to Interoperability

The functionality supported by the standard

Differences in implementation of the standard

Differences in local IR systemso Z39.50 cannot improve searchability of

resources o It can only support what local and remote only support what local and remote

systems offersystems offer

Page 18: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 18William Moen

Levels of InteroperabilityLevels of Interoperability

User Task levelo do systems support IR tasks of one or more user

groups? Semantic level

o can Z-clients and Z-servers and local IR systems preserve and act on meaning of IR tasks

High-level protocol (functional)o do Z-client and Z-servers support appropriate

Z39.50 services for user tasks Low-level protocol (syntactic)

o do Z-client and Z-servers interchange PDUs according to standard?

Page 19: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 19William Moen

Conversion

Z39.50Origin

Conversion

Z39.50Target

Database

User Interface

UT

SemUT

Sem

Interoperability Crisis Points

ProtUT = User TaskSEM = SemanticPROT = Protocol

Page 20: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 20William Moen

Communication Between Z-client and Z-server

Communication Between Z-client and Z-server

UserInterface

Client

Z39.50Client

(Origin)

Info Server

NetworkAccess

via Internet

QueryQueryFormulationFormulation

Standardized Representations:• Query• Database Record

for Intersystem Communication

Search Engine&

Database

Z39.50Server

(Target)

Client-SideClient-Side Server-SideServer-Side

Page 21: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 21William Moen

Interoperability and Z39.50 Searching

Interoperability and Z39.50 Searching

Issueso Use attributes supported (Z39.50 Implementation)o Differences in indexing, searchable fields available,

search support, etc. (Local IR System)

Implicationso Different results from similar databases

implemented on separate Z-serverso Different results on same database when searched

locally or through Z39.50

Page 22: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 22William Moen

Interoperability and Z39.50 Retrieval

Interoperability and Z39.50 Retrieval

Issueso Z39.50 Record Syntaxes supported (Z39.50

Implementation)o Capability of local IR system to prepare records in

one or more formats for interchange (Local IR System)

Implicationso Clients and servers may or may not be able to

interchange records

Page 23: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 23William Moen

Problems We FaceProblems We Face Strategic use of Z39.50 for integrating access to

information resources (local and global)

Dissatisfaction with Z39.50 given the Web

Uncertainty about functional requirements for Z39.50 features and specifications for procuring Z39.50 products

Lack of vendor information/support

Implementation complexity and interoperability problems

Z39.50 profiles provide solutions

Page 24: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 24William Moen

Z39.50 ProfilesZ39.50 Profiles Represents community consensus on

requirements for Z39.50

Identifies Z39.50 specifications to support those requirements

Improves search and retrieval results

Aids in purchasing decisions

Provides specifications for vendors to build Z39.50 products

Page 25: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 25William Moen

Profiles -- Z39.50 SpecificationsProfiles -- Z39.50 Specifications

Represents community consensus on requirements

Identifies Z39.50 specifications to support those requirements

Improves search and retrieval results

Aids in purchasing decisions Provides specifications for

vendors to build Z39.50 products

Complete

Z39.50

Specifications

Z39.50

Profile

Z39.50Profiles

ATS-1

GILS

CIMI

GEO

******

Z Texas

Bath

Page 26: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 26William Moen

The Z Texas ProfileThe Z Texas Profile

Reflects Texas library requirements & specifications

Who’s involved?o Libraries with Z39.50 implementationso Libraries acquiring Z39.50 productso Libraries considering a Z39.50 implementation

Currently representatives from:o Texas libraries -- public, school, academic, specialo State Institutions/Organizationso Z39.50 Experts

The Texas Z39.50 Implementors Group (TZIG) First Initiative: Z Texas: The Z39.50 Profile

Page 27: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 27William Moen

Step 1: Identifying Requirements

Step 1: Identifying Requirements

Providing access -- To what resources? Types of searching Interchanging MARC records -- which flavor? Interlibrary loan, item ordering, document

delivery Full text search and retrieval

An individual effort to determine requirements for your library

A collaborative community effort to identify common requirements for developing Z39.50 specifications

Page 28: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 28William Moen

Step 2: Specifying Z39.50 Step 2: Specifying Z39.50

Common specifications for procurement and implementation documented in a profile

Profile addresses Z specifications such as:o Protocol versiono Searching:

Query, Attributes, Attribute Combinations, Default Values, Expected Server Behavior, etc.

o Retrieval: Record syntax, element set names, etc.

o Other Z services desired or required In synch with other existing/emerging profiles

Gain community (including vendors) support

Page 29: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 29William Moen

Scope and Structure of the Profile

Scope and Structure of the Profile

Scopeo Current version: search & retrieval from OPACs,

including holdings informationo Subsequent versions: other networked electronic

resources Structure

o Modularo Current Functional Areas Defined

Functional Area -- Library Catalog Search & Retrieval Functional Area -- Retrieval of Bibliographic Holdings Information

o Potential Functional Areas Cross Domain Searching

Page 30: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 30William Moen

Addressing InteroperabilityAddressing Interoperability

Identify searching requirements (tasks)o Several categories of searching: basic, specialized

Basic: Author, Title, Subject, Keyword, Boolean Specialized: ISBN, Call Number, Specific Controlled Vocabulary

Defining the searches (semantics and behavior)o Exact Title Search -- exact match beginning with first

character of first word appearing in the title of an item, in the series title, or the uniform title

Specifying Z39.50 query to represent the searcho Standard combination of Z39.50 Attribute Types and

Values Suggested mapping and indexing of local

systems (e.g., MARC fields to index, etc.)

Page 31: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 31William Moen

Benefits of the Z Texas ProfileBenefits of the Z Texas Profile

Focuses time and resources to build consensus among Texas librarians

Ensures Texas libraries’ needs are communicated and addressed to Z39.50 vendors

Educates and informs Texas librarians Ensures effective Z39.50 implementation for

Texas libraries Provides technical infrastructure for statewide

resource sharing

Page 32: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 32William Moen

The Bath ProfileThe Bath Profile

The Bath Profile: An International Z39.50 Specification for Library Applications and Resource Discovery (Draft available)

International in use and scope Based on structure and requirements of

Z Texas Profile Specifies three Functional Areas:

o Basic Bibliographic Search & Retrievalo Bibliographic Holdings Search & Retrievalo Cross-Domain Search & Retrieval

Specifies several Conformance Levels

Page 33: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 33William Moen

Bath Profile Functional AreaBath Profile Functional Area

Basic Bibliographic Search and Retrievalo Basic Bibliographic Search

Level 0 (4 searches) Level 1 (15 searches)

o Browsing Indexes 6 Scans specified

o Basic Bibliographic Retrieval Combination of UNIMARC or MARC21 and SUTRS and XML.

Bibliographic Holdings Search and Retrievalo 3 Levels

Cross-Domain Search and Retrievalo Level 0 (4 searches) Level 1 (9 searches)o SUTRS and XML as record syntaxes

Page 34: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 34William Moen

Level 0 Title Keyword SearchLevel 0 Title Keyword Search

Attribute Type Attribute Value

Attribute Names

Use (1) 4 Title

Relation (2) 3 Equal

Position (3) 3 Any

Structure (4) 2 Word

Truncation (5) 100 Do not truncate

Completeness (6) 1 Incomplete subfield

Uses: Searches for complete word in a title of a resource.

Page 35: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 35William Moen

Level 1 Title First Words in FieldLevel 1 Title First Words in Field

Attribute Type Attribute Value

Attribute Names

Use (1) 4 Title

Relation (2) 3 Equal

Position (3) 3 Any

Structure (4) 1 Phrase

Truncation (5) 100 Do not truncate

Completeness (6) 1 Incomplete subfield

Uses: Searches for complete word(s) in the order specified in fields that contain a title of a resource. The field must begin with the specified character string. This search is useful when the beginning words in a title are known to the user.

Page 36: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 36William Moen

Z39.50 & Dublin CoreZ39.50 & Dublin Core

Z39.50 enables a common “understanding” by reference to shared vocabulary and structure

Dublin Core can provide the standardized vocabulary for access points in database

Dublin Core can provide the standardized grammar for labeling retrieved data elements

Extensible Markup Language (XML) serves as an interchange structure within Z39.50

Page 37: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 37William Moen

Dublin Core as LensDublin Core as LensZ39.50

Museum System

Records with a field:

Artist Name

Z39.50Library Catalog

Records with a field:

Personal Author

Z39.50 Search: Author = Van Gogh

Z39.50 Retrieval: Author = Van Gogh

DC.Creator = Van Gogh

Page 38: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 38William Moen

Z39.50 & Dublin Core at WorkZ39.50 & Dublin Core at Work

The Bath Profileo uses Dublin Core for Resource Discovery to

represent access points and label retrieval elementso uses XML as the interchange format

Z39.50 Application Profile for Cultural Heritage Information

o <http://www.cimi.org/standards/index.html#THREE>

o uses Dublin Core to represent access points and label retrieval elements

Page 39: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 39William Moen

Synergistic Solutions:Z39.50 and MetadataSynergistic Solutions:Z39.50 and Metadata

Z39.50 developed to interact with structured metadata in bibliographic databases

Metadata is now key approach for networked information organization

Networked environment will see varieties of metadata schemes and implementation models

Structured metadata + Z39.50 = integrated access to distributed global resources

Page 40: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 40William Moen

Z Texas and BathZ Texas and Bath

Z Texas Profileo Z Texas as companion profile to Batho Z Texas as compatible superset of Batho Gaining expert advice from development of Bath

Bath Profile o Bath development has slowed work on Z Texaso Bath is now in a stable draft versiono Bath is under review as Internationally Registered

Profile

Page 41: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 41William Moen

The Next Steps:Harmonizing Z Texas with Bath

The Next Steps:Harmonizing Z Texas with Bath

Upon IRP decision, Bath will be released for use Final revisions to Z Texas to harmonize with Bath Publication of Z Texas Profile, Release 2.0 Continuing development on Texas-specific

requirements:o for searchingo for retrievalo for addressing other types of information

Developing guidance for implementing Z Texas Developing indexing and mapping guidance

Page 42: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 42William Moen

The Next Steps: Assuring Goodness of Products

The Next Steps: Assuring Goodness of Products

Approacheso Taking the vendor’s word for ito Conformance testingo Interoperability testing

Challenges of interoperability testingo Rigorous methodologyo Tests for different levels of interoperabilityo Metrics for benchmarks and comparisono Setting up an interop testbed

Page 43: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 43William Moen

An Interoperability TestbedAn Interoperability Testbed

Technical (e.g., for semantic interoperability)o Reference implementation (trusted)o Test database (known contents)o Search and retrieval scenarioso Known results of searches

Organizational o Neutral sponsor

Politicalo Motivating participation

Page 44: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 44William Moen

Final ThoughtsFinal Thoughts

Convergence of interest, need, motivation

Collaboration within and among communities

Education and marketing

Window of opportunity for Z39.50 andWindow of opportunity for Z39.50 and

it’s up to us to take advantage of it!it’s up to us to take advantage of it!

We can realize the vision for our users!We can realize the vision for our users!

Page 45: Realizing the Vision Improving Interoperability with Z39.50 Profiles

OCLC SiteSearch User Group Meeting -- May 2, 2000 45William Moen

Z Texas and Bath Profile Resources

Z Texas and Bath Profile Resources

Z Texas Profileo Project Website

http://www.tsl.state.tx.us/LD/z3950/

o TZIG Member Listhttp://www.tsl.state.tx.us/LD/z3950/committee.htm

o Texas Z39.50 Listservhttp://www.tsl.state.tx.us/LD/z3950/list.htm

Bath Profileo Copy of draft profile

http://www.ukoln.ac.uk/interop-focus/activities/z3950/int_profile/bath/draft/

o Listserv for Discussion ZIP-PIZ-L (subscribe by sending message to:

[email protected])