50 david yakimischak

13
Metadata and its Relationship to Journal Workflow SSP Seminar in Philadelphia, PA David Yakimischak, Chief Technology Officer November 18, 2004 www.jstor.org

description

 

Transcript of 50 david yakimischak

Page 1: 50 david yakimischak

Metadata and its Relationshipto Journal Workflow

SSP Seminar in Philadelphia, PADavid Yakimischak, Chief Technology Officer

November 18, 2004 www.jstor.org

Page 2: 50 david yakimischak

Agenda

• What is JSTOR?• What is Metadata?• What is Journal Workflow?• What is the relationship?• Discussion

Page 3: 50 david yakimischak

JSTOR Mission

• JSTOR is a not-for-profit organization with a mission to help the scholarly community take advantage of the advances in information technology. This includes: (1) building a reliable and comprehensive archive of core scholarly journals, and (2) dramatically improve access to this scholarly material

• In pursuing its mission, JSTOR takes a system-wide perspective, seeking benefits for libraries, publishers and scholars

Page 4: 50 david yakimischak

JSTOR Today

• 2,160 participating libraries• 269 participating publishers• 449 journals online• 16,379,559 pages scanned (and

counting!)• Formed an Electronic Archiving

initiative in 2003

Page 5: 50 david yakimischak

JSTOR Monthly Usage

Meaningful Accesses per Month

0

5,000,000

10,000,000

15,000,000

20,000,000

25,000,000

Jan-97

Apr-97

Jul-97

Oct-97

Jan-98

Apr-98

Jul-98

Oct-98

Jan-99

Apr-99

Jul-99

Oct-99

Jan-00

Apr-00

Jul-00

Oct-00

Jan-01

Apr-01

Jul-01

Oct-01

Jan-02

Apr-02

Jul-02

Oct-02

Jan-03

Apr-03

Jul-03

Oct-03

Jan-04

Apr-04

Jul-04

Oct-04

Page 6: 50 david yakimischak

What is Metadata?

• Data about data• Internally we outlaw the unqualified

use of the word metadata• Three broad categories

• Descriptive Metadata• Technical Metadata• Administrative Metadata

Page 7: 50 david yakimischak

Descriptive Metadata

• Describes the content or underlying asset• Bibliographic information• Title, author, journal, date, page, etc.• Used to create citations• Use to categorize or organize information• Creates the browse interface to content

Page 8: 50 david yakimischak

Technical Metadata

• Describes the structure or format of an item

• Allows for verification or validation• Typically used for internal operations• A registry can be used to catalog these• Commonly needs to be migrated,

although this applies to all metadata• Most suitable for automated processing

Page 9: 50 david yakimischak

Administrative Metadata

• Generally deals with how an item has been handled

• Most relevant to the archival records• Tracks history of change or access• Closely related to workflow• Can be gathered automatically and

unobtrusively

Page 10: 50 david yakimischak

What is Journal Workflow?

• The process of creating the finished product

• Can be formal or ad-hoc• Automated or manual• Stress-relieving or stress-inducing

Page 11: 50 david yakimischak

What is the relationship?

• Metadata can be managed during the journal workflow process• Added, ingested, modified, calculated, stored, versioned

• Metadata is used to drive the workflow process itself

• The intended use must be considered carefully• Why are we gathering this?• Who is the audience? When? Why?• What about statistics, for example?• Connections into other business process systems

Page 12: 50 david yakimischak

JSTOR Observations

• Metadata is created for every word, illustration, page, article, issue, title

• Authorizable units are described with metadata• Users are described with metadata• We have established internal guidelines and

standards• Relationship to, but not the same as external exposure

• Quality assurance and quality control are involved in oversight

• Formalize workflows early when it is easier• Get it right up front, it is expensive to go back

Page 13: 50 david yakimischak

Discussion