Viaf and isni ifla 2013 08-16

19
The world’s libraries. Connected. VIAF and ISNI Interoperability Janifer Gatenby EMEA Program Manager Metadata OCLC VIAF Council Meeting Singapore 2013-08-16

description

VIAF and ISNI; interoperation; cluster level maintenance Linked data

Transcript of Viaf and isni ifla 2013 08-16

Page 1: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

VIAF and ISNIInteroperability

Janifer Gatenby

EMEA Program Manager MetadataOCLC

VIAF Council Meeting

Singapore 2013-08-16

Page 2: Viaf and isni  ifla 2013 08-16

Libraries

Text RightsMusic Rights

Trade Sources

Encyclopaedias

Researchers & Professional

Page 3: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

Provisional: Unassigned9,563,590

Provisional: Possible580,738

Assigned6.87 million

Assigned ISNIs July 2013

2 + independent sources 2,730,6313+ VIAF sources 656,976

Unique name 3,157,075Single source (JISC

names, BOEK, Ringgold) 296,417Total 6,636,916

Assigned ISNIs to VIAF July 2013

2 + independent sources 2,496,141

3+ VIAF sources 656,976

Unique name 2,643,958

Single source 0

Total 5,797,075

Page 4: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

Assigned

bav bibsys bne bnf dbc dnb egaxa iccu jpg

Assigned 181597 40634 221254 1067572 1065 1403508 14121 12736 47783Total 263382 70347 415630 1715817 2263 3205940 34688 36554 181623                   Percentage assigned 68,95 57,76 53,23 62,22 47,06 43,78 40,71 34,84 26,31

Unique 41653 15633 46685 213047 524 572443 107 7154 145

lac lc ndl nkc nla nli nszl nta nukatAssigned 245604 3530966 366043 358085 345220 293140 13075 1399918 828316Total 509126 6973060 775719 520334 647341 440566 33673 2347967 1143046                   Percentage assigned 48,24 50,64 47,19 68,82 53,33 66,54 38,83 59,62 72,47

Unique 64647 778749 226188 94632 27482 29882 25 314163 173921

ptbnp rero rsl selibr sudoc swnl vlacc wkp VIAFAssigned 116038 81155 293 96930 727610 22738 3647 261673 5797075Total 286490 119523 586 157939 1002970 38228 5132 326347 14501337                   Percentage assigned 40,50 67,90 50,00 61,37 72,55 59,48 71,06 80,18 39,97614

 Unique 38100 4019 118 21861 182138 6553 49 16395 2643958

Page 5: Viaf and isni  ifla 2013 08-16

Links from Current Non-VIAF sources to VIAF

clusters

VIAF source links to ISNI = > 7.3 million

Page 6: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

VIAF Scope

• Persons

• Organisations

• Works / uniform titles

• Expressions

• Meetings

• Geographic

• All public data

ISNI Scope

• Persons

• + musicians, researchers

• Organisations

• (excluding sparse)

• (excluding undifferentiated)

• Includes private data

VIAF and ISNI are Complementary

Page 7: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

VIAF Role

• Ingest authority records from the world’s major national and research libraries

• Make clusters

• Expose and diffuse

ISNI Role

• Create permanent IDs

• By batch

• On demand

• Diffuse those IDs

• Libraries, trade, rights management, professional societies, education

VIAF and ISNI are Complementary

Page 8: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

VIAF System

• Harvester

• Clustering mechanism

• Web site (5 interface languages)

• Download in multiple formats

• Linked data & SRU

1 million personal visitors p.a.

ISNI System

• Batch load

• Online request API

• Web site (English only)

• Allows end user input

• Member input and correction

• 16+ indexes

• SRU; soon linked data

• Quality Team monitoring & correcting

• Diffusion, including corrections

VIAF and ISNI are Complementary

Page 9: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• Samples data regularly

• c. 2% VIAF clusters have mixed identities

• Duplicate clusters are higher, nearer 5%

• Makes corrections at cluster level

• Merges, splits, error notifications

• Access to cataloguing client / macros

• Makes system recommendations

• Gives approval for single source assignment

• Responds to End User input

ISNI Quality Team

Page 10: Viaf and isni  ifla 2013 08-16

Example record fixed by QT• 3 VIAF records merged• ISNI sources British Library Sound Archive, MusicBrainz• notice instruments, performances

Page 11: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

Another example of a merge in ISNI

Page 12: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• Cause duplicate ISNI assignment

• Where both clusters have more than 3 VIAF sources

• Where an ISNI source matches with a single or 2 source VIAF record

• (Where VIAF sources move between the clusters)

• ISNI as a VIAF source will help VIAF merge clusters where ISNI QT has manually merged them (2,444)

• ISNI has flagged 481,766 VIAF records as possible duplicates

Duplicate clusters

Page 13: Viaf and isni  ifla 2013 08-16

Titles of other identities

Vocabulaire anglais-français, français-anglais, de terminologie économique et juridique

Il Piemonte visto da un inglese

Italia, Italia

The politics of the Vatican

The Pope's divisions : the Roman Catholic Church today

La notte comincia ancora una volta

Page 14: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

Amazon is differentiating

Page 15: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• Signalled by ISNI Quality Team

• Most cases encountered now are due to source data

• ISNI QT would like to notify VIAF sources directly

• VIAF is currently notified by a field in the ISNI record; notifications indicate if a cluster error or a source error

Undifferentiated Identities

Page 16: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• ISNI is assigned to public identities. Pseudonyms = different identities; but related

• VIAF sources- some treat as name variants, some as related names

• ISNI suite of programs

• Converts pseudonym name variants to related names

• Flags records with dissimilar main names

• Links and protects

Public Identities versus Persons

Page 17: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• Policy on pseudonyms

• Study notification work flows

• How to remove record protect flags

• Participate in cluster sampling in VIAF and ISNI

• Help define new anomaly detectors

• ISNI has dissimilar main name / publishing before age 9, life span greater than 120 years

VIAF ISNI Task Force

Page 18: Viaf and isni  ifla 2013 08-16

NUKAT 99036027record for Thomas Meier (1953-) -2 erroneous titles- VIAF cluster 267789223

ISNI 0000 0003 9867 7425Thomas Meier (1953-)

Titles belong to

ISNI 0000 0004 0034 1112Thomas Meier 1966-

• ISNI QT to delete the titles from the ISNI record

• Notification to VIAF and directly to the contributor (977)

NUKAT

VIAF cluster 267789223

NUKAT deletes the 2 titles and creates a new authority record for Thomas Meier 1966

This new NUKAT record matches with the VIAF cluster for Thomas Meier 1966-on the 2 titles added by ISNI

VIAF cluster 12431062(with ISNI in the cluster)

VIAF cluster 12431062

Actions by ISNI

• ISNI QT adds the 2 titles to this ISNI record

• Notification to VIAF (977)

New NUCAT authority record for

Thomas Meier 1966

Without the 2 titles for Thomas Meier 1966

Actions by ISNI

Page 19: Viaf and isni  ifla 2013 08-16

The world’s libraries. Connected.

• Flag undifferentiated records• e.g. those generated by programs comparing authority and bibliographic

name strings

• Respond to ISNI notifications • correct “home” data

VIAF sources

As ISNI Members:

• Control “own identities” in VIAF and ISNI * Check possible matches and suspect records on ISNI

• Use ISNI for direct maintenance of clusters * Will generate notifications to VIAF and VIAF sources