
VIAF and ISNI Synchronisation

Janifer Gatenby
EMEA Program Manager Metadata

VIAF Global Council - Lyon, France 15 August 2014

[Diagram: ISNI as a cross-domain, bridging identifier. Domains shown: Libraries; Text Rights; Music Rights; Trade Sources; Encyclopaedias; Researchers & Professionals (granting organisations, professional societies, article databases, theses databases); Archives and Museums]

ISNI Status at July 2014

• 8.01 million assigned ISNIs (was 1 million 2 years ago)
• 15.4 million links; ISNI as linked data
• ORCID registration process is accessing ISNI
• New members: Harvard University, La Trobe University and COPYRUS (Russia)
• Linked Content Coalition names ISNI as #1 strategy

Domain              Databases   Assigned      Links
Research            12          836,142       1,845,165
Text rights         7           129,816       692,580
Music               5           315,918       450,717
Libraries & trade   4           6.8 million   12,356,010
Organisations       3           446,237       109,204

Current ISNI Sources: 30…and growing

RIGHTS MANAGEMENT
Access Copyright, Canada ACCE
Authors’ Licensing and Collecting Society, UK ALCS
Centrum Dienstverlening Auteurs- en aanverwante Rechten, Netherlands CEDA
Centro Español de Derechos Reprográficos CEDR
Irish Copyright Licensing Agency ICLA
Prolitteris, Switzerland PROL
VG WORT, Germany VGWO

MUSIC
American Musicological Society AMS
British Library Sound Archive BLSA
International Performers’ Database Association IPDA
MusicBrainz MUBZ

RESEARCHERS AND PROFESSIONALS
American Musicological Society AMS
Authors Guild AGLD
British Library Theses BRTH
Digital Author identifier, Netherlands DAI
Jisc Names Project, UK JNAM
La Trobe University AU:VLU
Modern Language Association MLA
OCLC Theses OCLCT
ORCID and DataCite Interoperability Network ODIN
AuthorClaim and RePEc OPENL
ProQuest Theses PROQ
Scholar Universe, ProQuest SCHU
Electronic tables of content ZETO

ORGANISATIONS
American Chemical Society ACS
Boekenbank, Belgium BOEK
Bowker Publishers BOWP
Publishers Licensing Society, UK PLS
Ringgold RING

GENERAL SOURCES
Bowker Books in Print BOWKER
The European Library (48 national libraries) TEL
Virtual International Authority File (33 libraries) VIAF

VIAF and ISNI are Complementary

VIAF Scope
• Persons
• Organisations
• Works / uniform titles
• Expressions
• Meetings
• Geographic
• All public data

ISNI Scope
• Persons
  – plus musicians, researchers
• Organisations
  – (excluding sparse)
  – (excluding undifferentiated)
• Includes private data

VIAF and ISNI are Complementary

VIAF Role
• Ingest authority records from the world’s major national and research libraries
• Make clusters
• Expose and diffuse

ISNI Role
• Create permanent IDs
  – By batch
  – On demand
• Diffuse those IDs
  – Libraries, trade, rights management, professional societies, educational institutions

VIAF and ISNI are Complementary

VIAF System
• Harvester
• Clustering mechanism (re-clustered monthly)
• 5 web interface languages
• Download in multiple formats
• Linked data & SRU
• 1 million personal visitors p.a.

ISNI System
• Batch load
• Online request API
• Web site (English only)
  – Allows end user input
  – Member input and correction
  – 16+ indexes
• SRU; linked data
• Quality Team monitoring & correcting
• Diffusion, including corrections

Synchronisation ISNI to VIAF

• VIAF provides full file each month
• ISNI compares previous & current files & creates separate files for processing
  – Deletes (VIAF cluster ID in old but not new)
    • If assigned or has other sources, source becomes ISNI
  – Contents changed
  – Sources added or deleted
  – New (VIAF cluster ID in new but not old)
  – Re-matches VIAF deletes
• VIAF cluster movement reports for BL and BnF
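
The monthly comparison described above can be pictured as a set difference over VIAF cluster IDs. The following is only a minimal sketch under stated assumptions: the `Cluster` shape, the field names and the `diff_dumps` function are hypothetical and are not the ISNI system's actual data model. The "re-matches VIAF deletes" step and the rule that a deleted cluster's record becomes ISNI-sourced are not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """One VIAF cluster as seen in a monthly file (hypothetical shape)."""
    cluster_id: str
    sources: frozenset  # contributing source record IDs, e.g. {"BNF|a"}
    content: str        # any serialisation of the cluster's data


def diff_dumps(old, new):
    """Split two monthly files (dicts keyed by cluster ID) into work lists."""
    old_ids, new_ids = set(old), set(new)
    common = old_ids & new_ids
    return {
        # VIAF cluster ID in old but not new
        "deletes": sorted(old_ids - new_ids),
        # VIAF cluster ID in new but not old
        "new": sorted(new_ids - old_ids),
        # same cluster ID, but sources were added or deleted
        "sources_changed": sorted(
            cid for cid in common if old[cid].sources != new[cid].sources
        ),
        # same cluster ID and same sources, but different content
        "contents_changed": sorted(
            cid for cid in common
            if old[cid].sources == new[cid].sources
            and old[cid].content != new[cid].content
        ),
    }


# Illustrative data only; the IDs below are invented.
previous = {
    "1": Cluster("1", frozenset({"BNF|a"}), "x"),
    "2": Cluster("2", frozenset({"DNB|b"}), "y"),
    "3": Cluster("3", frozenset({"LC|c"}), "z"),
}
current = {
    "2": Cluster("2", frozenset({"DNB|b", "BL|d"}), "y"),
    "3": Cluster("3", frozenset({"LC|c"}), "z2"),
    "4": Cluster("4", frozenset({"BNE|e"}), "w"),
}
work = diff_dumps(previous, current)
```

Keeping the four lists separate mirrors the slide's "separate files for processing": each kind of change can then be handled by its own downstream step.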

VIAF ingest into ISNI


Maintaining Clusters

Mixed identities

[Diagram: two kinds of mixed identity, a Cluster Error and a Source Error; record labels shown: Source 1, Source 2, Source 1]

End User Note

Dear Sir / Madam, The ISNI 0000000117488848 refers to "Marco Antonio Casanova", Professor at the Catholic University of Rio de Janeiro. I am not the author of "Fragmentos póstumos. - Nietzsche uma introdução filosófica" or "Segunda consideração intempestiva da utilidade e desvantagem da história para a vida". The author of these works is "Marco Antonio dos Santos Casa Nova". You may confirm this information by consulting our CVs at the Brazilian Research Council:

Marco Antonio Casanova (me): http://lattes.cnpq.br/0400232298849115
Marco Antonio dos Santos Casa Nova (the other author): http://lattes.cnpq.br/3409704326617178

Correction – Source Error

• Reply to End User

Thank you for using the ISNI database and suggesting improvements to your record. There is now another ISNI record for Marco Antonio dos Santos Casa Nova (ISNI 0000 0004 3077 6045). I have corrected your record, removed the erroneous titles and added a link to your online CV (Lattes database).

If you have any further queries, please let me know.

• Email to Source

I am part of the ISNI Quality Team (experts from the British Library and Bibliothèque nationale de France in charge of the quality of the ISNI database). We perform manual checking and corrections in the ISNI database such as splits, merges/deduplications and data corrections. The ISNI Quality Team received a request from an end user about ISNI records 0000 0001 1748 8848 and 0000 0004 3077 6045, VIAF 19998588 and their related

Authority record XXX 109895029 mixes 2 identities (see the snapshot below):

1/ Marco Antonio Casanova (ISNI 0000 0001 1748 8848)
2/ Nova, Marco Antonio dos Santos Casa (ISNI 0000 0004 3077 6045), Philosoph, and author of "Segunda consideração intempestiva da utilidade e desvantagem da história para a vida"

I hope this information will be useful.

[Diagram: Source 1 = Source ISNI, Source ISNI]

Correction – Cluster Error

• ISNI marks its two records as verified & sends to VIAF
• These records are given the same status as XA records in VIAF clustering
• No two XA records may occur in the same cluster
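
The rule stated above (no two XA records in one cluster) can be expressed as a simple validation pass over proposed clusters. This is a sketch only: the `xa_conflicts` function and the `(record_id, is_xa)` representation are hypothetical, and nothing here reflects how the VIAF clustering software is actually implemented. The identifiers reuse the example from the end user note purely for illustration.

```python
def xa_conflicts(clusters):
    """Return {cluster_id: [record IDs]} for clusters holding 2+ XA records.

    `clusters` maps a cluster ID to a list of (record_id, is_xa) pairs,
    where is_xa marks records that carry XA status (including ISNI
    records flagged as verified).
    """
    conflicts = {}
    for cluster_id, records in clusters.items():
        xa_ids = [record_id for record_id, is_xa in records if is_xa]
        if len(xa_ids) > 1:
            conflicts[cluster_id] = xa_ids
    return conflicts


# Hypothetical proposed clustering: two verified ISNI records in one cluster.
proposed = {
    "19998588": [
        ("ISNI|0000000117488848", True),
        ("ISNI|0000000430776045", True),
        ("XXX|109895029", False),
    ],
    "illustrative-ok": [
        ("ISNI|0000000080658419", True),
        ("SRC|1", False),
    ],
}
must_split = xa_conflicts(proposed)
```

A cluster reported by such a check would have to be split so that each verified record ends up in its own cluster, which is the effect the correction process relies on.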

[Diagram: Source ISNI, Source ISNI]

End User Note

• It seems 2 ISNIs have been assigned to the French singer Laika Fatien (born 1968 in Paris): ISNI 0000 0000 8065 8419 and ISNI 0000 0000 7238 637X. I think the last one can be deleted.

Correction – Merged duplicate

• Reply to End User

Thank you for using the ISNI database and providing us with information about the duplicate records for Laika Fatien.

There is now just one record on the ISNI database for this identity: ISNI 0000 0000 8065 8419.

If you have any further queries, please let me know.

• Notification to VIAF via ISNI record

• ISNI record contains verification note (i.e. treat as XA)

• ISNI record contains 2 VIAF cluster identifiers

[Diagram: VIAF A + VIAF B = one ISNI record carrying both VIAF A and VIAF B identifiers]

ISNI Quality Team

• Samples data regularly
  – c. 2% VIAF clusters have mixed identities
  – Duplicate clusters are higher, nearer 5%
• Makes corrections at cluster level
  – Merges, splits, error notifications
  – Access to cataloguing client / macros
• Makes system recommendations
• Gives approval for single source assignment
• Responds to End User input
• Sends emails to sources for error correction (12 VIAF sources currently participating)

ISNI System Notification (Push process)

[Diagram: notification messages pushed to sources: "Someone else has matched & details"; "You probably need to take action"]

ISNI Assignment Agency

• Matching, merging and splitting infrastructure
• Correction of errors
• Sampling and anomaly checks
  – e.g. date anomalies, unlikely mixture of sources
• Pseudonym splitting
• Re-importing and re-matching
• Diagnostic indexes and reports
• Enrichment
  – e.g. Wikipedia, Dewey
• Notification system
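
As an illustration of what a "date anomaly" check in the list above might look like, here is a minimal sketch. Everything in it is an assumption: the `date_anomalies` function, its parameters and the 110-year lifespan threshold are invented for this example and are not taken from the ISNI system. Works dated after death are deliberately not flagged, since posthumous publication is common.

```python
def date_anomalies(birth, death, work_years, max_lifespan=110):
    """Return a list of human-readable anomaly descriptions.

    birth and death are years or None; work_years is an iterable of
    publication years attached to the identity.
    """
    problems = []
    if birth is not None and death is not None:
        if death < birth:
            problems.append("death year precedes birth year")
        elif death - birth > max_lifespan:
            problems.append("lifespan exceeds %d years" % max_lifespan)
    if birth is not None:
        early = sorted(year for year in work_years if year < birth)
        if early:
            problems.append("works dated before birth: %s" % early)
    return problems


# A singer born in 1968 with recordings from 2008 and 2011 raises nothing;
# a title from 1850 attached to a person born in 1900 hints at a mixed identity.
clean = date_anomalies(1968, None, [2008, 2011])
suspect = date_anomalies(1900, None, [1850, 1950])
```

Records flagged this way would be candidates for review rather than automatic correction, in keeping with the sampling-and-review role described on the slide.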

VIAF ISNI Interoperability Task Force

• Met in Paris 22-23 April 2014
• Representatives from
  – Bibliothèque nationale de France
  – Biblioteca Nacional de España
  – British Library
  – Deutsche Nationalbibliothek
  – Sudoc
  – OCLC (VIAF system)
  – OCLC Leiden (ISNI Assignment Agency)

Recommendations to VIAF at OCLC

• Use profession and other disambiguating data
• Investigate making an anomaly report
• Investigate changing the clustering rules to flag and prevent a record with a mixed identity from entering the clusters where 2 or more sources have established separate identity
• Investigate changing the clustering rules to prevent duplicate clusters
• Provide deprecated VIAF IDs in the distributed data
• Treat records from ISNI that are flagged as manual as XA records
• Include ISNI in RDF
• Remove test from ISNI icon
• Only show one name form for ISNI in the wheel
• Investigate why SUDOC titles are not appearing

Recommendations to ISNI at OCLC

• Flag manual merges and splits (joint specification to be made)
• Indicate to VIAF that a VIAF source needs to be split from a VIAF cluster (joint specification to be made)
• Keep up to date with VIAF
• Produce anomaly reports
• Produce notifications to VIAF sources
• [Provide only one ISNI record per VIAF cluster ID; make split off records ISNI source]
• [Provide records with ISNI source to VIAF]

Recommendations to VIAF Council

• Mark undifferentiated authorities or consider not supplying them to VIAF
• Include nationality, particularly for own national identities
• Use VIAF in authority control and select VIAF cluster ID
  – Also use ISNI
• If a mixed identity is found in VIAF or ISNI, use either the public interface or [preferably] the member interface of ISNI to request resolution by the ISNI Quality Team. All manual corrections made in ISNI will come to VIAF as records with XA status to ensure merges or splits.


Become Involved

Jointly let’s maintain clusters

The ISNI Quality Team

• Board members are British Library and Bibliothèque nationale de France (representing CENL)
• Seeking Associate Members
  – KB, Netherlands in process
  – Control own identities
  – Access to client maintenance software
  – Access to restricted data
  – Provide back-up for end user responses

ISNI Members

• View whole database (but not restricted fields)
• Access to compare screen; can merge
• Reports on request
  – ISNIs: simple report or enhanced
  – Cluster movement report
  – Diagnostic reports
• Statistics and links

ISNI Database: Member view

Member view

Public view

Public view – only see assigned

Member view – list of additional data displayed (if not private)

• Related identities

• Related persons

• Related organisations

• Nationality

• Gender

• Keyword or key phrase

• Dewey classification

• Publisher

• Dates active

• Associated countries

• Provisional records
  – Including links to possible matches, if applicable

Private data
• Dates
• Personal Affiliations
• Titles of works

These can be masked from the public and from member view. However, most sources allow titles to be seen by other members to facilitate merging.

Do not merge anything that looks suspicious: report it in a general note and the Quality Team (QT) will review.

This title belongs to

This is not the same person

ISNI Statistics

Basic statistics

Cross matches

VIAF matches

La Trobe University: 1,864 VIAF Links

Linked Data: isni.org/isni/
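
ISNIs appear in this deck both with spaces (0000 0001 1748 8848) and without (0000000117488848). A linked-data URI needs one canonical form, so a small normalisation step is useful before building it. The sketch below assumes the compact 16-character form is appended to the base shown on the slide, and the `http://` scheme is an assumption; the function names are hypothetical. The last character of an ISNI is an ISO 7064 MOD 11-2 check character, which the sketch verifies.

```python
def isni_check_char(first15):
    """ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in first15:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def normalise_isni(text):
    """Strip spaces and hyphens, validate, and return the 16-character form."""
    compact = text.replace(" ", "").replace("-", "").upper()
    if len(compact) != 16 or not compact[:15].isdigit():
        raise ValueError("not a 16-character ISNI: %r" % text)
    if isni_check_char(compact[:15]) != compact[15]:
        raise ValueError("check character mismatch: %r" % text)
    return compact


def isni_uri(text, base="http://isni.org/isni/"):
    """Build a linked-data URI; the scheme in `base` is an assumption."""
    return base + normalise_isni(text)


# ISNIs quoted earlier in this deck.
uri = isni_uri("0000 0000 7238 637X")
compact = normalise_isni("0000 0001 1748 8848")
```

Validating the check character before building the URI catches most single-character typing errors in end user reports.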

Explore. Share. Magnify.

Janifer Gatenby
EMEA Program Manager Metadata

[email protected]