Archiving AV Materials FAIR |An Oral History Collection in ...€¦ · Archiving AV Materials FAIR...
Transcript of Archiving AV Materials FAIR |An Oral History Collection in ...€¦ · Archiving AV Materials FAIR...
dans.knaw.nlDANS is an institute of KNAW en NWO
Archiving AV Materials FAIR | An Oral History Collection in the Repository DANS-EASY
Eliane Fankhauser
Data Archiving and Networked Services (DANS)
15 May 2019, Archiving 2019, Arquivo Nacional da Torre do Tombo, Lisbon
Outline |
• Introduction • DANS and its repository EASY• FAIR and the FAIR Principles
• The FAIR Principles in Practice • Use case: Data collection “Journey of the Raid” (2014)• Is the collection
• Findable and Accessible?• Interoperable? • Reusable?
• Usage licenses and the GDPR
• Support for making data(sets) FAIR: 2 assessment tools • Take home message
DANS | The Institute
• DANS: Data Archiving and Networked Services• Data archive, supporting institute of KNAW, NWO• Founded in 2005, predecessors since 1964• Office in The Hague, ca. 50 employees• www.dans.knaw.nl
• Mission: to promote sustained access to digital research data files and encourage researchers to archive and reuse data
• Motto: access to digital research data should be “Open if possible, protected if necessary”
DANS | Services: Repository
• Electronic Archiving SYstem• Store and share data
sustainably upon completion of research
• Long-term storage• Software developed and
managed by DANS• www.easy.dans.knaw.nl
• Contains thousands of datasets (currently about 85,000)
Your 7 steps to sustainable data
Are you looking for sustainable storage of your research data? With EASY, the online archiving system from DANS, you can save your data in a secure and future-proof manner.
You yourself decide how your data will be accessible to others. At DANS, you set up your own digital archive in seven easy steps.
1. Prepare your data Select the relevant data files. Check them for privacy aspects and file
format issues against the guidelines issued by DANS.
2. Go to EASY Log in at http://easy.dans.knaw.nl. If you are new to EASY, you will have
to register for an account first.
3. Start the deposit procedure Go to ‘New deposit’, select your discipline and click ‘Start deposit’.
4. Documentation and access level Describe the dataset and indicate whether it is open access or conditionally accessible.
5. Upload your data files Select your data files and click ‘Upload dataset’.
6. Submit your data files Accept the license agreement and send your dataset to DANS by clicking
the ‘Submit’ button.
7. Publication by DANS DANS will verify the dataset and publish it with the access level set by you.
Your data have now been sustainably archived and will be accessible to others on a permanent basis.
More detailed instructionsScan this QR code with a smartphone to visit http://www.dans.knaw.nl/depositingdata for more information and detailed instructions for depositing data.
>>>
DANS | Other Services
• National Academic Research and Collaborations Information System (NARCIS)• Portal for access to information about researchers and their work• Software developed and managed by DANS• www.narcis.nl
• DataverseNL: repository for sharing and storing research data• Store and share data already during research • Intermediate storage, up to 10 years• Original software developed by the IQSS at Harvard, Dutch version
managed by DANS• 10+ participating institutions, most of them universities• www.dataverse.nl
EASY | AV Collections in EASY
• About 3000 AV datasets in EASY (total amount of datasets: ca. 85,000)
• Categoristaion “Oral History”• 294 datasets Open Access
EASY | Layout Jump-off Page
FAIR | FAIR Principles
• Findable, Accessible, Interoperable, Reusable • Term coming into existence in 2014-2016• Original goal: improvement of the reusability of research
data and develop interoperability in a bigger ecosystem • Set of measurable guidelines first published in 2016
FAIR | The 15 FAIR Principles
Accessible
Interoperable
Find
able
Reusable
F1. (Meta)data are assigned a globally unique and persistent identifierF2. Data are described with rich metadata (defined by R1 below)F3. Metadata clearly and explicitly include the identifier of the data they describeF4. (Meta)data are registered or indexed in a searchable resource
A1. (Meta)data are retrievable by their identifier using a standardised communications protocolA1.1 The protocol is open, free, and universally implementableA1.2 The protocol allows for an authentication and authorisation procedure, where necessaryA2. Metadata are accessible, even when the data are no longer available
I1. (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.I2. (Meta)data use vocabularies that follow FAIR principlesI3. (Meta)data include qualified references to other (meta)data
R1. Meta(data) are richly described with a plurality of accurate and relevant attributesR1.1. (Meta)data are released with a clear and accessible data usage licenseR1.2. (Meta)data are associated with detailed provenanceR1.3. (Meta)data meet domain-relevant community standards
+ additional metrics
FAIR | The 15 FAIR Principles
• Difference between human and machine readable principles • Focus on those principles which are most relevant for archiving of
AV materials:
1. F2. Data are described with rich metadata (defined by R1 below)2. I2. (Meta)data use vocabularies that follow FAIR principles3. R1. Meta(data) are richly described with a plurality of accurate
and relevant attributes4. R1.1. (Meta)data are released with a clear and accessible data
usage license5. R1.3. (Meta)data meet domain-relevant community standards
USE CASE | Erik de Jager: Journey of the Raid
• Thematic collection titled “Journey of the Raid” • 76 interviews (= 76 datasets) containing:
• The interview (streaming on EASY website)• Transcription of interview in pdf• Additional information (documents in various file formats)• Invisible to EASY user: informed contents of interviewer AND
interviewees • Metadata page (“description”)
USE CASE | Journey of the Raid
The project “The Journey of the Raid” is based on filmed testimonials from men who have experienced the raid and the subsequent journey, to fill a gap in the historiography and to provide insight into the events on the theme "Scope of action of an individual in a society under pressure. -Erik de Jager
• Sheds light on cordon built in Rotterdam by Germans in November 1944
• Over 50,000 inhabitants arrested and transported to Germany
• Forced labor
USE CASE | FAIR applied to Journey of the Raid
F2. Data are described with rich metadata (defined by R1 below)
Ø Are metadata and data findable by humans and machines?• Broad range of metadata including clear title, creator datasets,
contributors, temporal and spatial coverages, list of keywords ü FAIR Principle F2 is covered
USE CASE | FAIR applied to Journey of the Raid
I2. (Meta)data use vocabularies that follow FAIR principles
Ø Are so-called controlled vocabularies used?• No use of controlled vocabularies (in keywords) also due to lack
of existence of controlled vocabularies in history • Especially relevant for findability and interoperability of datasets
in different repositories
USE CASE | FAIR applied to Journey of the Raid
R1.3. (Meta)data meet domain-relevant community standards
Ø Are standards in the field like descriptions, temporal and spatial coverages, file formats and protocols used? • No national or international standards established in the field of
history
USE CASE | FAIR applied to Journey of the Raid
File formats
• Crucial for long-term preservation • List of preferred formats provided by most data repositories ü Formats of De Jager’s interviews (.mov, QuickTime) today
non-preferred in EASY but used to be preferred at time of deposit
USE CASE | FAIR applied to Journey of the Raid
R1.1. (Meta)data are released with a clear and accessible data usage license
Ø Under which circumstances and how may data be reused? ü Journey of the Raid is open access • Possible because informed consents of interviewers and
interviewees are present • Usage licenses in EASY currently under review because of GDPR • How open data is published depends on sensitivity and informed
consents
More FAIR | Trustworthy Digital Repositories
• Covered 5 out of 15 principles – only? • Choice of repository crucial for FAIRness of datasets • CoreTrustSeal (CTS) Certification • Requirements are FAIR to a certain extent à CTS-certified
repositories offer a basic level of FAIR
Tools | FAIRdat tool
• Developed at DANS in 2016-17• Questionnaire, yes-no questions • Targeted users: data managers / curators and researchers • Evaluation of FAIRness of any datasets deposited in a
repository • Score: 5-star rating for F, A and I • Beta version
Tools | FAIR checklist
• Developed at DANS in 2018• Checklist, yes-no questions • Evaluation of FAIRness of data(sets) before deposition• Targeted users: researchers • Score: points • Beta version
Take home message
FAIR aspects important for archiving AV materials:
1. Providing rich metadata and additional documentation2. Existence of informed consents3. Using community-specific vocabularies 4. Storing AV files in sustainable formats
dans.knaw.nlDANS is an institute of KNAW en NWO
Thank you for your attention!
LinkedIn: /elianefankhauser
More infowww.dans.knaw.nl