Infrastructure and technology challenges in Open Access to research data – The RECODE Project...

37
Infrastructure and technology challenges in Open Access to research data – The RECODE Project perspective LORENZO BIGAGLI AND THE RECODE TEAM CNR IIA (FLORENCE DIVISION ) [email protected] Centre International de Conférences de Genève (CICG) 17, rue de Varembé Geneva (Switerland) Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial Summit

description

Infrastructure and technology challenges in Open Access to research data – The RECODE Project perspective. Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial Summit. Florence Division. Centre International de Conférences de Genève (CICG) - PowerPoint PPT Presentation

Transcript of Infrastructure and technology challenges in Open Access to research data – The RECODE Project...

Page 1: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

Infrastructure and technology challenges in Open Access to research data –The RECODE Project perspectiveLORENZO BIGAGLI AND THE RECODE TEAMCNR I IA (FLORENCE DIVISION )

[email protected]

Centre International de Conférences de Genève (CICG)17, rue de VarembéGeneva (Switerland)

Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial

Summit

Page 2: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

OutlineIntroduction

◦The RECODE ProjectKey findings on Infrastructure and Technology challenges◦Online questionnaire, case studies interviews, literature review, workshop

Contacts and next dates

Page 3: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

The RECODE projectPolicy RECommendations for Open access to research Data in Europe (RECODE)1 February 2013 – 31 January 2015 (24 months)Eight partners across five countries

Grant agreement no: 321463

Page 4: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Objectives

Reduce stakeholder fragmentationIdentify stakeholder values &

inter-relationships

Identify gaps, tensions & good practice solutions

Identify and promote over-arching good practice policy solutions

Evaluate grand challenges using 5

case studies

Produce policy guidelines

Page 5: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Case Studies

Physics

Particle Physics and Particle Astrophysics (PPPA) Group

Health

FP7 Project EVA (Markers for emphysema

versus airway disease in

COPD)

Bioengineering

Bioengineering Institute,

Auckland NZ, and Virtual

Physiological Human (VPH) Community

Earth Sciences

Global Earth Observation

System of Systems (GEOSS)

Archaeology

Open Context

Page 6: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Grand challenges

Stakeholder values & inter-relationships

Infrastructure & technology

Legal and ethical issues

Institutional and policy issues

Page 7: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

The GEOSS lesson Global Earth Observation System of Systems◦ EuroGEOSS◦ EGIDA

◦ Coordinating Earth and Environmental cross-disciplinary projects to promote GEOSS

◦ EGIDA Methodology

Page 8: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

EGIDA Methodology

General approach for (re-) engineering national Science & Technology infrastructures for a sustainable contribution to GEOSS◦ System of Systems (SoS)

approach◦ Mobilization of resources made

available by national, European and global initiatives

Page 9: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

RECODE Stakeholder Taxonomy

We congregate the disseminator and curator roles

◦ We assume they share similar concerns for our purpose

Citizens may be considered as research data user/producers (cf. Citizen Science)

◦ We assume their involvement in use and production of data is mediated by appropriate applications (e.g. mobile apps) that practically isolate them from the implied technological and infrastructural issues

Page 10: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaireClosed, concise questions~50 responses so farStill open

◦ http://goo.gl/raafCF

Page 11: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 12: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

InterviewsExperience with OA publications, but not with data publications or data preservationMetadata are considered crucial to enable retrieval, re-use and preservation of research dataFinancial and legal barriers are considered higher priority then technical onesData management plans are being developed, but still at an early stageSolutions for data management and preservation are not common nor centralized

Page 13: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

EC Public Consultation on Open Research Data2nd July 2013, in Brussels~130 attendants stakeholders from the research community, industry, funders, libraries, publishers, infrastructure developers, etc.Focused on five questions

◦ How can we define research data and what types of research data should be open?

◦ When and how does openness need to be limited?◦ How should the issue of data re-use be addressed?◦ Where should research data be stored and made accessible?◦ How can we enhance data awareness and a culture of sharing?

Page 14: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

H2020 Pilot on Open Access to Research DataRequirements

◦ Detailed data management plan covering individual datasets (within 6 months)

◦ Deposit the research data, preferably into a research data repository ◦ Take measures to enable for third parties to access, mine, exploit,

reproduce and disseminate (free of charge for any user) this research data◦ E.g., Creative Commons License (CC-BY, CC0)

◦ Provide information about tools and instruments at the disposal of the beneficiaries and necessary for validating the results◦ E.g., specialized software, algorithms, analysis protocols. Where possible, they should

provide the tools and instruments themselves

Page 15: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Big DataThe Big Data issue (the 4 V’s)

◦ Volume - data at rest◦ Velocity - data in motion (streaming)◦ Variety - many types, forms and structures or no structures◦ Veracity - trustworthiness, provenance, lineage, quality◦ Validity - data that is correct◦ Visualization - data in patterns◦ Vulnerability - data at risk◦ Value - data that is meaningful

Page 16: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Workshop questionsWhere should open research data be stored and made accessible?How can we mitigate the technological barriers to Open Access to research Data?How can we cope with technological sustainability and obsolescence?What emerging technologies could be optimized to promote ease of deposit and retrieval of research data?

Page 17: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Infrastructure and Technology challenges – Overview

Creator Disseminator / Curator

Funder End user

Heterogeneity standardization, semantics

interoperability, reuse, organizational arrangements, i18n

discoverability, accessibility, usability

Sustainability obsolescence, persistence (PID), preservation, reuse

preservation, persistence, energy footprint

obsolescence

Volume bandwidth bandwidth, storage, scalability (data deluge)

scalability discoverability, accessibility, bandwidth

Quality description, training, fitness for use

description, completeness, peer review

certification description, peer review, fitness for use

Security authorization, licensing

accounting licensing authentication, privacy, trust

Page 18: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Inputs from the workshop

Importance of metadata◦ For quality, etc.

Heterogeneity is here to stay◦ Heterogeneity of user requirements makes it difficult to identify a unique

solution to be adopted

To build on the existing research infrastructure, supplementing and not supplanting it

To be flexible applying extensible technological and organizational solutions

To share does not mean to give everything to everyone◦ To distinguish among different approaches to Open Access

To discuss new professional roles stemming from the Data Science and Open Data in particular

Page 19: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Contacts and next dates http://recodeproject.eu/ @RECODE_Project

3rd RECODE Workshop “Ethical and legal issues in open access to research data”◦ 14 March 2014 ◦ Meertens Institute, Joan Muyskenweg 25, 1096 CJ Amsterdam (NL)

Page 20: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Fill in our questionnaire!

http://goo.gl/raafCF

Page 21: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Visit us at the Exhibition!RECODE booth (booth 15)

GEO Members Area

GEO Participant Organisations Area

GEO Projects Area

European Commission: RTD + JRC

European Commission: Copernicus

ESA

GCI Test AreaRECODE booth

Page 22: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Contribute to our session!European Geosciences Union General Assembly Meeting 2014◦ Vienna, 27 April-2 May 2014

ESSI 2.13 - Open Access to Research Data and Public Sector Information: perspective, drivers, and barriers◦ Convener: Lorenzo Bigagli ◦ Co-Conveners: Stefano Nativi , Paolo Mazzetti , Kush Wadhwa , José

Miguel Rubio Iglesias

Call for Abstract deadline:◦ January 16th 2014, 13:00 CET

Page 23: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

Infrastructure and technology challenges in Open Access to research data – the RECODE Project perspectiveLORENZO BIGAGLI AND THE RECODE TEAMCNR-I IA (FLORENCE DIVISION )

[email protected]

Centre International de Conférences de Genève (CICG)17, rue de VarembéGeneva (Switerland)

Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial

Summit

Page 24: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

Extra slides

Page 25: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 26: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 27: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 28: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 29: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 30: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 31: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 32: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 33: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

Online questionnaire

Page 34: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

EC Public Consultation on Open Research DataHow can we define research data and what types of research data should be open?◦ Definitions vary, with some contributions defining research

data as potentially all data (including public sector information), and some limiting it to data that is the product of research◦ For researchers, research data includes all data from an experiment,

study or measurement, including metadata and details on processing data

◦ For publishers, data linked to publications is part of the publication

Page 35: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

EC Public Consultation on Open Research DataHow should the issue of data re-use be addressed?

◦ Discussions about licensing, but also about technical aspects of open research data◦ Not just on whether and how data should be re-used, but also on the

adequacy of e-infrastructures for data re-use◦ Directive on the re-use of public sector information

(2003/98/EC, currently under revision) was mentioned several times◦ While public sector information (PSI) is distinct from research data and

governed by a specific directive, it is important to remember that this type of information can also be useful for research

Page 36: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

EC Public Consultation on Open Research DataWhere should research data be stored and made accessible?

◦ The need for improved data management practices and better data accessibility is a key concern

◦ Issues closely linked with data preservation and sustainability of data repositories

◦ The readiness of professionals to engage in data curation was also highlighted◦ All stakeholders agreed that any funding body policy on open research data must call on

researchers to take the issue of data management seriously by developing data management plans (DMPs) for their research projects

Page 37: Infrastructure and technology challenges in Open Access to research data  – The  RECODE Project perspective

[email protected] X Plenary (Geneva, 14-16 Jan 2014)

H2020 Guidelines on Data ManagementScientific research data should be easily

◦ Discoverable◦ Accessible◦ Assessable and intelligible◦ Useable beyond the original purpose for which it was collected◦ Interoperable to specific quality standards