OSGIS Conference: report on RDA/MPG Science workshop

17
06/05/14 RDA-Europe Forum 1 1 st Science Workshop on Data 10-11 February, Munich, Germany Herman Stehouwer Thanks to Raphael Ritz for the slides Garching Compute Center, Max-Planck-Society, Germany

description

Short presentation reporting on the RDA Europe and MPG science workshop on data

Transcript of OSGIS Conference: report on RDA/MPG Science workshop

Page 1: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe Forum 106/05/14

1st Science Workshop on Data10-11 February, Munich, Germany

Herman Stehouwer

Thanks to Raphael Ritz for the slides

Garching Compute Center, Max-Planck-Society, Germany

Page 2: OSGIS Conference: report on RDA/MPG Science workshop

2

Science Workshop on Data

February 10-11, 2014, in Munich

15 leading scientists + data practitioners

Hosted by RDA-E and MPG

Key points:Rewards (“what’s in there for me?”)

Trust (“are these data and metadata correct?”)

Persistence (“what will be in 20 years?”)

Report available

RDA-Europe/MPG Science Workshop06/05/14

Page 3: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 3

Participants - Scientists• Bernard Schutz - Gravitational Physics - Cardiff U / MPG • Bruce Allen - Gravitational Physics - MPI for Gravitationphysics, Hannover • Bruno Leibundgut - Astronomy - ESO, Garching • Cécile Callou - Archaezoology/Biodiversity-Ecology-Environment - Museum d'Histoire

Naturelle, Paris • Christine Gaspin - Bio-Informatics - INRA, Toulouse • Dick Dee - Meteorology - ECMWF • Jan Bjaalie - Neuroanatomy and Computer Science - University of Oslo• Francoise Genova – Astronomy – CNRS, Strasbourg• Jochem Marotzke - Climate Model - MPI for Meteorology, Hamburg • Manfred Laubichler - History of Science - New Mexico University • Marc Brysbaert - Psychology - Ghent University • Mark Hahnel - Biology - Figshare and Imperial College, London• Markku Kulmala - Atmospheric Sciences - University of Helsinki • Peter Coveney - Chemistry, biomedecine - UCL, London • Stefano Nativi – Earth System Science and Environmental Technologies – CNR, Rome

06/05/14

Page 4: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 4

Participants - Experts

• Carlos Morais-Pires - e-Infrastructures - European Commission • Donatella Castelli - RDA/E Member/Computer Science - ISTI-CNR, Pisa • Frank Sander - MPDL Director - MPDL, Munich • Herman Stehouwer - RDA Secretariat - MPI for Psycholinguistics, Nijmegen • Leif Laaksonen - RDA/E Coordinator - CSC, Helsinki • Peter Wittenburg - RDA-TAB Member/Linguistics - MPI for

Psycholinguistics, Nijmegen • Ramin Yahyapour - GWDG Director - GWDG, Göttingen • Raphael Ritz - RDA/E Member/NeuroInformatics - MPG, Garching • Stefan Heinzel - RZG Director - MPG, Garching • Reinhard Budich – Data Scientist – MPI Meterology, Hamburg • Riam Kanso – Data Policies/Cognitive Neuroscience – UCL, London• Ari Asmi – Atmospheric Sciences – University of Helsinki

06/05/14

Page 5: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 5

Multiple Dimensions

06/05/14

Page 6: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 6

Observations

Sharing and Re-use of Data

Publishing and Citing Data

Infrastructure and Repositories

Leading to

Recommendations

06/05/14

Page 7: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 7

Sharing and Re-use of Data

Still in its infancy

Lack of efficient, cross-disciplinary methods enabling re-use

Reference data is costly to establish and maintain

Trust needed in the identity, integrity, authenticity of data and the seriousness of all actors involved

06/05/14

Page 8: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 8

Publishing and Citing Data

Core of the scientific process

Referencing data (e.g. via PIDs) must be stable

Reproducible Science

Quality assessment (e.g. peer review)

Career building

Suitable infrastructure needed

06/05/14

Page 9: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 9

Infrastructure and Repositories

They are desperately needed

Researchers in the driving seat

Limits to Open Access

Services on data

Legacy data and systems

Companies more cost effective?

Registries and catalogues needed

06/05/14

Page 10: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 10

Recommendations to RDA

Come up with recommendations and specifications to overcome the one-shot, point solutions of today

Be really bottom-up

Engage a “middle layer” of data scientists

Compete or cooperate with commercial players

06/05/14

Page 11: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 11

Expectations towards RDA

Invest in training younger generations of data scientists

Push demo projects, act as clearing house, provide advice on data management, access and re-use

Have data experts visit and support institutes

Perform good quality assessment on own results

Don’t oversell

06/05/14

Page 12: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 12

A New Way of Doing Science?

06/05/14

https://materialsproject.org/escience

Page 13: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 13

Challenges of Data Intensive Science

Large Volumes

Accumulated at high rate (Velocity)

Integration across a Variety of sources

Trust/Validity/Veracity

This has implications on data capture, curation, storage, search, sharing, transfer, analysis and visualization

06/05/14

Page 14: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 14

Data Sharing != Open Access

Data Sharing starts in your own laboratory.

It continues with collaborators, consortia, peers, the scientific communities and maybe the general public.

06/05/14

Page 15: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 15

Are you willing to take the time to

prepare your data for sharing?

Is there somewhere for you to put your

data?

Can I read your data?

If so, can I understand it?(e.g., how it was acquired and

analysed?)

Will it be possible to still understand the data in

the future?

Making data sharing feasible

Develop methods to ensure

interoperability

Develop standardised

and comprehensive

metadata schemes

Develop automated methods for metadata

capture

06/05/14

Page 16: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 16

A Lot of Work

Managing data in a lab requires increasing amounts of time by the scientists involved.

Preparing for sharing requires additional work.

How can we make this as effective as possible?

RDA’s idea: adopt a bottom-up approach towards Research Data Sharing without barriers.

06/05/14

Page 17: OSGIS Conference: report on RDA/MPG Science workshop

RDA-Europe/MPG Science Workshop 17

Future

2015: CERN

2016: CNRS

Follow-up workshopsContinue Discussion

Deepen Themes

06/05/14